Recommended Readings
Most of my blog posts state their prerequisites at the beginning. If you are unfamiliar with any of them, please refer to the following table for recommended readings.
I only list references that I found helpful, so the table is neither exhaustive nor objective, and you might find better references. If you do, please let me know! I update the table regularly.
Topic | Light introduction | Deep dive |
---|---|---|
A/B testing | Vincent Vanhoucke's blog post on A/B testing | |
AI alignment problem | See "Friendly AI" | See "Friendly AI" |
AI safety | See "Friendly AI" | See "Friendly AI" |
automatic differentiation | Ari Seff's video | |
Bayesian statistics | MacKay | |
classical mechanics | Landau and Lifshitz | |
convex optimization | Intelligent Systems Lab's video | |
deep implicit layers | The NeurIPS 2020 tutorial | |
finite factored sets | Scott Garrabant's lecture | |
friendly AI | Bostrom; Eliezer Yudkowsky's presentation | Leike et al.; Everitt et al.; alignmentforum.org * |
functional decision theory | Yudkowsky and Soares | |
game theory | Leyton-Brown and Shoham * | |
Gaussian processes | David MacKay's lecture (slides and alternative upload here) | Rasmussen and Williams * |
information theory | MacKay | |
integral equations | Wazwaz * | |
LSTMs | Christopher Olah's blog post | |
model compression | Sam Sučík's blog post | |
neural differential equations | Kidger * | |
neural networks | 3Blue1Brown's series of videos | Roberts, Yaida, and Hanin; Off the Convex Path blog * |
neural processes | Garnelo et al. | |
object-oriented programming | Graham's blog post | |
reinforcement learning | David Silver's lecture series; Berkeley deep reinforcement learning lecture series * | Sutton and Barto |
relativity | Einstein's wonderful nearly-equation-free book | Wald; Misner et al.; Penrose and Rindler |
transformers | Dinan et al. | |
writing | George Orwell's essay; John Wentworth's post on quick and rigorous writing; Michael Nielsen's post on Discovery Fiction | Douglas |
Sources marked with “*” are those I have read only partially.
References
- Bostrom, N. Superintelligence: paths, dangers, strategies. (Oxford University Press, 2014).
- Dinan, E., Yaida, S. & Zhang, S. Effective Theory of Transformers at Initialization. Preprint at https://doi.org/10.48550/arXiv.2304.02034 (2023).
- Douglas, Y. The Reader's Brain: How Neuroscience Can Make You a Better Writer. (Cambridge University Press, 2015).
- Einstein, A. Relativity: the special and general theory. (2005).
- Everitt, T., Lea, G. & Hutter, M. AGI Safety Literature Review. arXiv:1805.01109 [cs] (2018).
- Kidger, P. On Neural Differential Equations. arXiv:2202.02435 [cs, math, stat] (2022).
- Landau, L. D. & Lifshitz, E. M. Mechanics. (Elsevier, 1982).
- Leike, J. et al. Scalable agent alignment via reward modeling: a research direction. arXiv:1811.07871 [cs, stat] (2018).
- Leyton-Brown, K. & Shoham, Y. Essentials of Game Theory: A Concise Multidisciplinary Introduction. Synthesis Lectures on Artificial Intelligence and Machine Learning 2, 1–88 (2008).
- Misner, C. W., Thorne, K. S. & Wheeler, J. A. Gravitation. (W. H. Freeman, 1973).
- Penrose, R. & Rindler, W. Spinors and space-time. vol. 1 (Cambridge University Press, 1984).
- Rasmussen, C. E. & Williams, C. K. I. Gaussian processes for machine learning. (MIT Press, 2008).
- Roberts, D. A., Yaida, S. & Hanin, B. The Principles of Deep Learning Theory. (2021).
- Wald, R. M. General relativity. (University of Chicago Press, 1984).
- Wazwaz, A.-M. Linear and nonlinear integral equations: methods and applications. (Higher Education Press, 2011).
20to%20the%20agent%20alignment%20problem%3A%20how%20do%20we%20create%20agents%20that%20behave%20in%20accordance%20with%20the%20user%27s%20intentions%3F%20We%20outline%20a%20high-level%20research%20direction%20to%20solve%20the%20agent%20alignment%20problem%20centered%20around%20reward%20modeling%3A%20learning%20a%20reward%20function%20from%20interaction%20with%20the%20user%20and%20optimizing%20the%20learned%20reward%20function%20with%20reinforcement%20learning.%20We%20discuss%20the%20key%20challenges%20we%20expect%20to%20face%20when%20scaling%20reward%20modeling%20to%20complex%20and%20general%20domains%2C%20concrete%20approaches%20to%20mitigate%20these%20challenges%2C%20and%20ways%20to%20establish%20trust%20in%20the%20resulting%20agents.%22%2C%22date%22%3A%222018-11-19%22%2C%22language%22%3A%22%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1811.07871%22%2C%22collections%22%3A%5B%22GM6225YA%22%5D%2C%22dateModified%22%3A%222019-03-09T09%3A41%3A41Z%22%7D%7D%2C%7B%22key%22%3A%22DX2QCEFW%22%2C%22library%22%3A%7B%22id%22%3A543838%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Garnelo%20et%20al.%22%2C%22parsedDate%22%3A%222018-07-04%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%202%3B%20%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%20style%3D%5C%22clear%3A%20left%3B%20%5C%22%3E%5Cn%20%20%20%20%3Cdiv%20class%3D%5C%22csl-left-margin%5C%22%20style%3D%5C%22float%3A%20left%3B%20padding-right%3A%200.5em%3B%20text-align%3A%20right%3B%20width%3A%201em%3B%5C%22%3E1.%3C%5C%2Fdiv%3E%3Cdiv%20class%3D%5C%22csl-right-inline%5C%22%20style%3D%5C%22margin%3A%200%20.4em%200%201.5em%3B%5C%22%3EGarnelo%2C%20M.%20%3Ci%3Eet%20al.%3C%5C%2Fi%3E%20Neural%20Processes.%20%3Ci%3EarXiv%3A1807.01622%20%5Bcs%2C%20stat%5D%3C%5C%2Fi%3E%20%282018%29.%3C%5C%2Fdiv%3E%5Cn%20%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%
22%2C%22title%22%3A%22Neural%20Processes%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marta%22%2C%22lastName%22%3A%22Garnelo%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jonathan%22%2C%22lastName%22%3A%22Schwarz%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Dan%22%2C%22lastName%22%3A%22Rosenbaum%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Fabio%22%2C%22lastName%22%3A%22Viola%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Danilo%20J.%22%2C%22lastName%22%3A%22Rezende%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22S.%20M.%20Ali%22%2C%22lastName%22%3A%22Eslami%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yee%20Whye%22%2C%22lastName%22%3A%22Teh%22%7D%5D%2C%22abstractNote%22%3A%22A%20neural%20network%20%28NN%29%20is%20a%20parameterised%20function%20that%20can%20be%20tuned%20via%20gradient%20descent%20to%20approximate%20a%20labelled%20collection%20of%20data%20with%20high%20precision.%20A%20Gaussian%20process%20%28GP%29%2C%20on%20the%20other%20hand%2C%20is%20a%20probabilistic%20model%20that%20defines%20a%20distribution%20over%20possible%20functions%2C%20and%20is%20updated%20in%20light%20of%20data%20via%20the%20rules%20of%20probabilistic%20inference.%20GPs%20are%20probabilistic%2C%20data-efficient%20and%20flexible%2C%20however%20they%20are%20also%20computationally%20intensive%20and%20thus%20limited%20in%20their%20applicability.%20We%20introduce%20a%20class%20of%20neural%20latent%20variable%20models%20which%20we%20call%20Neural%20Processes%20%28NPs%29%2C%20combining%20the%20best%20of%20both%20worlds.%20Like%20GPs%2C%20NPs%20define%20distributions%20over%20functions%2C%20are%20capable%20of%20rapid%20adaptation%20to%20new%20observations%2C%20and%20can%20estimate%20the%20uncertainty%20in%20their%20predictions.%20Like%20NNs%2C%20NPs%20are%20computationally%20efficient%20during%20training%20and%20evaluation%20but%20a
lso%20learn%20to%20adapt%20their%20priors%20to%20data.%20We%20demonstrate%20the%20performance%20of%20NPs%20on%20a%20range%20of%20learning%20tasks%2C%20including%20regression%20and%20optimisation%2C%20and%20compare%20and%20contrast%20with%20related%20models%20in%20the%20literature.%22%2C%22date%22%3A%222018-07-04%22%2C%22language%22%3A%22%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1807.01622%22%2C%22collections%22%3A%5B%223D85EIQF%22%5D%2C%22dateModified%22%3A%222019-03-09T05%3A52%3A01Z%22%7D%7D%2C%7B%22key%22%3A%22UKNE7FJR%22%2C%22library%22%3A%7B%22id%22%3A543838%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Sutton%20and%20Barto%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%202%3B%20%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%20style%3D%5C%22clear%3A%20left%3B%20%5C%22%3E%5Cn%20%20%20%20%3Cdiv%20class%3D%5C%22csl-left-margin%5C%22%20style%3D%5C%22float%3A%20left%3B%20padding-right%3A%200.5em%3B%20text-align%3A%20right%3B%20width%3A%201em%3B%5C%22%3E1.%3C%5C%2Fdiv%3E%3Cdiv%20class%3D%5C%22csl-right-inline%5C%22%20style%3D%5C%22margin%3A%200%20.4em%200%201.5em%3B%5C%22%3ESutton%2C%20R.%20S.%20%26amp%3B%20Barto%2C%20A.%20G.%20%3Ci%3EReinforcement%20Learning%3A%20An%20Introduction%3C%5C%2Fi%3E.%3C%5C%2Fdiv%3E%5Cn%20%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Reinforcement%20Learning%3A%20An%20Introduction%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Richard%20S.%22%2C%22lastName%22%3A%22Sutton%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Andrew%20G.%22%2C%22lastName%22%3A%22Barto%22%7D%5D%2C%22abstractNote%22%3A%22%22%2C%22date%22%3A%22%22%2C%22language%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fincompleteideas.net%5C%2Fbook%5C%2Fthe-book-2nd.html%22%2C%22collections%22%3A
%5B%22LZIQYU5Q%22%5D%2C%22dateModified%22%3A%222019-03-07T07%3A14%3A16Z%22%7D%7D%2C%7B%22key%22%3A%22GDT9AQ2C%22%2C%22library%22%3A%7B%22id%22%3A543838%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Yudkowsky%20and%20Soares%22%2C%22parsedDate%22%3A%222017-10-13%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%202%3B%20%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%20style%3D%5C%22clear%3A%20left%3B%20%5C%22%3E%5Cn%20%20%20%20%3Cdiv%20class%3D%5C%22csl-left-margin%5C%22%20style%3D%5C%22float%3A%20left%3B%20padding-right%3A%200.5em%3B%20text-align%3A%20right%3B%20width%3A%201em%3B%5C%22%3E1.%3C%5C%2Fdiv%3E%3Cdiv%20class%3D%5C%22csl-right-inline%5C%22%20style%3D%5C%22margin%3A%200%20.4em%200%201.5em%3B%5C%22%3EYudkowsky%2C%20E.%20%26amp%3B%20Soares%2C%20N.%20Functional%20Decision%20Theory%3A%20A%20New%20Theory%20of%20Instrumental%20Rationality.%20%3Ci%3EarXiv%3A1710.05060%3C%5C%2Fi%3E%20%282017%29.%3C%5C%2Fdiv%3E%5Cn%20%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Functional%20Decision%20Theory%3A%20A%20New%20Theory%20of%20Instrumental%20Rationality%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Eliezer%22%2C%22lastName%22%3A%22Yudkowsky%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nate%22%2C%22lastName%22%3A%22Soares%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20describes%20and%20motivates%20a%20new%20decision%20theory%20known%20as%20functional%20decision%20theory%20%28FDT%29%2C%20as%20distinct%20from%20causal%20decision%20theory%20and%20evidential%20decision%20theory.%20Functional%20decision%20theorists%20hold%20that%20the%20normative%20principle%20for%20action%20is%20to%20treat%20one%27s%20decision%20as%20the%20output%20of%20a%20fixed%20mathematical%20function%20that%20answers%20the%20question%2C%20%5C%22Which%20output%20of%20this%20very%20function%20woul
d%20yield%20the%20best%20outcome%3F%5C%22%20Adhering%20to%20this%20principle%20delivers%20a%20number%20of%20benefits%2C%20including%20the%20ability%20to%20maximize%20wealth%20in%20an%20array%20of%20traditional%20decision-theoretic%20and%20game-theoretic%20problems%20where%20CDT%20and%20EDT%20perform%20poorly.%20Using%20one%20simple%20and%20coherent%20decision%20rule%2C%20functional%20decision%20theorists%20%28for%20example%29%20achieve%20more%20utility%20than%20CDT%20on%20Newcomb%27s%20problem%2C%20more%20utility%20than%20EDT%20on%20the%20smoking%20lesion%20problem%2C%20and%20more%20utility%20than%20both%20in%20Parfit%27s%20hitchhiker%20problem.%20In%20this%20paper%2C%20we%20define%20FDT%2C%20explore%20its%20prescriptions%20in%20a%20number%20of%20different%20decision%20problems%2C%20compare%20it%20to%20CDT%20and%20EDT%2C%20and%20give%20philosophical%20justifications%20for%20FDT%20as%20a%20normative%20theory%20of%20decision-making.%22%2C%22date%22%3A%222017-10-13%22%2C%22language%22%3A%22%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1710.05060%22%2C%22collections%22%3A%5B%223D85EIQF%22%2C%22LZIQYU5Q%22%5D%2C%22dateModified%22%3A%222019-03-05T13%3A48%3A12Z%22%7D%7D%2C%7B%22key%22%3A%22IGHBLB3S%22%2C%22library%22%3A%7B%22id%22%3A543838%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22MacKay%22%2C%22parsedDate%22%3A%222003%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%202%3B%20%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%20style%3D%5C%22clear%3A%20left%3B%20%5C%22%3E%5Cn%20%20%20%20%3Cdiv%20class%3D%5C%22csl-left-margin%5C%22%20style%3D%5C%22float%3A%20left%3B%20padding-right%3A%200.5em%3B%20text-align%3A%20right%3B%20width%3A%201em%3B%5C%22%3E1.%3C%5C%2Fdiv%3E%3Cdiv%20class%3D%5C%22csl-right-inline%5C%22%20style%3D%5C%22margin%3A%200%20.4em%200%201.5em%3B%5C%22%3EMacKay%2C%20D.%20J.%20C.%20%3Ci%3EInformation%20Theory%2C%20Inf
erence%2C%20and%20Learning%20Algorithms%3C%5C%2Fi%3E.%20%28Cambridge%20University%20Press%2C%202003%29.%3C%5C%2Fdiv%3E%5Cn%20%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Information%20Theory%2C%20Inference%2C%20and%20Learning%20Algorithms%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%20J.%20C.%22%2C%22lastName%22%3A%22MacKay%22%7D%5D%2C%22abstractNote%22%3A%22%22%2C%22date%22%3A%222003%22%2C%22language%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%22GM6225YA%22%5D%2C%22dateModified%22%3A%222018-05-22T10%3A01%3A40Z%22%7D%7D%5D%7D
Dinan, E., Yaida, S. & Zhang, S. Effective Theory of Transformers at Initialization. Preprint at https://doi.org/10.48550/arXiv.2304.02034 (2023).
Roberts, D. A., Yaida, S. & Hanin, B. The Principles of Deep Learning Theory. (2021).
Wazwaz, A.-M. Linear and nonlinear integral equations: methods and applications. (Higher Education Press, 2011).
Kidger, P. On Neural Differential Equations. arXiv:2202.02435 [cs, math, stat] (2022).
Douglas, Y. The Reader’s Brain: How Neuroscience Can Make You a Better Writer. (Cambridge University Press, 2015).
Wald, R. M. General relativity. (University of Chicago Press, 1984).
Misner, C. W., Thorne, K. S. & Wheeler, J. A. Gravitation. (W. H. Freeman, 1973).
Einstein, A. Relativity: the special and general theory. (2005).
Penrose, R. & Rindler, W. Spinors and space-time. vol. 1 (Cambridge University Press, 1984).
Landau, L. D. & Lifshitz, E. M. Mechanics. (Elsevier, 1982).
Leyton-Brown, K. & Shoham, Y. Essentials of Game Theory: A Concise Multidisciplinary Introduction. Synthesis Lectures on Artificial Intelligence and Machine Learning 2, 1–88 (2008).
Rasmussen, C. E. & Williams, C. K. I. Gaussian processes for machine learning. (MIT Press, 2008).
Everitt, T., Lea, G. & Hutter, M. AGI Safety Literature Review. arXiv:1805.01109 [cs] (2018).
Bostrom, N. Superintelligence: paths, dangers, strategies. (Oxford University Press, 2014).
Leike, J. et al. Scalable agent alignment via reward modeling: a research direction. arXiv:1811.07871 [cs, stat] (2018).
Garnelo, M. et al. Neural Processes. arXiv:1807.01622 [cs, stat] (2018).
Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction.
Yudkowsky, E. & Soares, N. Functional Decision Theory: A New Theory of Instrumental Rationality. arXiv:1710.05060 (2017).
MacKay, D. J. C. Information Theory, Inference, and Learning Algorithms. (Cambridge University Press, 2003).