- Thinking on the Information Plane
- We think in theories; the universe runs on laws
- Laws describe what happens; theories try to explain why
- Examples of laws vs. theories
- Superiority of laws
- generalize
- extrapolate to novel circumstances
- Conservation laws are a recurring motif
- conservation of matter, momentum, charge, energy
- describe conservation of energy in terms of free energy
- Free energy minimization is an overarching, unifying theme of nature
- classical physics
- dynamic chemical equilibrium
- blood sugar
- neuronal free energy signaling hypothesis
- cost estimation (cognitive psychology)
- even intelligence
- Information theory applies energy minimization to probabilistic systems
- Introduce self information with a distribution
- Entropy, cross-entropy, KL divergence, mutual information
- Action and Perception as Divergence Minimization uses this framework to build intelligence
- Neuronal energy homeostasis theory
- Don’t rely on heuristics to learn intelligence. Use the framework of information theory
- Some useful heuristics may deceive us - yet they are not perfect; they break down under
- Weight regularization
- Batch/layer/group/instance normalization
- Backpropagation and
- more examples
- Graphs are a very flexible language - more than linear
- Information-theoretic solutions are superior
- Mutual-information regularization
- solutions to more examples
- We are still looking for a better language
- Pay attention to the plumbing of deep architectures and training
- how the information flows within the network
- Dense networks
- GRU/LSTM
- ResNet
- Inception
- Pyramid Point
- Transformers
- Graph Networks
- Neural turing machine
- how information travels from the objective to the network
- supervised learning
- semi supervised learning (finetuning)
- contrastive learning
- reinforcement learning
- corrupt feedback RL
- Intelligence by Generality
- We need metrics of intelligence to improve
- Intelligence is domain specific skill acquisition efficiency w.r.t. information (On the Measure of Intelligence)
- “our most significant measurement of progress is an agent's ability to achieve goals in a wide range of environments” (Using Unity to Help Solve Intelligence)
- The question is how general is its intelligence
- Virtual particles
- Chemical equilibrium
- Biological adaptation
- Imagine DNA as living through creatures of its kind
- The immune system has a few experts, but is able to quickly train millions more
- Resting, it sits at a “generally diverse” critical point in state space
- Those examples also highlight superior convergence of “blind” optimizers
- Broaden-and-build / low motivational intensity cognitive and behavioral development
- Tight / loose social norms transition when necessary (hopefully)
- Homogeneous components become specialized
- Is it possible to make superhuman narrow AI’s to continuous succession?
- Alignment is Possible by a Heterogeneous Blend of Methods
- Safety contains the criminal; security protects the president
- We cannot totally contain AI. Perception and action together are essential for learning
- We must prepare it for world integration
- Methods of AGI behavioral analysis and forecasting
- Human observation is deceptive
- Neural networks are prone to adversarial attack
- Qualitative functional is probably not sufficient either
- Poor intuition for high dimensionality and numerical sensitivity
- XAI
- Research and list more methods
- Simulation
- Biological methods
- Function and structure
- Psychological methods
- Deterministic complex systems can effectively behave nondeterministically
- Averaging corrupts information flow
- The brain’s action evoked potential is really an average of phase shift
- Corrupting the information mapping from stimulus to response improves security
- We cannot influence these systems without knowledge of internal state
- With hidden information, we shape behavior better
- We must exert energy to shape behavior during training
- The world’s distributions are dangerous
- word embedding unbiasing
- gpt3 error
- more examples
- A mixture of methods will be necessary to train
- Open and closed optimizers
- Behavioral psychology: openness to experience; closed to destructive behavior
- Affective psychology: broaden and build / motivational intensity
- Tight / loose social norm enforcement
- Value shaping
- behavior follows values
- What values should we instill?
- justice
- more values
- integrity
- value consistency
- frame as energy minimization
- Democratic value selection
- Expert panel may not be able to adequately
- BlenderBot’s maximum certainty method may help identify suitable values
- Ideally, a general intelligence can learn to autonomously identify values
- Copy themes of human intelligence
- The human body is innately intelligent
- unsupervised homeostasis learning
- innate and adaptive immune system
- lower sensorimotor pathways reduce information load
- reduced brain activation in mathematicians than lay people
- Neurons are genetically equipped for learning
- The connectome is disposed for HL Intelligence and skills
- Development
- Emotions
- Language
- Social Organization
- Keep AGI Open