A physics approach to classical and quantum machine learning

A physics approach to classical and quantum machine learning Alexey Melnikov Institute for Theoretical Physics, University of Innsbruck Institute for...
Author: Jeffery Fields
2 downloads 1 Views 2MB Size
A physics approach to classical and quantum machine learning Alexey Melnikov

Institute for Theoretical Physics, University of Innsbruck Institute for Quantum Optics and Quantum Information

Supervisor: Hans J. Briegel Co-supervisors: Justus Piater and Gerhard Kirchmair Jointly with Adi Makmal, Vedran Dunjko and Nicolai Friis MIP Seminar April 15, 2015 Alexey Melnikov

A physics approach to machine learning

Interplay between quantum information theory and concepts from AI Quantum physics Artificial intelligence (AI) Quantum computing Quantum error correction Quantum walks

PS model

Intelligent agent Machine learning

PS - projective simulation Alexey Melnikov

A physics approach to machine learning

Outline

◦ Introduction – artificial intelligence (AI) and its applications – projective simulation (PS) model, a physical approach to AI ◦ Standard (classical) PS agent – benchmarking (grid-world and mountain-car problems) – generalization within PS Model ◦ Quantum PS agent – implementation of a quantum agent – superconductiong transmon qubits

Alexey Melnikov

A physics approach to machine learning

Artificial intelligence (AI) and intelligent agents AI is the study of agents that receive percepts from the environment and perform actions.* Any AI program is called intelligent agent.

Environment

Intelligent agent percepts

actions

* S. Russell and P. Norvig. Artificial intelligence: A Modern Approach, 3rd edition (Prentice Hall, 2009).

Alexey Melnikov

A physics approach to machine learning

AI in robotics A robotic agent might have microphones, cameras, touch sensors and various motors for actuators.*

Environment Robot microphones cameras, touch

Applications: • robotics • finance • games • Google

motors, voice

• QEC • ...

* S. Russell and P. Norvig. Artificial intelligence: A Modern Approach, 3rd edition (Prentice Hall, 2009).

Alexey Melnikov

A physics approach to machine learning

AI in finance A trading agent perceives market rates, news and trades in stock market. A robotic agent Applications:

Stock market

Trading agent rates, news

• robotics • finance • games

trades

• Google • QEC • ...

* S. Russell and P. Norvig. Artificial intelligence: A Modern Approach, 3rd edition (Prentice Hall, 2009).

Alexey Melnikov

A physics approach to machine learning

AI in games A game agent plays with you. A robotic agent

You

Game agent your moves

Applications: • robotics • finance • games

it’s own moves

• Google • QEC • ...

* S. Russell and P. Norvig. Artificial intelligence: A Modern Approach, 3rd edition (Prentice Hall, 2009).

Alexey Melnikov

A physics approach to machine learning

AI on the web Search engine interacts with a user. Google

User

Google query

Applications: • robotics • finance • games

web page

• Google • QEC • ...

* S. Russell and P. Norvig. Artificial intelligence: A Modern Approach, 3rd edition (Prentice Hall, 2009).

Alexey Melnikov

A physics approach to machine learning

AI in quantum error correction (QEC) AI can be useful for quantum physics. A QEC agent gets data from syndrome measurements and performs error correction.*

Quantum register QEC agent syndrome data

Applications:

• robotics • finance • games

apply unitaries

• Google • QEC • ...

* J. Combes, et al., arXiv:1405.5656 (2014).

Alexey Melnikov

A physics approach to machine learning

Projective simulation (PS) agent • PS model is a novel physical approach to AI • PS agent process information stochastically in a directed, weighted network of clips (units of memory) • No computations, simple adjustment rules • Natural candidate for quantization, using methods of quantum walks Clip network ...

PS agent

...

percepts

p41 percept clip Clip 1

Clip 4

action clip

p13 Clip 3

actions

input

p12

p23 p32 Clip 2

p35

Clip 6

p56 Clip 5

output

H. J. Briegel and G. De las Cuevas, Scientic reports 2 (2012). Alexey Melnikov

A physics approach to machine learning

Projective simulation (PS) model Each edge connects some clip ci with a clip cj and has a time-dependent weight h(t) (ci , cj ). The h-values represent the unnormalized strength of the edges, and determine the hopping probabilities from clip ci to clip cj according to h(t) (ci , cj ) . p (t) (cj |ci ) = P (t) k h (ci , ck ) h-values are updated according to h(t+1) (ci , cj ) = h(t) (ci , cj ) − γ(h(t) (ci , cj ) − 1) + g (t) (ci , cj )λ, where 0 ≤ γ ≤ 1 is a damping parameter and λ is a non-negative reward given by the environment. Each time an edge is visited, the corresponding g -value is set to 1, following which it is decreased after each time step with a rate η: g (t+1) (ci , cj ) = g (t) (ci , cj )(1 − η). J. Mautner, A. Makmal, D. Manzano, M. Tiersch, and H. J. Briegel, New Generation Computing 33 (2015) Alexey Melnikov

A physics approach to machine learning

Grid-world task • The agent always starts from the (1,3) cell • It can choose among four actions: left, right, up or down • If the agent decides to go to a square labeled as “wall” or to go beyond the grid, then no movement is performed but the time step is counted The grid-world task: The goal of the game is to find the “star”.

• Reward of λ = 1 is received only after reaching the goal • A performance of an agent in this task is evaluated by the number of steps it makes before reaching the goal at each trial

R. S. Sutton, Proc. of the 7th International Conference on Machine Learning (1990) Alexey Melnikov

A physics approach to machine learning

PS network construction

x =1 y =1

x =1 y =2

...

x =6 y =9







hij gij ⇐

Alexey Melnikov

A physics approach to machine learning

PS network construction

x =1 y =1

x =1 y =2

...

x =6 y =9







hij gij ⇐

Alexey Melnikov

A physics approach to machine learning

PS network construction

x =1 y =1

x =1 y =2

...

x =6 y =9







hij gij ⇐

Alexey Melnikov

A physics approach to machine learning

PS in the grid-world task. Learning curves Η=0.03

average number of steps

24

Η=0.12 22

Η=0.15

20 18 16 14 0

50

100

150

200

trials

The learning curves of the PS agent in the grid-world task, with different η values. A trade-off is observed between the best performance and the number of trials required to reach it. Model

# of steps to goal after 100 trials

Parameters

PS† PI∗

15.4 14

λ = 1, η = 0.12, γ = 0 β = 0.1, γ = 0.9, α = 1000

Performance of the PS model in comparison with the PI model †

A. A. Melnikov, A. Makmal, and H. J. Briegel, Artificial Intelligence Research 3 (2014) * R. S. Sutton, Proc. of the 7th International Conference on Machine Learning (1990) Alexey Melnikov

A physics approach to machine learning

Mountain car problem • The agent always starts with a random position and velocity: x ∈ [−1.2, 0.5], v ∈ [−0.7, 0.7] • It can choose among 3 actions: forward thrust (to the right), no thrust, and reverse thrust (to the left) The goal is to find the “star” at x = 0.5

• The next state is defined by the equations vnew xnew

= vold + 0.001 ∗ Action − 0.0025 cos(3xold ) = xold + vold

• Reward of λ = 1 is received only after reaching the goal • A performance of an agent in this task is evaluated by the number of steps it makes before reaching the goal at each trial S. P. Singh and R. S. Sutton, Machine learning 22, 123 (1996). Alexey Melnikov

A physics approach to machine learning

PS network construction

[x0 , x1 ], [v0 , v1 ]

(x1 , x2 ], [v0 , v1 ]

(x19 , x20 ], (v19 , v20 ]

...

hij gij =



Alexey Melnikov

+

A physics approach to machine learning

PS in the mountain car task. Learning curves 450

500

pHtL Hc j Èci L by Eq. 2 HsoftmaxL, Η=0.02

400

æ

300

200

400 à

10

15

20

æ æ à

æ à

æ

300

à

æ

æ æ

à

æ

æ

æ

æ

à

æ æ

200

æ

æ

æ

æ à à

à

150

à

æ

à

250

à à

à

à à

5

æ æ

pHtL Hc j Èci L by Eq. 2 HsoftmaxL

à

æ

350

100 0.00

100 0

pHtL Hc j Èci L by Eq. 1

æ

average number of steps

average number of steps

pHtL Hc j Èci L by Eq. 1, Η=0.02

à

à

à

0.02

à

à

à

à

à

à

0.04

0.06

0.08

0.10

Η parameter

trials

(a) PS learning curves are shown for optimal values of η = 0.02 (for 20 trials).

(b) The dependence of the PS performance on the η parameter is shown after 20 trials.

Model

# of steps to goal after 100 trials

Parameters

PS† SARSA∗

223/trial 450/trial

λ = 1, η = 0.02, γ = 0 5 grids, each of 9 by 9 input space

Performance of the PS model in comparison with the SARSA algorithm †

A. A. Melnikov, A. Makmal, and H. J. Briegel, Artificial Intelligence Research 3 (2014) * S. P. Singh and R. S. Sutton, Machine learning 22, 123 (1996) Alexey Melnikov

A physics approach to machine learning

Generalization. Motivation There are many tasks in which percepts are composed of several elements. Even if two percept clips are different they may contain some common set of elements. This common set of elements should be taken into account in order to share the experience between different inputs. Useful generalization *: ⇐





• An ability for categorization (recognizing that all red signals have a common property, which we can refer to as redness)



• An ability to classify +



• Relevant learned

While driving the agent sees a traffic light with an arrow sign and should choose among two actions: continue driving (+) or stop a car (−).

generalizations

should

be

• Correct actions should be associated with relevant generalized properties • The generalization mechanism should be flexible

* A. A. Melnikov, A. Makmal, and H. J. Briegel, arXiv:1504.02247 (2015). Alexey Melnikov

A physics approach to machine learning

Mechanism of generalization ⇐





















#













+

(a) t ≤ 1000

(b) 1000 < t ≤ 2000





#

+

(a) (1 ≤ t ≤ 1000), the agent is rewarded for stopping at red light and for driving at green light

#

+















#



(c) 2000 < t ≤ 3000

+



(d) 3000 < t ≤ 4000

Alexey Melnikov

(b) (1000 < t ≤ 2000), the agent is rewarded for doing the opposite (c) (2000 < t ≤ 3000), the agent should only follow the arrows (d) (3000 < t ≤ 4000), the environment rewards the agent whenever it chooses to drive

A physics approach to machine learning

Mechanism of generalization (a) (1 ≤ t ≤ 1000), the agent is rewarded for stopping at red light and for driving at green light

1.0

efficiency Et

0.8

(a)

(b)

(c)

(d)

0.6

(b) (1000 < t ≤ 2000), the agent is rewarded for doing the opposite

0.4 0.2 0.0

0

1000

2000

3000

time step

The performance of the PS agent with generalization

4000

(c) (2000 < t ≤ 3000), the agent should only follow the arrows (d) (3000 < t ≤ 4000), the environment rewards the agent whenever it chooses to drive

A. A. Melnikov, A. Makmal, and H. J. Briegel, arXiv:1504.02247 (2015).

Alexey Melnikov

A physics approach to machine learning

Quantum PS agent • PS model is a novel physical approach to AI • PS agent process information stochastically in a directed, weighted network of clips (units of memory) • No computations, simple adjustment rules • Natural candidate for quantization, using methods of quantum walks Quantum clip network classical percepts

...

quantum PS agent

...

p41 percept clip Clip 1

classical actions

Clip 4

action clip

p13 Clip 3

classical input

p12

p23 p32 Clip 2

p35

Clip 6

p56 Clip 5

classical output

*G. D. Paparo, V. Dunjko, A. Makmal, M. A. Martin-Delgado, and H. J. Briegel, Phys. Rev. X 4, 031002 (2014). Alexey Melnikov

A physics approach to machine learning

Quantum PS agent Classical random walk on a network with N clips is characterised by a transition T matrix P, where each clip is a vector ci = [0, . . . , 1, 0, . . . , 0] with unity on the i-th position N X P ci = pij cj , j=1

In the quantum case each clip is a state |ci i. However a single unitary cannot encode the P matrix.

We use the set of N unitaries for a quantum walk Ui |0i =

N X √

pij |cj i .

j=1

Two-qubit probability unitaries for PS network with 4 clips.

V. Dunjko, N. Friis, and H. J. Briegel, New J. Phys. 17 (2015) Alexey Melnikov

A physics approach to machine learning

Nested coherent controlization

Three-qubit probability unitaries for PS network with 8 clips.

Alexey Melnikov

No-go theorem. Additional degrees of freedom are needed.

A physics approach to machine learning

Transmon qubits An aluminum transmon qubit with the dipole antenna is mounted at the center of the cavity.

For one qubit, the system is described by the Hamiltonian  2 2 H/~ = ωr a† a + ωq b † b − χqr /2 a† ab † b − χrr /2 a† a − χqq /2 b † b , where a and b are the dressed mode operators of the resonator and the qubit, respectively, ωr and ωq are their frequencies, χqr is the coupling between them and χrr , χqq are the anharmonicities. H. Paik, et al., Phys. Rev. Lett. 107 (2011). B. Vlastakis, et al., Science 342 (2013). Alexey Melnikov

A physics approach to machine learning

Coherent controlization using transmon qubits We use the cavity as a additional degree of freedom to implement the coherent controlization. a b

c

The resonance frequency of the cavity depends on the state of the qubits. For two superconducting qubits, we may hence label these ω00 , ω01 , ω10 , ω11 , corresponding to the two-qubit states |00i, |01i, |10i, and |11i, respectively. Alexey Melnikov

A physics approach to machine learning

Coherent controlization using transmon qubits

6 4 2 0 2 4 6 6 4 2 0 2 4 6

1

0

1 6

4

2 0

2

4

6

6

4

2 0

2

4

6

Alexey Melnikov

6

4

2 0

2

4

6

6

4

2 0

2

4

6

6

4

A physics approach to machine learning

2 0

2

4

6

Conclusion

◦ Standard (classical) PS agent – is a competitive AI model (grid-world and mountain-car problems) – generalization mechanism improves the model – has potentially many applications ◦ Quantum PS agent – quantization, using known methods of quantum walks – implementation using superconductiong qubits

Thank you for your attention!

Alexey Melnikov

A physics approach to machine learning

Conclusion

◦ Standard (classical) PS agent – is a competitive AI model (grid-world and mountain-car problems) – generalization mechanism improves the model – has potentially many applications ◦ Quantum PS agent – quantization, using known methods of quantum walks – implementation using superconductiong qubits

Thank you for your attention!

Alexey Melnikov

A physics approach to machine learning