Intelligent Agents. Vacuum-cleaner world

' $ Intelligent Agents Chapter 2 & % Z. Duric GMU CS 480 2 ' $ Vacuum-cleaner world A B Percepts: location and contents, e.g., [A, Dirty...

Author: Roderick Anderson

37 downloads 2 Views 43KB Size

Report

Download PDF

Recommend Documents

Intelligent Agents. Chapter 2

AITA : Building Intelligent Agents

Intelligent Agents BISS 2004

Interacting with Intelligent Agents

Introduction to AI & Intelligent Agents

25. Intelligent Agents. Chapter 2

Intelligent Agents for Distance Learning

Sensor scheduling using Intelligent Agents

Agents and environments. Intelligent Agents. Vacuum-cleaner world. Outline. Chapter 2

Agents and environments. Intelligent Agents. Vacuum-cleaner world. Reminders. A vacuum-cleaner agent. Outline. Chapter 2

Agents 1. This course is about designing intelligent agents. Agents and environments. The vacuum-cleaner world Rationality

Intelligent Agents. Philipp Koehn. 17 September 2015

Inteligent Robots and Knowledge Representation. Intelligent Agents

Intelligent Agents. Russell & Norvig Chapter 2

Intelligent Agents. CHAPTER 2 Oliver Schulte Summer2011

Intelligent Agents. Chapter 2. Chapter 2 1

COMP310 MultiAgent Systems. Chapter 2 - Intelligent Agents

Agents and environments. Intelligent Agents. Vacuum-cleaner world. Reminders. A vacuum-cleaner agent. Outline. Chapter 2 Actuators

Vacuum-cleaner world. Intelligent Agents. A vacuum-cleaner agent. Outline. A vacuum-cleaner agent. Agents and environments sensors

CMSC 471. Intelligent Agents. Russell & Norvig Chapter 2

Automated intelligent agents are they trusted members of military teams?

Intelligent Agents for Virtual Simulation of Human-Robot Interaction

Intelligent Agents for Training On-board Fire Fighting

'

$

Intelligent Agents Chapter 2

&

%

Z. Duric

GMU

CS 480

2

'

$

Vacuum-cleaner world

A

B

Percepts: location and contents, e.g., [A, Dirty] Actions: Lef t, Right, Suck, N oOp & Z. Duric

% GMU

'

$

A vacuum-cleaner agent Percept sequence

Action

[A, Clean]

Right

[A, Dirty]

Suck

[B, Clean]

Lef t

[B, Dirty]

Suck

[A, Clean], [A, Clean]

Right

[A, Clean], [A, Dirty] .. .

Suck .. .

& Z. Duric

CS 480

'

% GMU

4

$

function R EFLEX -VACUUM -AGENT ( [location,status]) returns an action if status = Dirty then return Suck else if location = A then return Right else if location = B then return Left

What is the right function? Can it be implemented in a small agent program?

& Z. Duric

% GMU

'

$

Rationality Fixed performance measure evaluates the environment sequence – one point per square cleaned up in time T ? – one point per clean square per time step, minus one per move? – penalize for > k dirty squares? A rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date Rational 6= omniscient Rational 6= clairvoyant Rational 6= successful Rational ⇒ exploration, learning, autonomy &

%

Z. Duric

GMU

CS 480

6

'

$

PEAS To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure?? Environment?? Actuators?? Sensors??

& Z. Duric

% GMU

'

$

PEAS To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure?? safety, destination, profits, legality, comfort, . . . Environment?? US streets/freeways, traffic, pedestrians, weather, . . . Actuators?? steering, accelerator, brake, horn, speaker/display, . . . Sensors?? video, accelerometers, gauges, engine sensors, keyboard, GPS, ...

&

%

Z. Duric

GMU

CS 480

8

'

$

Internet shopping agent Performance measure?? Environment?? Actuators?? Sensors??

& Z. Duric

% GMU

'

$

Environment types Solitaire

Backgammon

Internet shopping

Taxi

Observable?? Deterministic?? Episodic?? Static?? Discrete?? Single-agent??

&

%

Z. Duric

GMU

CS 480

10

'

$

Environment types

Observable??

Solitaire

Backgammon

Internet shopping

Taxi

Yes

Yes

No

No

Deterministic?? Episodic?? Static?? Discrete?? Single-agent??

& Z. Duric

% GMU

'

$

Environment types Solitaire

Backgammon

Internet shopping

Taxi

Observable??

Yes

Yes

No

No

Deterministic??

Yes

No

Partly

No

Episodic?? Static?? Discrete?? Single-agent??

&

%

Z. Duric

GMU

CS 480

12

'

$

Environment types Solitaire

Backgammon

Internet shopping

Taxi

Observable??

Yes

Yes

No

No

Deterministic??

Yes

No

Partly

No

Episodic??

No

No

No

No

Static?? Discrete?? Single-agent??

& Z. Duric

% GMU

'

$

Environment types Solitaire

Backgammon

Internet shopping

Taxi

Observable??

Yes

Yes

No

No

Deterministic??

Yes

No

Partly

No

Episodic??

No

No

No

No

Static??

Yes

Semi

Semi

No

Discrete?? Single-agent??

&

%

Z. Duric

GMU

CS 480

14

'

$

Environment types Solitaire

Backgammon

Internet shopping

Taxi

Observable??

Yes

Yes

No

No

Deterministic??

Yes

No

Partly

No

Episodic??

No

No

No

No

Static??

Yes

Semi

Semi

No

Discrete??

Yes

Yes

Yes

No

Single-agent??

& Z. Duric

% GMU

'

$

Environment types Solitaire

Backgammon

Internet shopping

Taxi

Observable??

Yes

Yes

No

No

Deterministic??

Yes

No

Partly

No

Episodic??

No

No

No

No

Static??

Yes

Semi

Semi

No

Discrete??

Yes

Yes

Yes

No

Single-agent??

Yes

No

Yes (except auctions)

No

The environment type largely determines the agent design The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent &

%

Z. Duric

GMU

CS 480

16

'

$

Agent types Four basic types in order of increasing generality: – simple reflex agents – reflex agents with state – goal-based agents – utility-based agents All these can be turned into learning agents

& Z. Duric

% GMU

'

$

Simple reflex agents

Agent

Sensors

Condition−action rules

What action I should do now

Environment

What the world is like now

Actuators

&

%

Z. Duric

GMU

CS 480

18

'

$

Reflex agents with state

Sensors State How the world evolves

What my actions do

Condition−action rules

Agent

What action I should do now

Environment

What the world is like now

Actuators

& Z. Duric

% GMU

'

$

Goal-based agents

Sensors State What the world is like now

What my actions do

What it will be like if I do action A

Goals

What action I should do now

Agent

Environment

How the world evolves

Effectors

&

%

Z. Duric

GMU

CS 480

20

'

$

Utility-based agents

Sensors State What the world is like now

What my actions do

What it will be like if I do action A

Utility

How happy I will be in such a state What action I should do now

Agent

Environment

How the world evolves

Effectors

& Z. Duric

% GMU

'

$

Learning agents Performance standard

Sensors

Critic

changes Learning element

knowledge

Performance element

learning goals

Environment

feedback

Problem generator

Agent

Actuators

&

%

Z. Duric

GMU

CS 480

22

'

$

AIMA code The code for each topic is divided into four directories: – agents: code defining agent types and programs – algorithms: code for the methods used by the agent programs – environments: code defining environment types, simulations – domains: problem types and instances for input to algorithms (Often run algorithms on domains rather than agents in environments.) (setq joe (make-agent :name ’joe :body (make-agent-body) :program (make-dumb-agent-program))) (defun make-dumb-agent-program () (let ((memory nil)) #’(lambda (percept) (push percept memory) ’no-op)))

& Z. Duric

% GMU