CS344M Autonomous Multiagent Systems

CS344M Autonomous Multiagent Systems Todd Hester Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are t...

Author: Charlotte Harvey

5 downloads 4 Views 177KB Size

Report

Download PDF

Recommend Documents

Multiagent Systems. Lecture 2: Conducting Empirical Investigations in Multiagent Systems and Parallel Models. MAS Evolutionary Learning

Designing autonomous advisor systems

COMP310 MultiAgent Systems. Chapter 2 - Intelligent Agents

The Study of Delusion in Multiagent Systems

A Mathematical Model Geared for Multiagent Systems

Integrating Ontologies into Multiagent Systems Engineering

Normative Multiagent Systems: A Dynamic Generalization

COMP310 MultiAgent Systems. Mock Exam Solutions

Multiagent Systems Viewed as Distributed Scheduling Systems: Methodology and Experiments

BAE Systems Autonomous Capability Overview. Richard Williams Director Civil Autonomous Systems BAE Systems (UK & RoW)

Multiagent Interactions

TESIS DOCTORAL. GORMAS: Guidelines for ORganization-based Multiagent Systems

MultiAgent Systems in Power System Protection: Review, Classification and Perspectives

ABLE: A toolkit for building multiagent autonomic systems

Intercommunication framework for autonomous real-time systems

Daniel Serrano Head of Autonomous Systems EURECAT

The Seven Deadly Myths of Autonomous Systems

Safe Testing of Autonomous Systems Performance

Towards Autonomous Wheelchair Systems in Urban Environments

A Foundation for Perception in Autonomous Systems

Aerodynamic Sensing for. Autonomous Unmanned Aircraft Systems

Autonomous Power Systems based on Renewables

Autonomous Navigation Systems (ANS) Standard AET - Endorsement

Designing Progressive MultiAgent Negotiations

CS344M Autonomous Multiagent Systems Todd Hester

Department of Computer Science The University of Texas at Austin

Good Afternoon, Colleagues

Are there any questions?

Todd Hester

Good Afternoon, Colleagues

Are there any questions?

Todd Hester

Logistics

• First assignment: how did it go?

Todd Hester

Logistics

• First assignment: how did it go? • Next soccer assignment: score a goal and passing

Todd Hester

Logistics

• First assignment: how did it go? • Next soccer assignment: score a goal and passing − Help each other with C issues – parsing strings

Todd Hester

Logistics

• First assignment: how did it go? • Next soccer assignment: score a goal and passing − Help each other with C issues – parsing strings − Evaluating mostly on the logic – does the agent “do the right thing?”

Todd Hester

Logistics

• First assignment: how did it go? • Next soccer assignment: score a goal and passing − Help each other with C issues – parsing strings − Evaluating mostly on the logic – does the agent “do the right thing?” • 2D or 3D?

Todd Hester

Self-Introductions

• Speak loudly

Todd Hester

Self-Introductions

• Speak loudly • Name, year, major

Todd Hester

Self-Introductions

• Speak loudly • Name, year, major • At least one other thing about yourself

Todd Hester

Discussion An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future. • Is this a good definition? • The authors claim is is a “formal” definition of agents. Is it?

Todd Hester

Discussion An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future. • Is this a good definition? • The authors claim is is a “formal” definition of agents. Is it? • Can you do better?

Todd Hester

Discussion An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future. • Is this a good definition? • The authors claim is is a “formal” definition of agents. Is it? • Can you do better? • Do they need to be social? persistent? • Can they cease to be agents in a different environment? • Autonomy Todd Hester

Varieties of Autonomy • Do we have complete freedom over our beliefs, goals, and actions?

Todd Hester

Varieties of Autonomy • Do we have complete freedom over our beliefs, goals, and actions? • Software service has no autonomy — does what it’s told.

Todd Hester

Varieties of Autonomy • Do we have complete freedom over our beliefs, goals, and actions? • Software service has no autonomy — does what it’s told. • What’s Wooldridge’s take on where autonomous agents lie on the spectrum?

Todd Hester

Varieties of Autonomy • Do we have complete freedom over our beliefs, goals, and actions? • Software service has no autonomy — does what it’s told. • What’s Wooldridge’s take on where autonomous agents lie on the spectrum? − Decide how to act so as to accomplish delegated goals

Todd Hester

Varieties of Autonomy • Do we have complete freedom over our beliefs, goals, and actions? • Software service has no autonomy — does what it’s told. • What’s Wooldridge’s take on where autonomous agents lie on the spectrum? − Decide how to act so as to accomplish delegated goals • Also mentions adjustable autonomy

Todd Hester

My Requirements of Agents • They must sense their environment. • They must decide what action to take (“think”). • They must act in their environment.

Todd Hester

My Requirements of Agents • They must sense their environment. • They must decide what action to take (“think”). • They must act in their environment. Complete Agents

Todd Hester

My Requirements of Agents • They must sense their environment. • They must decide what action to take (“think”). • They must act in their environment. Complete Agents Multiagent systems: Interact with other agents

Todd Hester

My Requirements of Agents • They must sense their environment. • They must decide what action to take (“think”). • They must act in their environment. Complete Agents Multiagent systems: Interact with other agents Learning agents: Improve performance from experience

Todd Hester

My Requirements of Agents • They must sense their environment. • They must decide what action to take (“think”). • They must act in their environment. Complete Agents Multiagent systems: Interact with other agents Learning agents: Improve performance from experience Autonomous Bidding, Cognitive Systems, Traffic management, Robot Soccer Todd Hester

Environments Environment =⇒ sensations, actions

Todd Hester

Environments Environment =⇒ sensations, actions • fully observable vs. partially observable (accessible)

Todd Hester

Environments Environment =⇒ sensations, actions • fully observable vs. partially observable (accessible) • deterministic vs. non-deterministic

Todd Hester

Environments Environment =⇒ sensations, actions • fully observable vs. partially observable (accessible) • deterministic vs. non-deterministic • episodic vs. non-episodic

Todd Hester

Environments Environment =⇒ sensations, actions • fully observable vs. partially observable (accessible) • deterministic vs. non-deterministic • episodic vs. non-episodic • static vs. dynamic

Todd Hester

Environments Environment =⇒ sensations, actions • fully observable vs. partially observable (accessible) • deterministic vs. non-deterministic • episodic vs. non-episodic • static vs. dynamic • discrete vs. continuous

Todd Hester

Environments Environment =⇒ sensations, actions • fully observable vs. partially observable (accessible) • deterministic vs. non-deterministic • episodic vs. non-episodic • static vs. dynamic • discrete vs. continuous • single-agent vs. multiagent Todd Hester

The Decision

Todd Hester

The Decision

• reactive vs. deliberative

Todd Hester

The Decision

• reactive vs. deliberative • multiagent reasoning?

Todd Hester

The Decision

• reactive vs. deliberative • multiagent reasoning? • learning?

Todd Hester

Formalizing My Example Knowns: • O = {Blue, Red, Green, Black, . . .} • Rewards in IR • A = {W ave, Clap, Stand} o0, a0, r0, o1, a1, r1, o2, . . . Unknowns: • • • •

S = 4x3 grid R : S × A 7→ IR P = S 7→ O T : S × A 7→ S

oi = P(si)

ri = R(si, ai)

si+1 = T (si, ai) Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent:

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent:

action : P ∗ 7→ A

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent:

action : P ∗ 7→ A

• Reactive agent:

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent: • Reactive agent:

action : P ∗ 7→ A action : P 7→ A

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent: • Reactive agent:

action : P ∗ 7→ A action : P 7→ A

− Decision based entirely on the present

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent: • Reactive agent:

action : P ∗ 7→ A action : P 7→ A

− Decision based entirely on the present • State-based agent:

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent: • Reactive agent:

action : P ∗ 7→ A action : P 7→ A

− Decision based entirely on the present • State-based agent: action : I 7→ A, next : I × P 7→ I

Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent: • Reactive agent:

action : P ∗ 7→ A action : P 7→ A

− Decision based entirely on the present • State-based agent: action : I 7→ A, next : I × P 7→ I It is worth observing that state-based agents as defined here are in fact no more powerful than the standard agents we introduced earlier. In fact, they are identical in their expressive power. Todd Hester

Standard/Reactive/State-based Agents • Observation P , Action A, Internal State I • Standard agent: • Reactive agent:

action : P ∗ 7→ A action : P 7→ A

− Decision based entirely on the present • State-based agent: action : I 7→ A, next : I × P 7→ I It is worth observing that state-based agents as defined here are in fact no more powerful than the standard agents we introduced earlier. In fact, they are identical in their expressive power. Todd Hester

Reactive agents for next Thursday’s assignment task?

Todd Hester

Discussion

What are some tasks that are partially observable, non-deterministic, dynamic, continuous, and multi-agent?

Can we possibly expect an agent to perform well in such tasks?

Todd Hester