Spectrum of Multiprocessor OS. Types of Multiprocessor Systems

12/1/2009 “A distributed system is one where you can’t get your work done because a ’t t k d b computer you didn’t even know existed, has...

Author: Ethan Felix Gardner

4 downloads 0 Views 233KB Size

Report

Download PDF

Recommend Documents

Multiprocessor Operating Systems. Multiprocessor Applications

Multiprocessor Systems

Multiprocessor Scheduling. Multiprocessor Scheduling

Part 5: MULTIPROCESSOR SYSTEMS

Statistical Simulation of Symmetric Multiprocessor Systems

The Future of Multiprocessor Systems-on-Chips

A Lock-Free Multiprocessor OS Kernel

Multiprocessor Systems. Chapter 8, 8.1

Multiprocessor Synchronization

Multiprocessor Scheduling

Creating Multiprocessor Nios II Systems Tutorial

Combinators as control mechansims in multiprocessor systems

Creating Multiprocessor Nios II Systems Tutorial

The Stanford FLASH Multiprocessor

Response-Time Analysis of Conditional DAG Tasks in Multiprocessor Systems

The Partitioned Multiprocessor Scheduling of Sporadic Task Systems

Response-Time Analysis of Conditional DAG Tasks in Multiprocessor Systems

A Survey of Mutual-Exclusion Algorithms for Multiprocessor Operating Systems

4 Multiprocessor Programming

The Stanford Dash Multiprocessor

The Shared-Thread Multiprocessor

The Stanford Dash Multiprocessor

12/1/2009

“A distributed system is one where you can’t get your work done because a ’t t k d b computer you didn’t even know existed, has crashed.”

Distributed Operating Systems

‐ Leslie Lamport

COMP755

Spectrum of Multiprocessor OS • Some distributed OS only share files. Accessing data on another computer is very Accessing data on another computer is very visible to the user. The textbook calls this a Network Operating System. • Other distributed OS try to make a collection of computers look like one large single computer. The textbook calls this a Distributed Operating System. • There are many systems in between.

Types of Multiprocessor Systems • • • • • •

SMP or dual core computer Hypercube, separate memory multiprocessor Beowulf Cluster Cluster of Workstations Ad Hoc collection of computers Client/Server systems

1

12/1/2009

Beowulf Cluster • A Beowulf cluster is a group of usually identical PC computers running an Open identical PC computers running an Open Source Unix‐like operating system. They are networked into a small LAN. • Usually consists of one server node and several client nodes connected together via Ethernet • Server node controls the whole cluster and serves files to the client nodes

Beowulf Programming • Beowulf appears to the user like a multi‐processor lik lti computer instead of a bunch of PCs. • Beowulf clusters often use Parallel Virtual Machine (PVM) or Message Passing Interface (MPI) for parallel programming

Why distributed systems? y Epic poem written about

800 – 1000 CE y Earliest surviving English literature y The hero, Beowulf, battles monsters and dragons. y More about “wurd” M b “ d” than h “worm”

• • • • •

People are distributed, data are distributed Performance / Cost Scalability Modularity Availability & Reliability

2

12/1/2009

Characteristics

Characteristics of a Distributed System •

Resource sharing – –

Hardware like printers, disks, scanners H d lik i t di k Data • • • •

•

Web pages software libraries corporate data cooperative work

Openness – – –

• • • • •

Concurrency Scalability Message Passing Lack of global information Fault Tolerance – availability

interfaces are published uniform communication mechanism possibly heterogeneous

A distributed system can be built from 1. Microsoft Windows PCs PC 2. Linux computers 3. Sun Sparc workstations 4 All of the above 4. All of the above 5. None of the above

– access ‐ local and remote objects accessed by identical operations identical operations – location ‐ do not have to know location of objects or processes – concurrency ‐ multiple access without interference – replication ‐ multiple copies without user or application knowledge application knowledge

M icr

os of t W in Li d. nu .. x c om pu te Su r.. . n Sp ar c w or k. .. A ll of th e ab o. N .. on e of th e a b. ..

20% 20% 20% 20% 20%

Transparency

3

12/1/2009

Transparency (cont.)

Goals •

– failure ‐ continue despite hardware or software failure – migration ‐ objects can move without user knowledge – reconfiguration ‐ system can be changed without user knowledge – scaling li ‐ increasing the system size does not effect i i h i d ff structure or applications

Robustness • Failure detection • Reconfiguration

Efficiency – –

•

Flexibility – –

•

friendliness ability to evolve.

Consistency –

•

Minimize communications l db l i load balancing

transparency

Robustness –

equipped to handle exceptional situations and errors.

Failure Detection • Detecting hardware failure is difficult • To detect a link failure, a handshaking protocol To detect a link failure, a handshaking protocol can be used • If Site A does not receive a reply, it can repeat the message or try an alternate route to Site B

4

12/1/2009

Failure Detection (cont) • If Site A does not ultimately receive a reply from Site B, it concludes some type of failure has Site B, it concludes some type of failure has occurred • Types of failures: ‐ Site B is down ‐ The direct link between A and B is down ‐ The alternate link from A to B is down The alternate link from A to B is down ‐ The message has been lost • However, Site A cannot determine exactly why the failure has occurred

Reconfiguration • When Site A determines a failure has occurred, it must reconfigure the system: reconfigure the system: 1. If the link from A to B has failed, this must be broadcast to every site in the system 2. If a site has failed, every other site must also be notified indicating that the services offered by the failed tifi d i di ti th t th i ff d b th f il d site are no longer available • When the link or the site becomes available again, this information must again be broadcast to all other sites

5