Challenges in modeling enterprise storage systems

Challenges in modeling enterprise storage systems Arif Merchant Hewlett-Packard Laboratories Contact: [email protected] © 2006 Hewlett-Packard Dev...
Author: Percival Foster
0 downloads 1 Views 1000KB Size
Challenges in modeling enterprise storage systems

Arif Merchant Hewlett-Packard Laboratories Contact: [email protected] © 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice

An enterprise storage system Multiple applications

App1 data1

App2

data2

data3

Primary site Running on resources at primary site

Host

Application data data5 objects

App3 data4

Data storage system

Host Storage-area network

Tape lib

Each app has its own data objects

Potentially using secondary site resources

Secondary site(s)

Disk Disk array array

Sites may serve as primary / secondary peers WAN link(s)

Tape transport QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

2

Storage System models Why should you care? •

Storage is a significant part of IT costs − Over 35% for enterprises with 5000+ employees (IDC 2002) − Large fraction of storage cost is management − Environment is complex: 100s of applications sharing petabytes of data − Stringent application requirements – failures can be catastrophic (Survey: $89K-6.4M/hr of downtime)



Good storage models are crucial for managing storage − Systematic, accurate models needed for informed choices



Uses of models − Capacity planning – green field & consolidation − Storage design − On-going management

QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

3

Storage management automation

Where storage models fit in [Anderson2002,Alvarez2001] Understand needs

Determine solution

•Workload characterization

•Select devices + configurations

•System components

•Model to verify config. Is good (Changing) business

requirements

•Assign load to devices

Design Design

Iterative feedback loop

Analyze Analyze

Implement Implement

Monitor usage Construct solution

•Measure workload, re-characterize

•Configure devices, LVM, …

•System response

•Migrate data

QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

4

Components of storage models App1 data1

App2

data2

data3

data4

Primary site

Workloads

Host

Application data data5 objects

App3

Data storage system

Host

Secondary site(s)

Storage-area network

Tape lib

Disk Disk array array WAN link(s)

Tape transport QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

5

Workload characterization • Need

− Compact characterization − Adequate for predictive storage system models • Minimal

parameter set:

− Size of data − IO rate, IO size distribution − Reads vs. writes − Sequentiality, spatial locality − Temporal locality (e.g., frequency of repeated access) − Concurrency (Simultaneously outstanding requests) QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

6

Measuring workloads •

How do we acquire workload parameters? − Users do not know workload details − Measurements on production system may add load − Storage devices have limited, vendor-dependent measurement points (although industry standards SMI-S are helpful!) − Workload behavior may depend on hardware & configuration − Open problems:

• Minimum perturbation methods for measuring workload parameters • Tradeoffs between inaccuracies of workload parameters and model accuracy IO trace

running system QEST, September 2006

workload analysis

workload description

workload analysis

Arif Merchant: Challenges in modeling enterprise storage systems

7

Workload attributes – access variability

How do you compactly capture this complex behavior? Sample results for a single run i3125om5, RC = 8GB, WC = 8GB; cache outcome percentages by I/O address range 100%

Coalesced write

average over full trace

90%

80%

Warm write

average for writes second half

70%

Cold write

60%

Delayed write

I/O's

cache hits for a traced openmail workload

50%

40% reads

Compulsory read miss

30%

Replacement read miss 20% read hits

10%

0% 0

400

800

1200

1600

2000

2400

2800

3200

trace time [seconds] QEST, September 2006

3600 ave steady state

Arif Merchant: Challenges in modeling enterprise storage systems

Read hit - reread Read hit from write cache

Graph Graph by by the the HPL HPL Sonora Sonora team team 8

Workload specification challenges • What

is an adequate workload specification? −Complex request arrival patterns

• Self-similar? Dependent on response? • Seasonality, trends

−Complex access patterns −Locality (spatial and temporal) −Concurrency variation −Correlation/interference between workloads • Synthesis

of workloads accurately representing real workloads

QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

9

Components of storage systems The basic storage device

App1 data1

App2

data2

data3

data4

Primary site Host

Disk drives

Application data data5 objects

App3

Data storage system

Host

Secondary site(s)

Storage-area network

Tape lib

Disk Disk array array WAN link(s)

Tape transport QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

10

Disk drives simplified

2.5” Seagate Savio

1 0 1 1 1 0 1 0 0 1 1 0 0 0 0 1

Block QEST, September 2006

Arif Merchant: Challenges in modeling enterprise storage systems

11

Disk drives specs

High end (enterprise) disk Seek

Rotation Transfer

3.5ms

• Typical

Seagate Cheetah 15K drive

2ms

1ms

64 KB random read service time

access times (seek + rotation):

− 5.5ms − Not improving quickly (limited by speed of physical movement) • Typical

sustained transfer rate:

− 58 to 96 MB/s or ~1ms for 64KB − Increasing quickly over time (increases with bit density) • Annual QEST, September 2006

failure rate: 0.62% Arif Merchant: Challenges in modeling enterprise storage systems

12

High disk access time

The root of many storage (modeling) problems • Random

IO throughput