Innovative Technology for Insightful Impact

Innovative Technology for Insightful Impact Rich’s Overview… @richniemiec • • • • Advisor to Rolta International Board Former President of TUSC • ...
Author: Brent Hall
2 downloads 1 Views 6MB Size
Innovative Technology for Insightful Impact

Rich’s Overview… @richniemiec • •

• •

Advisor to Rolta International Board Former President of TUSC • • •

Former President Rolta TUSC & President Rolta EICT International Author (3 Oracle Best Sellers – #1 Oracle Tuning Book for a Decade): • • •

• • • • • • • • •

Inc. 500 Company (Fastest Growing 500 Private Companies) 10 Offices in the United States (U.S.); Based in Chicago Oracle Advantage Partner in Tech & Applications

Oracle Performing Tips & Techniques (Covers Oracle7 & 8i) Oracle9i Performance Tips & Techniques Oracle Database 10g Performance Tips & Techniques

Former President of the International Oracle Users Group Current President of the Midwest Oracle Users Group Chicago Entrepreneur Hall of Fame - 1998 E&Y Entrepreneur of the Year & National Hall of Fame - 2001 IOUG Top Speaker in 1991, 1994, 1997, 2001, 2006, 2007 MOUG Top Speaker Twelve Times National Trio Achiever award - 2006 Oracle Certified Master & Oracle Ace Director Purdue Outstanding Electrical & Computer and Engineer - 2007 2

Agenda – 60,000 attendees + On-Line  Oracle Trends

 In-Memory Option

 12c New Features  FMW & Cloud (Apps)

 Engineered Systems  Big Data

IOUG OOW Presentations Available Online IOUG's User Group Sunday and weekly sessions at OpenWorld were a huge success. User presentations were focused on hot topics including:           

Oracle Database 12c Big Data BI Warehousing/Analytics Cloud Computing & Virtualization Engineered Systems Middleware Oracle Enterprise Manager Performance WebCenter Storage Strategic Leadership

Mark Hurd & Incredible Trends

Mark Hurd & Incredible Trends • Average age of Apps is 20 years…back then • • • • •

• • • •

No Search Few using the Internet No Amazon No Facebook No Twitter

40% Data growth per year 90% of Data created in past two years Big bank – Now has 300P & growing at 40% Mobile Retail is getting huge & may be most of the future market • Oracle will spend $5B in R&D this year

Oracle Firsts – Innovation! 1979 First commercial SQL relational database management system 1983 First 32-bit mode RDBMS 1984 First database with read consistency 1987 First client-server database 1994 First commercial and multilevel secure database evaluations 1995 First 64-bit mode RDBMS 1996 First to break the 30,000 TPC-C barrier 1997 First Web database 1998 First Database - Native Java Support; Breaks 100,000 TPC-C 1998 First Commercial RDBMS ported to Linux 2000 First database with XML 2001 First RDBMS with Real Application Clusters & First middle-tier database cache 2004 First True Grid Database 2005 First FREE Oracle Database (10g Express Edition) 2006 First Oracle Support for LINUX Offering 2007 Oracle 11g Released! 2008 Exadata V1 Server Announced (Oracle buys BEA) 2009 Oracle buys Sun – Java; MySQL; Solaris; Hardware; OpenOffice 2010 Oracle announces MySQL Cluster 7.1, Exadata, Exalogic, America’s Cup Win 2011 X2-2 Exadata, ODA, Exalytics, SuperCluster, Big Data, Cloud, Social Network 2012 X3-2 Exadata, Expanded Cloud Offerings, Solaris 11.1 2013 Oracle12c Released! Oracle X3-8 Exadata, Acquisitions (Acme Packet…etc.)! MANY OTHERS WITHIN THIS PRESENTATION! 8

What’s New at Oracle … EVERYTHING!

We’re #1 … in Everything!

Oracle now a GIANT!

In-Memory Columnar Store

Oracle’s new In-Memory Option*  100x faster real time analytics queries  2x faster OLTP & 3-4x faster INSERTS  Oracle demo showed Wikipedia query 1354x faster than NO INDEX vs. NO INDEX using InMemory (Drop analytics indexes?) Easy settings** (“flip a switch”):  inmemory_size = 2000G  alter table EMP inmemory; (also alter for individual partition) • * Announced /not yet available (loaded on startup/first access) • ** No documented/undocumented parameter in current version.

With In-Memory – Drop Analytics Indexes?

With In-Memory – Drop Analytics Indexes?

Oracle’s new In-Memory Option      

NO SQL coding changes necessary. NO SQL restrictions mentioned NO changes required to any applications NO changes to the data itself NO changes if you use multitenant (CDB/PDBs) NO changes if you use it in the Cloud

Some questions remain: How much will it cost – is it a $$ option? How much additional memory do I need? How much physical space will I save? How compressed is the columnar compression?

Oracle DEMO – 2B rows/sec INDEX search

Oracle DEMO – 5M rows/sec No Index 7B rows/sec In-Memory with No Index

Oracle’s In-Memory Option LIVE DEMO 3B row table search:  INDEXED = 2B rows/sec  UNINDEXED = 5M rows/sec  In-Memory = 7B rows/sec (with NO INDEX)

 In-Memory = 1354x faster than UNINDEXED search  Tuning = 400x faster than UNINDEXED search  In-Memory = 3.5x faster than INDEX search  3.5x faster for this example only, but no index saves space and DML costs so there are multiple benefits. If not properly indexed it’s much more!

Oracle’s new In-Memory Option

Oracle’s new In-Memory Option  Great Performance for both OLTP & Analytics  Could substantially reduce Analytics Indexes  Scale Up (larger servers) or Scale Out (clusters)  In-Memory queries can be Parallelized across servers using RAC  NO coding or application changes required  Columnar data is “highly” COMPRESSED  NO Logging on columnar store (near zero overhead on data changes) – transactional integrity exists between row store & column store  Joins tables 10x faster

Oracle’s M6-32 Big Memory Machine

Oracle’s M6-32 Big Memory Machine*  32 M6 Sparc CPUs  384 total cores = 32 CPUs x 12 cores/each  3072 threads = 96 threads/CPU x 32 CPUs

 32T of DRAM  Fastest In-Memory Database  1024 Memory DIMMs (Dual In-Line Memory Module)

 CPUs communicate using Oracle’s System Coherency Interconnect – 384 port silicon switching network - with 3T/sec bandwidth  Enables massive shared memory for In-Memory Applications  M6 boards plug directly into M5 chasis (same) *Available NOW!

Oracle Demo: M6-32 Big Memory Machine

Oracle’s M6-32 SuperCluster

Oracle’s M6-32 SuperCluster Machine*  Fastest Database Machine  Integrated Exadata Storage for 10x database I/O acceleration  Connected to Exadata Storage Expansion Rack  InfiniBand I/O Interconnect  3T Silicon Network for CPU communications  32T Memory for Column Store Oracle Demo:  341 B rows/sec – WOW!! (accessing 218B row table sub-second)

*Available NOW!

Testing the Future Version Version of the Database Version of the Database for 11g R2 Examples (Briefly – See “12c New Features” at IOUG for more)

Multiple Types of Indexes on the Same Column (Using the Invisible Index)

Multiple Types of Indexes on the Same • Create MORE than Column(s) one index on a column • Set only ONE index to VISIBLE • Ok to have ONE + any Function Based Index (exception) • Great to use different types of indexes for batch, query, or data warehousing at different times. • Some restrictions apply…for a give column(s) • You can not create a B-tree AND B-tree cluster index • You can not create a B-tree and an index-organized table (IOT)

• All indexes ARE MAINTAINED during DML • DML could be slow if TOO MANY indexes are created

• Great for variable workloads! 30

Multiple Types of Indexes on Add indexes – the FIVESame Indexex Column(s) on the same column: select a.table_name, a.index_name, b.column_name, a.uniqueness, a.visibility from user_indexes a, user_ind_columns b where a.index_name = b.index_name and a.table_name = ‘DEPT‘; TABLE_NAME ---------DEPT DEPT DEPT DEPT DEPT







Adaptive Optimization

Adaptive Query Optimization • Adaptive query optimization allows optimizer to adjust execution plan at run time when additional/better information is available. • Adaptive Plans: Different Join Methods (change NL to HASH) or Parallel Distribution • Adaptive Statistics: Dynamic stats, Auto Reoptimization, and SQL Plan Directives

• Adaptive Plans does not pick the final plan until execution time based on statistics collection. Information learned at execution time is used in future executions. You’ll see the plan table output in the note section: Note -------------------------- this is an adaptive plan

• The 12c Adaptive Optimizer adapts plans based on not just the original tables stats, but also additional adaptive statistics • There are three types of Adaptive Statistics: • Dynamic Statistics (previously dynamic sampling in 10g/11g) or runtime statistics • Automatic Reoptimization or statistics generated after the initial execution • SQL Plan Directives direct optimizer to dynamic statistics & gets accurate cardinality 33

Runaway Query Management & Partial Indexes for Partitioned Tables

Runaway Query Management • Resource Manager now pro-actively manages problems queries and takes action based on settings for a given consumer group when: • • • •

CPU is exceeded Physical I/O is exceeded (disk) Logical I/O is exceeded (memory) Elapsed Time is exceeded

• This can be automated! • New views allow the DBA to see problem queries that are over the limit for each Consumer Group (can be set to automatically be terminated or can be switched to a new group with lower resources) • Views are persisted in the AWR • Must have the appropriate resources to manage this • Can be set based on start of session or start of SQL or PL/SQL: • SWITCH_FOR_CALL resource plan directive 35

Runaway Query Management (Oracle 12c DBA Guide example…)

Create a Resource plan Directive that kills any session that exceeds 60 seconds of CPU time Create a Resource plan Directive that switches sessions to low_group if > 10000 physical IO’s or >2500M of data transferred. Session returns to original group after bad query ends




Partial Indexes for Partitioned Table CREATE TABLE DEPT3 (DEPTNO NUMBER(2), DEPT_NAME VARCHAR2(30)) INDEXING OFF PARTITION BY RANGE(DEPTNO) (PARTITION D1 VALUES LESS THAN (10) indexing on, PARTITION D2 VALUES LESS THAN (20) indexing on, PARTITION D3 VALUES LESS THAN (MAXVALUE)); Table created. SQL> create index dept3_partial on dept3 (dept_name) local indexing partial; Index created. (Local Index Partitions D1 & D2 will be usable – can create global index instead) 37

Pluggable Databases (See my “12c New Features” Class for much more!)

Thanks: Penny Avril & Byrn Liewellyn & Lucas Niemiec ORA-65052: statement involves operations with different container scope ORA-65040: operation not allowed from within a pluggable database ORA-65017: seed pluggable database may not be dropped or altered

Start with a Pristine Oracle System and Brand New Oracle Database

Install New DB

Add User Data

More Data

Non-CDB Pristine DB

Separate PDB

Keep Pristine DB Separated 39

Pluggable Databases are Here!


Pluggable Databases • • • •

CDB = Container Database (has Root DB & also has a seed PDB) PDB = Pluggable Database (plugged into a CDB) Non-CDB = Original type of Database (neither a CDB or PDB) Why?: Can’t consolidate 100’s of database on one machine … too many resources required when you add the SGAs up! Enter PDBs. • Share: Big Data Sources, Acquisitions, Partners, Shared Research, Governments

• • • • • •

Quickly create a new database (PDB) or copy existing one (PDB) Move existing PDBs to new platform or location or clone it (snapshot) Patch/Upgrade PDB by plugging it into a CDB at a later version Physical machine runs more PDBs old way: Easier to manage/tune Backup entire CDB + any number of PDBs New syntax for commands: PLUGGABLE DATABASE


Pluggable Databases…


Containers 0 - 254 • • • •

Entire CDB => Container ID = 0 Root (CDB$ROOT) => Container ID = 1 Seed (PDB$SEED) => Container ID = 2 PDBs => Container ID = 3 to 254


(Connect to ROOT): SQL> connect / as sysdba SQL> SHO CON_ID CON_NAME

CON_ID -----------------------------3

CON_ID -----------------------------1

CON_NAME -----------------------------PDB1

CON_NAME -----------------------------CDB$ROOT

CDB or PDB created… • Background Processes /SGA (shared by root & all PDBs) • Character Set shared by root & all PDB’s • Redo shared by root and all PDB’s • Undo shared by root and all PDB’s • Temporary Tablespace – can create for each PDB • Time Zones – can be set for each PDB • Initialization parameters – some can be set by PDB • Separate SYSTEM & SYSAUX for root & each PDB • Data files separate for root & each PDB (same block size) 44

Query the PDBs select name, open_mode, open_time from v$pdbs; NAME --------------PDB$SEED PDB1 PDB_SS


OPEN_TIME -------------------------23-FEB-13 AM 23-FEB-13 AM 23-FEB-13 AM


Creating a PDB - Many ways to do it…

• Create a PDB by copying the seed PDB • Create a PDB by cloning another PDB • Create a PDB by using the XML metadata files and other files and plugging them into a CDB • Create a PDB using a non-CDB (multiple ways) • Use DBMS_PDB to create an unplugged PDB • Create an empty PDB and use data pump to move data • Using GoldenGate replication to create


Cloning a PDB (example) CREATE PLUGGABLE DATABASE pdb2 FROM pdb1; CREATE PLUGGABLE DATABASE pdb2 FROM pdb1 PATH_PREFIX = '/disk2/oracle/pdb2' FILE_NAME_CONVERT = ('/disk1/oracle/pdb1/', '/disk2/oracle/pdb2/'); CREATE PLUGGABLE DATABASE pdb2 FROM pdb1 FILE_NAME_CONVERT = ('/disk1/oracle/pdb1/', '/disk2/oracle/pdb2/') STORAGE (MAXSIZE 2G MAX_SHARED_TEMP_SIZE 100M); CREATE PLUGGABLE DATABASE pdb2 FROM [email protected]_link; 47

Moving between CDB/PDBs Switch Containers… SQL> ALTER SESSION SET CONTAINER=PDB1; Session altered. SQL> alter session set container=CDB1; ERROR: ORA-65011: Pluggable database does not exist ALTER SESSION SET CONTAINER=CDB$ROOT; Session altered. ALTER SESSION SET CONTAINER=PDB$SEED; Session altered. ALTER SESSION SET CONTAINER=pdb_ss; (not case sensitive) Session altered.


Open/Close PDBs SQL> ALTER PLUGGABLE DATABASE CLOSE IMMEDIATE; Pluggable database altered. SQL> ALTER PLUGGABLE DATABASE OPEN READ WRITE; Pluggable database altered. SQL> ALTER PLUGGABLE DATABASE CLOSE; (shutdown) Pluggable database altered.

Alter pluggable database open upgrade; (to migrate) Pluggable database altered.


Open/Close PDBs alter pluggable database all except pdb1 close immediate; Pluggable database altered. select name, open_mode, from v$pdbs; NAME ---------PDB$SEED PDB1 PDB_SS



OPEN_TIME ------------------------11-MAR-13 PM 27-MAR-13 AM 27-MAR-13 AM

alter pluggable database pdb$seed close immediate; alter pluggable database pdb$seed close immediate *

ERROR at line 1: ORA-65017: seed pluggable database may not be dropped or altered


Startup PDB Startup pluggable database pdb1 open;(read/write) Pluggable Database opened. (or while in pdb1 just run STARTUP) Startup pluggable database pdb1 open read only; Pluggable Database opened. Startup pluggable database pdb1 force; (closes/opens) Pluggable Database opened. (or while in pdb1 just run STARTUP FORCE)


When you startup the CDB… SQL> startup ORACLE instance started. Total System Global Area Fixed Size Variable Size Database Buffers Redo Buffers Database mounted. Database opened.

626327552 2276008 524289368 92274688 7487488

bytes bytes bytes bytes bytes

select name, open_mode, from v$pdbs; NAME ---------PDB$SEED PDB1 PDB_SS


OPEN_MODE OPEN_TIME ---------- ------------------------READ ONLY 27-MAR-13 AM MOUNTED MOUNTED




Able to modify initialization parameter for a given PDB… SELECT NAME FROM V$PARAMETER WHERE



NAME LIKE 'optim%‘;

= 'TRUE' (without condition – can set 147 parameters out of 357) (There were 341 parameters in 11gR2)

NAME ---------------------------------------optimizer_adaptive_reporting_only optimizer_capture_sql_plan_baselines optimizer_dynamic_sampling optimizer_features_enable optimizer_index_caching optimizer_index_cost_adj optimizer_mode optimizer_use_invisible_indexes optimizer_use_pending_statistics optimizer_use_sql_plan_baselines 10 rows selected.

Key ones modifiable: cursor_sharing, open_cursors, result_cache_mode, sort_area_size Key ones NOT modifiable: shared_pool_size, db_cache_size, memory_target, pga…


Set PDB Resource Plans … • Keep runaway PDBs from affecting other PDBs • Allocate appropriate resource plans (between/within PDBs) • Set min/max CPU / I/O / Parallelism / (Future: Memory / Network / I/O on non-Exadata) alter system set RESOURCE_LIMIT = TRUE_CONTAINER = ALL (dynamically enable resource limits for all containers) alter system set RESOURCE_LIMIT = TRUE_CONTAINER = CURRENT (dynamically enable resource limits for the root)


Set PDB Resource Plans … • If 4 PDBs have 3 shares each, there are 12 shares total and each has 3/12 or 1/4th of the CPU resources. • If 2 PDBs have 3 shares & 2 PDBs have 1 share, then the ones with 3 shares have 3/8ths of the CPU resources and are 3x more likely to queue parallel queries than the ones that have 1 share. • CPU utilization_limit and parallel_server_limit percents also can be set. BEGIN DBMS_RESOURCE_MANAGER.CREATE_CDB_PLAN_DIRECTIVE( plan => 'newcdb_plan', pluggable_database => ‘pdb1', shares => 3, utilization_limit => 70, parallel_server_limit => 70); END; / 56

Monitoring the 12c Database Using Enterprise Manager CC

Create and Manage 12c DB with CC (Use 12c Cloud Control – OEM)


Create and Manage 12c DB with CC (Use 12c Cloud Control – OEM)


Other 12c Features … • Database Instance Smart Flash Cache Support for Multiple Devices (can access/combine) without the overhead of the local volume manager. • Supports In-Memory Jobs & In-Memory Temporary Tablespaces • Active Data Guard Security has in-memory table of failed login attempts • Heat Map that tracks modifications of rows (block level), table, partition levels • Automate policy-driven data movement and compression using Heat Map • Move partitions while ONLINE with DML happening / Flex ASM to other storage • Improved query performance against OLAP cubes (especially Exadata) • Automatic extended stats for groups of columns accessed together • DBMS_STATS.GATHER_TABLE_STATS run on a partitioned table when CONCURRENT is set to TRUE will gather stats using multiple jobs concurrently • Online statistics gathered during a bulk load (similar to rebuild index command) • Flashback Data Archive (FDA) can be fully used on HCC tables on Exadata • Enterprise Manager Database Express 12c ships with every database (NICE!) • “Spot ADDM” triggered by high CPU or I/O into AWR Reports • Mask Data At Source for testing & Oracle Masking templates for EBusiness • Oracle Data Redaction (prevents things like SSN from being displayed) 60

Other 12c Features … • • • • • • •

• • •

• •

• •

Full Transportable support & Point-in-time recovery for PDBs TRUNCATE TABLE …CASCADE (truncate child tables too) Data Pump No Logging Option for import No-echo of Encryption Passwords on expdp/impdp commands Sql*Loader Express Mode – no control file! In-Database MapReduce (Big Data) Update strong user authentication using kerberos & Simplified Vault administration Many Windows enhancements (if you must use Windoze) Fast Application Notification (FAN) gets improved with Application Continuity which helps recover incomplete requests without executing more than once. Real-Time Apply (redo) is now default for Data Guard vs. applying archive logs SQL Apply Support for Objects, Collections, XML Type, & SecureFiles LOBs Oracle Spacial is now Oracle Spacial & Graph – Enhancements include routing engine enhancements, caching of index metadata, vector performance, Asian address support (geocoding), raster algebra & analytics, enhance image processing Many ACFS, Oracle Multimedia, Oracle Text & Oracle XML enhancements VARCHAR2(32767) –not default/4K stored inline/>4K out of line(like a LOB) 61

Database as a Service

Key Points from Juan Loaiza’s Presentation • 12c Support for Smart Scans (Exadata only): Enhanced for Large Objects (LOBs) so that Oracle offloads LOB to Smart Scan • Information Lifecycle Management – Can compress based on Aging (ILM). You can also compress based on a Heat Map & use. • Oracle adds Network Resource Management to things like IORM (Exadata Only), DBRM, and Instance Caging (limits the amount of CPU – must enable Resource Manager before it takes effect) • InfiniBand Network stack bypasses the O/S Network Stack. Helps with both RAC clusters and in-memory column stores.

Oracle Strategy & Fusion Middleware

Unbelievable Tools to build GREAT things!

Products have Web/Mobile/Cloud Focus …  Web Based Applications  Security for BYOD (Bring Your Own Device)  Capable of many Mobile form factors  iPad, iPad Mini, Android, other Tablets  Smart Phones, other Phones

 Development tools - Build once &run on any platform  Cloud Ready!

Key Advancements…  Enhancements to BPM (Business Process Management); Enhanced Business Process Composer. Security:  Oracle extends Identity Management to Mobile  BYOD (Bring your own device) – Phone/iPad  Secure containers with SSO (Single Sign On)  Can add additional security (policy based stepped up security) for a device (phone)

Cloud Application Foundation

Oracle makes you GREAT on the Internet!  With Coherence (in-memory data grid), DB pushes to update stale data to application instead of web server polling database. Real time refresh with GoldenGate. WebLogic has deep integration with 12c (Multi-tenant / RAC)  Exalogic often 20x faster than other hardware  Nimbula Director exposes elastic compute and elastic storage for Exalogic (private cloud Infrustructure as a service).  Enterprise Manager manages the Database, App Server, & Exalogic server


Customer Experience (CX) in the Cloud

Use Many of Oracle’s Products in the Cloud • • • • • • • • • •

Thomas Kurian called it “the most important project” Marketing, Sales, Service HCM / Talent Management ERP – Financials, Procurement, Supply Chain… Social Network, Social Marketing, Social Data and Insight Development – Database, Java, BI, Mobile, Tools, Documents, JDeveloper Infrastructure – Compute, Storage, Messaging Cloud Marketplace Many CX (Customer Experience) products I will show only a couple of examples – Will be a MAJOR player

Cloud: Database/WebLogic/Compute/Storage

Talent Management Example

Check your Current Team…

The Oracle Cloud – starting to hear rumbles • If you haven’t seen some of Oracle’s offerings, it may be in your company’s best interest to do so. • Many are leading edge management tools • Many are the best you’ll find • The future in this area is coming at you fast! • These are ready for Mobile • These are ready and in the Cloud • These will help you to create better customer experience, more transparency, better employees, and more efficient operations

Engineered Systems …

Exadata Database Machine



Software in the Silicon… coming soon…

Database Backup Appliance

Get Ready for Big Data!

What is Big Data and Big Data Analytics? • Big Data applied to unstructured data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. • Big Data Analytics is the process of leveraging data that is too large in volume, too broad in variety and too high in velocity to be analyzed using traditional methodologies.


Every Organization Will Use Big Data Big Data includes: Social Media, Sensor Data, Biological, Traffic, RFID Data, Environmental, Aerial, Wireless, Security & Video Data, Retail, Medical, Engineering Systems, Search Data, Photographs, Call Records, CRM/ERP data, etc.


Bigger Data - Data Size Matters…  Worldwide, data is growing rapidly over the years….  2000: 800 Terabytes (1012)  2006: 160 Exabytes (1018)

 2009: 500 Exabytes (just Internet)  2012: 2.7 Zettabytes (1021)  2020: 35 Zettabytes …?  Data generated in ONE day….?

 Twitter: 7 TB  Facebook: > 10 TB

Big data: The next frontier for innovation, competition, and productivity McKinsey Global Institute 2011

2.8 x 1020 bits of Memory Space – John von Neumann (“Computer and the Brain”, Harvard Lecture Notes, Half Century ago)

Data collated from various online sources

IOUG Survey – September 2012

Characteristics of Big Data

Finance Telecom Retail Life Sciences Media Government • • • • • •

Big Data Themes HW & SW technologies for large data volumes Focus on Web 2.0 technologies Database Scale-out Relational & Distributed Data Analytics Distributed File Systems Real Time Analytics

• • • • • •

Big Data Domains Digital Marketing Optimization Data Exploration & Discovery Fraud Detection & Prevention Social Network & Relationship Analysis Machine-generated Data Analytics Data Retention

Big Data Providers

Revolution of Big Data Tools…

Google File System (GFS)

Google’s MapReduce

Google BigTable

Apache / Hadoop World Hadoop File System (HDFS)


Apache Hive (DWHSE) ZooKeeper & Pig (coordination) (Manipulate HDFS)


Hypertable (Baidu uses) Cassandra (Based on DynamoDB [Amazon] and BigTable)

IOUG Survey – September 2012

Why so slow? Big B&R issues / HA Issues / Sprawl • Does have Kerberos & Access Control Lists (ACL) • NO role based authentication, LDAP or Active Directory… no encryption on data in transit between nodes. Many tools also lack security. Also no compression with splits. • Companies DO NOT allow certain data on Hadoop!

NoSQL Databases – over 120 (& Data Stores)

IOUG Survey – September 2012

Do you really need all the CPUs / RAM? 4000 nodes at Yahoo: 16P raw disk  32T RAM  30,000 cores

3 Nodes: 504T + 648T + 648T x 10x compression = 18P raw disk equiv.

Oracle Technologies for – Big Data Rapid Deployment – Ready Now!

Savings is in the CODING time… SQL Pattern Matching: • 650 lines of MapReduce Java code or 12 lines of Oracle pattern matching SQL • Runs 50x Faster on1B row table • 18 node Hadoop vs. 2 socket server • Oracle addressing Big Data Issues • Oracle advancing the Big Data Story • Oracle starting to take the lead!

Many ways to query Unstructured Data • Use MapReduce accessing HDFS • Go to Hadoop directly from SQL with Big Data Connector • Go to Hadoop directly with R programming environment • Go to Hadoop directly through an optimized Hive Connector using standard SQL • Fast Data – complementary to Big Data, but requiring faster answers (see next point) • Analyze unstructured data real time (for example – sensor data): persist data in key values using Oracle NoSQL to Coherence data grid (& build hash tables in memory) – results can be pushed to mobile/tablet

My Oracle Big Data Benefits • • • •

• • • • • • •

It’s actually done and complete unlike others Full Hadoop integration and loader Exadata and Exalytics BI integration & solution Big Data hardware which includes Hadoop HDFS, MapReduce, R programming language (statistics and regressions…etc.), Oracle NoSQL, ACID compliant, Simple key-value pair data model (hashes keys over many servers - major/minor keys & byte arrays) Oracle NoSQL is based on Oracle’s BerkeleyDB (commercial 8 years!) which integrates with HDFS (Hadoop File System) using external tables if you want, Oracle Loader for Hadoop (OLH) takes the analyzed data from MapReduce & puts into 11g Database as last step (15T/hr) Concurrency is flexible at any level & it’s horizontally scalable Oracle knows clustering & HA well (no single point of failure!) Oracle Admin tools are great as are Oracle professionals BerkeleyDB is the worlds most widely used DB toolkit >200M deployed copies Oracle can be REAL TIME fast, not batch processing slow

IOUG Survey – September 2012

Final Thoughts… Catching your Wave!

“Things may come to those who wait, but only the things left by those who hustle.” — Abraham Lincoln 101

Build a Successful Team • Use the Technology that Creates the Future! • Make each team member feel Together responsible for the success of Everyone the project • Make each team member accountable Achieves • Share Success with all team members More • Attributes of a Successful Team: Respect Common Goal Honesty Understanding

Loyalty Communication Unselfishness Positive Attitude

Trust Flexibility Support Leadership

#1 Selling Oracle Database Book on Amazon for over a year!

• Also available at other places like Barnes & Noble…etc. • Available on the Kindle and other book readers • Why is it #1?


Rolta– Your Partner in Success…. Accomplished in Oracle! 2012 Oracle Partner of the Year (9 Titans/Excellence Awards)

Prior Years: 2002, 2004*, 2007*, 2008, 2010, 2011 *Won 2 Awards


Copyright Information

• Neither Rolta nor the author guarantee this document to be error-free. Please provide comments/questions to [email protected] I am always looking to improve! • Rich Niemiec/Rolta ©2013. This document cannot be reproduced without expressed written consent from Rich Niemiec or an officer of Rolta, but may be reproduced or copied for presentation/conference use. • References include Rich Niemiec’s Exadata Presentation & Oracle11g Database Performance Tuning Tips & Techniques book,,,,,, Tech Crunch,, The Matrix movie, Information Week, Gartner, Computerworld, & Oracle OpenWorld Contact Information Rich Niemiec: [email protected] 105

Suggest Documents