Get more for less with POWER9

Who doesn’t expect more from a new product, let alone if it is the next generation of that product. Whether it is the “All New 2019 Brand Model” Car/Truck/SUV or, being a Macbook fan, the latest Macbook Pro and IOS (just keep the magnetic power cord)?

We want and expect more.  IBM POWER8 delivered more.  More performance, built-in virtualization on the Enterprise systems, mobile capacity on Enterprise systems to share capacity between like servers, a more robust reliability and availability subsystem as well as improved serviceability features from the low-end to high-end.  Yes, all while dramatically improving performance over previous generations.

How do you improve upon something that is already really good – I’m purposefully avoiding using the word “great” as it’ll make me sound like a sycophant who would accept a rock with a Power badge and call it “great”.  No, I am talking about actual, verifiable features and capabilities delivering real value to businesses.

Since the POWER9 Enterprise systems have yet to be announced and I only know what I know through my secret sources, I’ll limit my statements to just the currently available POWER9 Scale-out systems.

  • POWER8 Scale-out now include PowerVM Enterprise Edition licenses
  • Workload Optimized Frequency now delivers frequencies up to 20% higher over the nominal or marketed clock frequency
  • PCIe4 slots to support higher speed and bandwidth adapters
  • From 2 to 4X greater memory capacity on most systems
  • New “bootable” internal NVMe support
  • Enhanced vTPM for improved Secure Boot & Trusted Remote Attestation
  • SR-IOV improvements
  • CAPI 2.0 and OpenCAPI capability – the latter, though I’m unaware of any supported features is exciting in what it is designed and capable of doing.
  • Improved price points using IS memory

The servers also shed some legacy features that were getting long in the tooth.

  • Internal DVD players – in lieu of USB drive support
  • S924 with 18 drive backplane no longer includes add-on 8 x 1.8″ SSD slots

As consumers, we expect more from our next generation purchases, the same holds true with POWER9.  Get more capability, features and performance for less money.

Contact me if you would like a quote to upgrade to POWER9, running x86 workloads and would like to hear how you may be able to do far more with less as well as learn how my services team will ease any concerns or burdens you may have to remain on your aging and likely, higher cost servers by upgrading to POWER9.

 

 

HPE Memory RAS; Excels at being Average

A recent HPE blog stating memory errors are not the end of the world was meant to reassure clients to accept regular & unplanned platform disruptions. In reality what HPE ends up saying is there is little difference with the other commercial Intel server vendors and their own as they all range from below average to average at best.  Just so happens, this specific blog was written by the HPE Server Memory Product Manager who might be forgiven for painting this dire picture only to then present the best alternative; Yes, HPE SmartMemory. *shock*

To HPE’s credit, they have quite a bit of documentation discussing server Reliability, Availability & Serviceability (RAS) features, specifically about their memory subsystem. They are fairly forthright about their strengths and weaknesses of the entry, mid-range and high-end servers. Sadly though, at every level there message is full of qualifiers, limitations and restrictions which require the consumer to wade through and understand all of the requirements.

An HPE whitepaper from February 2016 titled “How memory RAS technologies can enhance the uptime of HPE ProLiant servers” paints a starkly different picture than the blog. The whitepaper states on page 2 in the 2nd paragraph of the introductory summary section “It might surprise you to know that memory device failures are far and away the most frequent type of failure for scale-up servers.“, up to 2X the rate of the next closest part when the memory is configured with a memory protection configuration not better than SDDC+1.  There is another graph that immediately follows this one showing when memory is configured using a protection scheme of DDDC+1 it decreases memory failures by 85%. That is pretty good, yet the value of 85% used in the whitepaper does not jive with the blog which states when using HPE SmartMemory, memory errors are reduced 99.9998% (yes, that is 5 x 9’s).  I call out this discrepancy because right after claiming 5×9’s they point the reader to the very whitepaper I am citing here.

This blog is not meant to define all of the different terms used, you will have to do some of that work. However, it is worth noting that all of the wonderful features touted in the HPE blog, in the HPE whitepaper and may other sources, the consumer will find there are many qualifiers, limitations and restrictions.  Such as.

  1. E5 chips do not support DDDC or DDDC+1
  2. E5 chips only support SDDC or SDDC + rank sparing
  3. Memory sparing consumes (wastes) either 25% or 12.5% of installed capacity
  4. EX chips support SDDC, SDDC + rank sparing, SDDC+1 and DDDC+1
  5. But, DDDC+1 is ONLY using x4 DIMMs and not x8 DIMMs
  6. DDDC+1 requires x4 DIMMs
  7. Advanced ECC is an option used across 2 DIMMs but can only fill 2 of 3 DIMM slots per channel
  8. Memory Mirroring is the most expensive in terms of cost & performance
  9. Memory Mirroring wastes 1/2 of the DIMM slots for the mirror – not usable
  10. Memory Mirroring only allows you to fill 2 of 3 DIMM slots per channel
  11. Memory Mirroring has a potential performance impact for WRITES

Let’s be clear, consumers have 3 primary options to configure memory on any of the Intel servers.

  1. Performance mode which delivers the highest bandwidth with the lowest reliability features. Not an ideal option for in-memory workloads despite the appeal to maximize the bandwidth.
  2. Lockstep Mode meant to strike a balance of slightly decreased bandwidth (can be up to 50%¹) while increasing reliability over performance mode.  Probably the most common option selected.
  3. Memory Mirroring Mode delivers the highest reliability at the expense of wasting 1/2 the memory capacity as well has a slight performance decrease (remember, this mode can only use 2 of the 3 DIMM slots per channel so you already lose 1/3 of the memory capacity).

What is HPE’s response to clients who want increased memory RAS; especially for those in-memory workloads such as SAP HANA?  Buy more expensive E7 based servers to receive slightly higher memory RAS capability OR install more memory on the already RAS-deficient E5 based servers to increase its capacity to utilize memory spare ranks.

Net-net is that HPE is pushing proprietary memory that is far more expensive than the industry standard memory traditionally used with Intel servers that has earned it the reputation as a low-cost leader relative to traditional Enterprise-class systems like IBM POWER or SPARC. That is evident in the SAP HANA space as the systems required to support these in-memory workloads tend to require more capacity; more cores to achieve the core to memory ratio’s and more sockets to achieve more memory capacity with its associated bandwidth.  Yet, HPE remains true to form as regardless of the path taken, it comes with increased cost, limitations, restrictions and qualifications.

Contrast the never-ending “Compromise” Intel options, IBM’s POWER8 servers use Enterprise memory that is “No Compromise”.  This buffered memory offers spare  capacity, spare lanes, memory instruction replay, chipkill and an incredible DDDC +1+1 allowing for multiple DRAM failures before experiencing a system event.  The design point for POWER8 memory is simple: Not to fail!

AS you consider platforms to host in-memory workloads such as SAP HANA, DB2 BLU, consider which basket you want to place all of your eggs into.  A platform with a memory subsystem designed not to fail or a platform with unending limitations as listed above. The choice should be easy – Choose POWER!

 

SAP HANA – could I have extra complexity please?

Just returned from IBM’s Systems Technical University conference held in Orlando having delivered presentations on 4 different topics.

  1. Benefits of SAP HANA on POWER vs Intel
  2. Why IBM POWER systems are datacenter leaders
  3. Only platform that controls Software Licensing
  4. Why DB2 beats Oracle on POWER (implied that it beats Intel).

With the SAP Sapphire conference last week in Orlando, there was a slew of announcements.  Quick reminder for the uninitiated with SAP HANA, that it is ONLY supported on Intel and POWER based systems running one OS; SUSE or RedHat Linux. With that, IBM POWER continues to deliver the best value.

What is the value offered with the POWER stack? Flexibility! It really is that simple.  If I had a mic on the plane as I write this, I would drop it. Conversely, what is the value offered going with an Intel stack? Compromise!

Some of the flexibility offered thru IBM POWER systems are: Scale-up, scale-out, complete virtualization, grow, shrink, move, perform concurrent maintenance, mix workloads: existing ECC workloads on AIX or IBM i with new HANA running Linux all on the same server.  All of this runs using the most resilient HANA platform available.

Why do I label Intel systems as “Compromise” solutions? It isn’t a competitive shot nor FUD.  Listen, as an Client Executive and Executive Architect for an Channel Reseller, I am able to offer my clients solutions from multiple vendors that include IBM POWER and Intel based systems manufacturers.  I’ve made the conscious decision though to promote IBM POWER over Intel.  Why? Because I not only believe in the capabilities of the platform but also having worked with some of the largest companies in the world, I regularly hear and see the impact running Enterprise workloads on Intel based servers has on the business.

If you read my previous blog, I mention a client who just recently moved their Oracle workloads from POWER to Intel.  Within months, they’ve had to buy over $5M in new licenses going from a simple standalone and a few 2-node clusters (all on the same servers) to an 8-node VMware based Oracle RAC cluster.  This environment is having daily stability issues significantly impacting their business.  Yes, their decision to standardize on a single platform has introduced complexity to the business costing them money, resources (exhausted & not having the proper skills to manage the complexity) that impacts their end-users.

The “Compromise” I mention to host SAP HANA on Intel is that everything has to be an asterisk by it – in other words a limitation or restriction – everything requires follow-up questions and research to ensure what the business wants to do, can be done. Here are some examples.
1) VMware vSphere 5.5 initially supported 1 VM per system which has now been increased to 4 VM’s, but with many qualifications.
a) Restricted to 2 & 4 socket Intel servers
1) VM’s are limited to a socket
2) 2 socket server ONLY supports 2 VM’s, 4 socket would be 4 x 1 sockets each
b) Only E5_v2, E5_v3, E7_v2 and E7_v3 chips are supported – NO Broadwell
c) Want to redeploy capacity for other? Appliances certified only for SoH or S4H
uses cannot be used for other purposes such as BW
d) Did I mention, those VM’s are also limited to 64 vCPU and 1 TB of memory each
e) If a VM needs more memory than what is attached to that socket? No problem, you have to add an additional socket and all of its memory – no sharing!
2) VMware vSphere 6.0 just recently went from 1 to 16 VM’s per system.
a) VM’s are still limited to a socket or 1/2 socket.
b) 1/2 socket isn’t as amazing as it sounds.  Since vSphere supports 2, 4 & 8 socket servers, there can be 16 x 1/2 socket VM’s.
c) What there cannot be, is any combination of VM’s >1 socket with 1/2 socket assigned. In other words, a VM cannot have 1.5 or 3.5 sockets. Any VM resource requirement above 1 socket requires the addition of an entire socket.  1.5 sockets would be 2 sockets.
d) Multi-node setups are NOT permitted …. at all!
e) VM’s larger than 2 sockets cannot use Ivy Bridge based systems, only Haswell or Broadwell chips – but ONLY on 4-socket servers.  Oh my gosh, this is making my head hurt!
f) If using an 8-socket system, it only supports a single production VM using Haswell ONLY processors.  NOT Ivy Bridge and NOT Broadwell!
g) VM’s are limited to 128 vCPU and 4 TB of memory
3) VMware vSphere 6.5 with SAP HANA SPS 12 only supports Intel Broadwell based systems. What if your HANA Appliance is based on Ivy-Bridge or Haswell processor technology? “Where is that Intel rep’s business card? Guess I’ll have to buy another one since I can’t upgrade these”
a) VM’s using >4 sockets are currently NOT supported with these Broadwell chips
b) Now, it gets better. I hope you are writing this down – For 2 OR 8 socket systems, the maximum VM size is 2 sockets.  Only a 4 socket system supports 1 VM with 4 sockets.
c) Same 1/2 socket restrictions as vSphere 6.0.
d) Servers with >8 sockets do NOT permit the use of VMware
e) If your VM requirements exceed 128 vCPU and 4 TB of memory, you must move it to a bare-metal system ….. Call me – I’ll put you on a POWER system where you can scale-up, scale-out without of this mess

Contrast all of these VMware + Intel limitations, restrictions, liabilities, qualification or simply said “Compromise” systems to the IBM Power System.

POWER8 servers run the POWER Hypervisor called PowerVM.  This Hypervisor and its suite of features deliver flexibility allowing all physical, all virtual and a combination of physical & virtual resource usage on each system. Even where there are VM limits such as 4 on the low-end system, that 4 could really be 423 VM’s.  I’m making a theoretical statement here to prove the point. Let’s use a 2 socket 24 core S824 server.  3 VM’s, each with 1 core (yes, I said core) for production usage and the 4th VM’s is really a Shared Processor Pool with 21 cores.  Those 21 cores support up to 20 VM’s per core or 420 VM’s. Any non-production use is permitted.

Each PowerVM VM supports up to 16 TB of memory and 144 cores.  VM size above 108 cores requires the use of SMT4 whereas <=108 cores permit SMT8.  Thus, 144 cores with SMT4 is 576 vCPU’s or 4.5X what Intel can do with 4X the memory footprint.  By the way, that 108 core VM would support 864 vCPU’s – just saying!  Note: I need to verify as the largest SMT8 VM may be 96 cores with only 768 vCPU.

Not only can we allocate physical cores to VM’s and NOT limited to 1/2 or full socket increments like Intel, but POWER systems granularity allows for adjustments at the vCPU level.

PowerVM supports scale-out and scale-up.  Then again, if you have heard or read about the Pfizer story for scale-out BW, you might rethink a literal scale-out approach. Read IBM’s Alfred Freudenberger’s blog on this subject at https://saponpower.wordpress.com/2016/05/26/update-sap-hana-support-for-vmware-ibm-power-systems-and-new-customer-testimonials/

While on the subject of BWoH/B4H, PowerVM supports 6 TB per VM whereas the vSphere 6.0 supports is 3 TB and the limitations increase from here.

Do you see why I choose to promote IBM Power vs Intel? When I walk into a client, the most valuable item I bring with me is my credibility.  HANA on Intel is a constant train wreck with constant changes & gotcha’s. Clients currently with HANA on Intel solutions or better yet, running ECC on Intel have options.  That option is to move to a HANA 2.0 environment using SUSE 12 or RedHat v7 Linux on POWER servers. Each server will host multiple VM’s with greater resiliency providing the business the flexibility desired from the critical business system that likely touches every part of the business.

Oracle is a mess & customers pay the price!

Chaos that is Oracle

Clients are rapidly adopting open source technologies in support of purpose-built applications while also shifting portions of on-premises workloads to major Cloud providers like Amazon’s AWS, Microsoft’s Azure and IBM’s SoftLayer.  These changes are sending Oracle’s licensing revenue into the tank forcing them to re-tool … I’m being kind saying it this way.

What do we see  Oracle doing these days?

  • Aggressively going after VMware environments who use Oracle Enterprise products for licensing infractions
  • Pushing each of their clients toward Oracle’s public cloud
  • Drastically changing how Oracle is licensed for Authorized Cloud Environments using Intel servers
  • Latest evidence indicates they are set to abandon Solaris and SPARC technology
  • On-going staff layoffs as they shift resources, priorities & funding from on-premises to cloud initiatives

VMware environments

I’ve previously discussed for running Oracle on Intel (vs IBM POWER), Intel & VMware have an Oracle problem. This was acknowledged by Chad Sakac, Dell EMC’s President Converged Division in his August 17, 2016 blog in what really amounted to an Open Letter to King Larry Ellison, himself. I doubt most businesses using Oracle with VMware & Intel servers fully understand the financial implications this has to their business.  Allow me to paraphrase the essence of the note “Larry, take your boot off the necks of our people”.

This is a very contentious topic so I’ll not take a position but will try to briefly explain both sides.  Oracle’s position is simple even though it is very complex.  Oracle does not recognize VMware as an approved partitioning (view it as soft partitioning) method to limit Oracle licensing. As such, clients running Oracle in a VMware environment, regardless of how little or much is used, must properly license it for every Intel server under that clients Enterprise (assume vSphere 6+).  They really do go beyond a rational argument IMHO. Since Oracle owns the software and authored the rules they use these subtleties to lean on clients extracting massive profits despite what the contract may say. An example that comes to mind is how Oracle suddenly changed licensing configurations for Oracle Standard Edition and Standard Edition One. They sunset both of these products as of December 31, 2015 replacing both with Standard Edition 2. What can only be described as screwing clients, they halved the number of sockets allowed on a server or in a RAC cluster, limited the number of cpu threads per DB instance while doubling the number of minimum Named User Plus (NUPs). On behalf of Larry, he apologizes to any 4 socket Oracle Standard Edition users but if you don’t convert to a 2 socket configuration (2 sockets for 1 server or 1 socket for 2 servers using RAC) then be prepared to license the server using the Oracle Enterprise Edition licensing model.

The Intel server vendors and VMware have a different interpretation on how Oracle should be licensed.  I’ll boil their position down to using host or cpu affinity rules.  House of Bricks published a paper that does a good job trying to defend Intel+VMware’s licensing position. In their effort, they do show how fragile of ground they sit on with its approach  highlighting the risks businesses take if they hitch their wagons to HoB, VMware & at least Dell’s recommenations.

This picture, which I believe House of Bricks gets the credit for creating captures the Oracle licensing model for Intel+VMware environments quite well. When you pull your car into a parking garage – you expect to pay for 1 spot yet Oracle says you must pay for every one as you could technically park in any of them. VMware asserts you should only pay for a single floor at most because your vehicle may not be a compact car, may not have the clearance for all levels, there are reserved & handicapped spots which you can’t use. You get the idea.

oracle_parking_garage

It simply a disaster for any business to run Oracle on Intel servers. Oracle wins if you do not virtualize, running each on standalone servers.  Oracle wins if you use VMware, regardless of how little or much you actually us.  Be prepared to pay or to litigate!

Oracle and the “Cloud”

This topic is more difficult to provide sources so I’ll just stick to anecdotal evidence. Take it or leave it. At contract renewal, adding products to contracts or new projects like migrating JD Edwards “World” to “Enterprise One” or a new Oracle EBS deployment would subject a business to an offer like this.  “Listen Bob, you can buy 1000 licenses of XYZ for $10M or you can buy 750 licenses of XYZ for $6M, buy 400 Cloud units for $3M and we will generously throw in 250 licenses …. you’ll still have to pay support of course. You won’t get a better deal Bob, act now!”.  Yes, Oracle is willing to take a hit for the on-premises license revenue while bolstering their cloud sales by simply shuffling the Titanic deck chairs. These clients, for the most part are not interested in the Oracle cloud and will never use it other than to get a better deal during negotiations. Oracle then reports to Wall Street they are having tremendous cloud growth. Just google “oracle cloud fake bookings” to read plenty of evidence to support this.

Licensing in the Cloud

Leave it to Oracle Marketing to find a way to get even deeper into clients wallets – congratulations they’ve found a new way in the “Cloud”.  Oracle charges at least 2X more with Oracle licenses on Intel servers that run in Authorized Cloud Environments (ACE). You do not license Oracle in the cloud using the on-premises licensing factor table.  The more VM’s running in a ACE,  the more you will pay vs an on-premises deployment. To properly license an on-premises Intel server (remember, it is always an underlying proof that Oracle on POWER servers is the best solution) regardless if virtualization is used, assuming a 40 core server, would equal 20 Oracle Licenses (Intel licensing factor for Intel servers is 0.5 per core). Assume 1 VMware server, ignoring it is probably part of a larger vSphere cluster.  Once licensed, clients using VMware could theorectially run Oracle as many VM’s as desired or supported by that server. Over-provision the hell out of it – doesn’t matter. That same workload in an ACE, you pay for what amounts to every core.  Remember, if the core resides on-premises it is 1 Oracle License for every 2 Intel cores but in a ACE it is 1 OL for 1 core.

AWS
Putting your Oracle workload in the cloud?  Oracle license rules stipulate if running in AWS, it labels as vCPU’s both the physical core and the hyperthread. Thus, 2 vCPU = 1 Oracle License (OL). Using the same 40 core Intel server mentioned above, with hyperthreading it would be 80 threads or 80 vCPU.  Using Oracle’s new Cloud licensing guidelines, that would be 40 OL.  If this same server was on-premises, those 40 physical cores (regardless of threads) would be 20 OL ….. do you see it?  The licensing is double!!!   If your AWS vCPU consumption is less vs the on-premises consumption you may be ok. As soon as your consumption goes above that point – well, break out your checkbook.  Let your imagination run wild thinking of the scenarios where you will pay for more licenses in the cloud vs on-prem.

Azure
Since Azure does not use hyperthreading, 1 vCPU = 1 core.  The licensing method for ACE’s for Azure or any other ACE if hyperthreading is not used, 1 vCPU = 1 OL.  If a workload requires 4 vCPU, it requires 4 OL vs the 2 OL if it was on-premises.

Three excellent references to review. The first is Oracle’s Cloud licensing document. The second link is an article by Silicon Angle giving their take of this change and the last link is for a blog by Tim Hall, a DBA and Oracle ACE Director sharing his concerns. Just search for this topic starting from January 2017 and read until you fall asleep.

Oracle
Oracle offers their own cloud and as you might imagine, they do everything they can to favor their own cloud thru licensing, contract negotiations and other means.   From SaaS, IaaS and PaaS their marketing machine says they are second to none whether the competition is SalesForce, Workday, AWS, Azure or any other.  Of course, analysts, media, the internet nor Oracle earnings reports show they are having any meaningful success – to the degree they claim.

Most recently, Oracle gained attention for updating how clients can license Oracle products in ACE’s as mentioned above.  As you might imagine, Oracle licenses its products slightly differently than in competitors clouds but they still penalize Intel and even SPARC clients, who they’ll try to migrate into the cloud running Intel (since it appears Oracle is abandoning SPARC).  The Oracle Cloud offers clients access to its products on a hourly or monthly in a metered and non-metered format on up to 4 different levels of software. Focusing on Oracle DB, the general tiers are Standard, Enterprise, High-Performance and Extreme-Performance Packages. Think of it like Oracle Standard Edition, Enterprise Edition, EE+tools, EE+RAC+tools.  Oracle also defines the hardware tier as “Compute Shapes“. The three tiers are General Purpose, High-Memory or Dedicated compute

Comparing the cost of an on-premises perpetual license for Oracle Enterprise  vs a non-metered monthly license for the Enterprise Tier means they both use Oracle Enterprise Edition Database. Remember a perpetual license is a one-time purchase, $47,500 for EE DB list price plus 22% per year annual maintenance.  The Enterprise tier using a High-memory compute shape in the Oracle cloud is $2325 per month.  This compute shape consists of 1 OCPU (Oracle CPU) or 2 vCPU (2 threads / 1 core).  Yes, just like AWS and Azure, Intel licensing is at best 1.0 vs 0.5 for on-premises licensing per core. Depending how a server might be over-provisioned as well as the fact an on-premises server would be fully licensed with 1/2 of its installed cores there are a couple of ways clients will vastly overpay for Oracle products in any cloud.

The break-even point for a perpetual license + support vs a non-metered Enterprise using High-memory compute shape is 30 months.

  • Perpetual license
    • 1 x Oracle EE DB license = $47,500
    • 22% annual maintenance = $10,450
    • 3 year cost: $78,850
  • Oracle Cloud – non-metered Enterprise using High-Memory shape
    • 1 x OCPU for Enterprise Package for High-Compute = $2325/mo
    • 1 year cloud cost = $27,900
    • 36 month cost: $83,700
  • Cross-over point is at 30 months
    • $79,050 is the 30 month cost in the Cloud
  • An Oracle Cloud license becomes significantly more expensive after this.
    • year 4 for a perpetual license would be $10,470
    • 12 months in year 4 for the Cloud license would be $27,900
    • Annual cost increase for a single cloud license over the perpetual license = $17,430
  • Please make your checks payable to “Larry Ellison”

Oracle revenue’s continue to decline as clients move to purpose-built NoSQL solutions such as MongoDB, RedisLabs, Neo4j, OrientDB, Couchbase as well as SQL based solutions from MariaDB, PostgreSQL (I like EnterpriseDB) even DB2 is a far better value.  Oracle’s idea isn’t to re-tool by innovating, listening to clients to move with the market. No, they get out their big stick – follow the classic mistake so many great clients have done before them which is not evolve while pushing clients until something breaks.   Yes, Boot Hill is full of dead technology companies who failed to innovate and adapt. This is why Oracle is in complete chaos.  Clients beware – you are on their radar!

 

 

HPE; there you go again! Part 1

Updated Sept 05, 2016: Split the blog into 2 parts (Part 2). Fixed several typo’s and sentence structure problems. Updated the description of the Superdome X blades to indicate they are 2 socket blades while using Intel E7 chips.

It must be the season as I find myself focused a bit on HPE.  Maybe it’s because they seem to be looking for their identity as they now consider selling their software business.  This time though, it is self-inflicted as there has been a series of conflicting marketing actions. From what they say in their recent HPE RAS whitepaper about the poor Intel server memory reliability stating in the introductory section that memory is far and away the highest source of component failures in a system.  Shortly after that RAS paper is released, they post a blog written by the HPE Server Memory Product Manager stating “Memory Errors aren’t the end of the World”.  Tell that to the SAP HANA and Oracle Database customers, the latter which I will be discussing in this blog.

HPE dares to step into the lion’s den on a topic with which it has little standing to imply it is an authority how Oracle Enterprise software products are licensing in IBM Power servers.  As a matter of fact, thanks to the President of VCE, Chad Sakac for acknowledging that VMware has a Oracle problem.  On August 17th, Chad penned what amounts to an open letter to Larry & Oracle begging them …. No, demanding that Larry leave his people alone.  And, by “his people”, I mean customers who run Oracle Enterprise Software Products licensed by the core on Intel servers using VMware.

Enter HPE with a recent blog by Jeff Kyle, Director of Mission Critical Solutions.  He doesn’t distinguish if he is in a product development, marketing or sales role.  I would bet he it is the latter two as I do not think a product developer would put themselves out like Jeff just did.  What he did is what all Intel marketing teams and sellers have done from the beginning of compute time when the first customer thought of running Oracle on a server that wasn’t “Big Iron”.

Jeff sets up a straw man stating “software licensing and support being one of the top cost items in any data center” followed by the obligatory claim that moving it to an “advanced” yet “industry-standard x86 servers” will deliver the ROI to achieve the goals of every customer while coming damn close to solving world hunger.

Next is where he enters the world of FUD while also stepping into the land of make-believe.  Yes, Jeff is talking about IBM Power technology as if it is treated by Oracle for licensing purposes the same as an Intel server, which it is not.  You will have to judge if he did this on purpose or simply out of ignorance.  He does throw the UNIX platforms a bone by saying they have “excellent stability and performance” but stops there as only to claim they cost more than their Industry standard x86 server counterparts.

He goes on to state UNIX servers <Hold Please> Attention: For purposes of this discussion, let’s go with the definition that future UNIX references = AIX and RISC references = IBM POWER unless otherwise stated.  As I was saying, Jeff next claims AIX & POWER are not well positioned for forward-looking Cloud deployments continuing his diminutive descriptors suggesting proper clients wouldn’t want to work with “proprietary RISC chips like IBM Power”. But, the granddaddy of all of his statements and the one that is complete disingenuous is:  <low monotone voice> “The Oracle license charge per CPU core for IBM Power is twice (2X) the amount charged for Intel x86 servers” </low monotone voice>.

In his next paragraph, he uses some sleight of hand by altering the presentation of the traditional full List Price cost for Oracle RAC that is associated with Oracle Enterprise Edition Database.  Oracle EE DB is $47,500 per license + 22% maintenance per year, starting with year 1.  Oracle RAC for Oracle EE EB is $23,000 per license + 22% maintenance per year, starting with year 1.  If you have Oracle RAC then you would by definition also have a corresponding Oracle EE DB Licenses.  The author uses a price of $11,500 per x86 CPU core and although by doing he isn’t wrong per se, I just do not like that he does not disclose the full license cost of #23,000 up front as it looks like he is trying to minimize the cost of Oracle on x86.

A quick licensing review. Oracle has an Oracle License Factor Table for different platforms to determine how to license its products that are licensed by core. Most modern Intel servers are 0.5 per License.  IBM Power is 1.0 per License.  HP Itanium 95XX chip based servers, so you know also has a license factor of 1.0.  Oracle, since they own the table and the software in question can manipulate it to favor their own platforms as they do, especially with the SPARC servers.  It ranges from 0.25 to 0.75 while Oracle’s Intel servers are consistent with the other Intel servers at 0.5.  Let’s exclude the Oracle Intel servers for purposes of what I am talking about here for reason I said, which is they manipulate the situation to favor themselves. All other Intel servers “MUST” license ALL cores in the server with very, very limited exceptions “times” the licensing factor which is 0.5.  Thus, a 2 x 18 core socket would have 36 cores. Ex: 2s x 18c = 36c x 0.5 License Factor = 18 Licenses.  That would equal 18 Oracle Licenses for whatever the product being used.

What Jeff does next was a bit surprising to me.  He suggests customers not bother with 1 & 2 socket Intel “Scale-out” servers which generally rely on Intel E5 aka EP chipsets.  By the way, Oracle with their Exadata & Oracle Database Appliances now ONLY use 2 socket servers with the E5 processors; let that sink in as to why.  The EP chips tend to have features that on paper have less performance such as less memory bandwidth & fewer cores while other features such as clock frequency are higher, a feature that is good for Oracle DB.   These chips also have lower RAS capabilities, such as missing the MCA (Machine Check Architecture) feature only found in the E7 chips.  He instead suggests clients look at “scale-up” servers which commonly classified as 4 sockets and larger systems.  This is where I need to clarify a few things.  The HP Superdome X system, although it scales to 16 sockets, does so using 2 socket blades.  Each socket uses the Intel E7 processor, which given this is a 2 socket blade is counter to what I described at the beginning of this paragraph where 1 & 2 socket servers used E5 processors.  The design of the HP SD-X is meant to scale from 1 blade to 8 blades or 2 to 16 sockets which requires the E7 processor.

With the latest Intel Broadwell EX or E7 chipsets, the number of cores available for the HD SD-X range from 4 to 24 cores per socket.  Configuring a blades with the 24 core E7_v4 (v4 indicates Broadwell) equals 48 cores or 24 Oracle Licenses.  Reference the discussion two paragraphs above.  His assertion is by moving to a larger server you get a larger memory capacity for those “in-memory compute models” and it is this combination that will dramatically improve your database performance while lowering your overall Total Cost of Ownership (TCO).

He uses a customer success story for Pella (windows) who avoided $200,000 in Oracle licensing fees after moving off a UNIX (not AIX in this case) platform to 2 x HPE Superdome X servers running Linux.  This HPE customer case study says the UNIX platform which Pella moved off 9 years ago was actually a HP Superdome with Intel Itanium processors server running HP-UX.  Did you get this? HP migrated off their own 9-year-old server while implying it might be from a competitor – maybe even AIX on Power since it was referenced earlier in the story.  That circa 2006 era Itanium may have used a Montecito class processor. All of the early models before Tukwila were pigs, in my estimation.  A lot of bluff and hyperbole but rarely delivering on the claims.  That era of SD would have also used an Oracle license factor of 0.5 as Oracle didn’t change it until 2010 and only on the newer 95xx series chips.  Older systems were grandfathered and as I recall as long as they didn’t add new licenses they would remain under the 0.5 license model.  I would expect a 2014/2015 era Intel processor would outperform a 2006 era chip, although if it would have been against a POWER5 1.9 or 2.2 GHz chip I might call it 50-50 J .

We have to spend some time discussing HP server technology as Jeff is doing some major league sleight of hand as the Superdome X server supports a special hardware partitioning capability (more details below) that DOES allow for reduced licensing that IS NOT available on non-Superdome x86 servers or from most other Intel vendors unless they also have an 8 socket or larger system like SGI – oh wait, HP just bought them.  Huh, wonder why they did this if the HPE Superdome X is so good.

Jeff then mentions an IDC research study; big deal, here is a note from my Pastor that says the HPE Superdome is not very good; who are you going to believe?

Moving the rest of the blog to Part 2.

 

 

Intel Vendors & VMware have a Oracle Problem

Houston, we have a problem!

Intel server vendors Dell, VCE, HPE, Cisco, Lenovo, Fujitsu, Hitachi, Huawei, white box vendor Super Micro and any other server vendor using Intel chipsets have a problem if their customers use VMware to host Oracle Enterprise products (ie Database).

What’s “their” problem

In a nutshell, Oracle’s position is that customers running their Enterprise products like Oracle Enterprise Edition Database,  licensed by core (all cores in the server x 0.5) in a VMware environment must license every core on every server in which that Oracle workload could ever potentially reside managed by vCenter.  Server vendors, VMware, consultants and so on have a vested interest for Oracle to not do this because this Oracle tax is an extreme approach with their licensing terms that concern customers if they are running Oracle on Intel servers for fear Oracle will initiate a LMS audit leading to a substantial license settlement.

Quick Background

In my previous blog I wrote about “Intel; the Great Charade” where I discuss  each new generation of Intel processors having less performance per core than the previous generation.  As you read this and the ones referenced in this article (VCE & HoB) keep this ‘per core’ licensing approach in mind as this topic is central to how Oracle (typically) licenses its enterprise products.  For example, if a clients current server models are Sandy Bridge or Ivy Bridge era servers and plan to upgrade to the latest generation Broadwell you actually decrease the per core performance while increasing the number of cores per socket (if staying with the same SKU). Meaning 12 Ivy Bridge cores requires a little over 13 rounded up to 14 Broadwell cores to deliver equal performance. You don’t upgrade to get equal performance so you now have to move to a 16 or maybe 18 core SKU to gain additional socket performance or go with a higher frequency & lower core per socket SKU to obtain receive more performance per core….but now do you have enough overall performance?  To summarize my previous blog: It takes more cores from Haswell or Broadwell to equal the performance of the previous generation chips.  Since this increase in performance is at the socket and not with the core or thread (where most databases almost almost always prefer a stronger core vs more cores let alone weaker cores in a socket).  Since Oracle’s license calculation on Intel is to license all of cores in the server x 0.5 you may end up buying 1 or 2 extra Oracle licenses for every upgraded Intel server running VMware so be sure to factor that into your budget.

Who is complaining?

I could write the rest of this blog on this topic alone; around the right and wrong of Oracle’s licensing methods in VMware environments but I’ll defer to the thousands already available on this very topic.  This is not the reason I am writing this blog but to call out the self-serving and irresponsible Call-to-Action from House of Bricks and leader of a major CI player; VCE and to discuss why Oracle has no incentive to stop doing what they are doing.

Chad Sakac, the President of VCE which is the the Converged Infrastructure (CI) arm of EMC and soon Dell as the acquisition of EMC should be complete any day.  He is a regular blogger and in my opinion a master of marketing, technology & motivation.  On August 17, 2016 Chad wrote a blog titled “Oracle, I’m sad about you, disappointed in you, and frustrated with you.” in which he lays out how Intel server customers running Oracle Enterprise products, most often Oracle Enterprise Edition Database, are fed-up with Oracle’s abusive licensing tactics when Oracle Enterprise Edition products are installed and running in VMware.  He passionately pleads years of Oracle licensing frustration on behalf of clients while challenging clients to stand up to Oracle and not let them be bullied anymore.  He admits to selfishly partnering with House of Bricks (HoB), a VCE partner by funding their analysis on this situation.  HoB has been a leading voice in this fight in there own right so receiving compensation from VCE check was the proverbial icing on the cake IMHO as they were fighting the fight anyway.  What is VCE’s angle? They either have, or are losing  business due to clients fear of running Oracle workloads using VMware vSphere & vCenter.  There must be enough business at stake or EMC / VCE is desperate enough (not being critical here, just observing) to force them down this path to take such a in your face approach to Oracle.

House of Bricks, who is VCE’s partner and author of the whitepaper had a generally fair and moderated tone throughout the whitepaper.  That said, I do find they are irresponsible by encouraging VMware customers who are running  Oracle Enterprise Edition products licensed by core/processor (not socket or NUP) to run in configurations which are in direct conflict with Oracle’s standard licensing practices.  I’m not arguing the merits, fairness or legality of those licensing practices so save your comments.

Fight Mr Customer So We Can Sell You More!

Simply stated, Chad Sakac, the President of VCE and House of Bricks are actively encouraging system administrators, DBA’s and IT organizations to not only defend your use of Oracle Enterprise products in VMware environments, VMware clusters and VMware environments managed under vCenter but also to license Intel servers using sub-capacity licensing, using the BIOS to limit access to sockets or cores, only license the cores being used by Oracle.  Do these things and stand up to Oracle.  Do it for you….do it for us….just do it!  Of course, VCE funded the HoB paper but they won’t be funding your legal case (or bills) with Oracle.  All of this “encouragement” while at the same time promoting EMC / VMware / VCE products in lieu of traditional Oracle availability & replication products seems a little disingenuous…maybe….why not just keep your argument on the complaint of Oracle licensing with VMware?.  But instead, among many “do this instead of that” statements such as liminating Oracle RAC and use VMware HA and consider EMC RecoverPoint / SRDF in lieu of Oracle Active Data Guard (ADG).

Multiple agenda’s

Much of the HoB whitepaper feels like a marketing slick for EMC / VCE products. Then to have Chad be the front man out front crying on behalf of all customers seems a little too self-serving.

My Good Buddy Larry

Now back to Oracle….everybody knows I am NO Oracle fan.  A good day is any day I beat Oracle (anybody beats Oracle) or reduces their revenue.  But, with regard to Oracle’s practices on how they license their Enterprise products in a VMware environment, they have ZERO (0) motivation to loosen their licensing rules given Intel’s continued growth in the marketplace – Oracle is in the drivers seat!  Oracle wants customers to buy infrastructure from them running OracleVM with Oracle Linux hosting the Oracle software stack.  Oracle receives ALL of the Sales & Support dollars this way.  In addition to this, Oracle is predisposed to litigate.  Larry likes to fight!  HP and now HPE, SAP, Google (2 suits, going to a 3rd), Rimini Street, Oregon Healthcare, Mars and many more.  The Oracle v Mars case is a recent example of how Oracle goes after customers using their License Management Service (LMS) group to drive license revenue thru audits.  “Mars stated that Oracle was unwilling to “come to a mutually agreeable process” for completing an audit. Oracle then sent Mars a letter stating Mars had materially breached its license agreement”.  The greatest leverage clients have is to move off of Oracle products (hardware & software) to alternative solutions; specifically database variants such as IBM DB2, Microsoft SQL Server or Open Source alternative PostgreSQL from EnterpriseDB not to mention the many NoSQL alternatives that probably do a far better job.

Alternatives

If VCE really wanted to partner with an enterprise quality commercial-grade database technology to help clients run VMware with sub-capacity licensing for just the servers where the workloads are running and find an alternative to Oracle, they should look at IBM’s DB2 . DB2 is available in multiple editions from a free edition to Advanced Enterprise Server Edition.  What makes it different and better than both Oracle or SQL Server is that AESE, for example, includes many of the products & features that a client desires of Oracle Enterprise Edition products yet have to pay for À la carte.  DB2’s AESE cost $56,210 (list price for 70 PVU) per license which would match up against the Oracle Enterprise Edition portfolio which when you add up those products cost over $225K (Note: DB2 ESE is a level down from AESE, cost less and probably meets 90% of the customers requirements so the story just gets better).  DB2 always includes its first year of maintenance then 20% each year thereafter while Oracle always charges 22% for the first year then 22% each year thereafter.  Of course, DB2 runs 2X faster with Linux on POWER vs Intel. Clients can try it out for free in SoftLayer for 30 days running Linux on a OpenPOWER server.  Since LoP isn’t the topic of this blog, I’ll save that for another day but know that at least both Intel with VMware and IBM POWER servers support sub-capacity licensing with virtualization.

DB2-S822LC-vs-HPDL380

I didn’t write this blog to be a shill for IBM’s DB2 either, it just came to me as I was reading the HoB paper as it felt like they were trying to slyly present SQL Server as a more agreeable alternative to Oracle – maybe they are … either way thought I would mention DB2 for some balance.

There is ONE Platform …

At the end of the day, clients have a choice if they run Oracle products such as PeopleSoft, JD Edwards, Oracle Apps, Oracle E-Business Suite (EBS) or standalone Oracle Enterprise products like Database, RAC, WebLogic and many others.  Clients can run Oracle on Intel with VMware then surely deal with the risk and issues discussed by Chad and House of Bricks OR clients could run Oracle on the only platform which controls Oracle licensing without all of the consternation, debate and angst; IBM POWER servers running AIX.  For those who have read this far and were begging to say “But POWER servers have a core license factor 2X of  Intel so they cost twice as much”.  Enough please! I may hire House of Bricks to write a paper to put an end to this FUD, myth and farce. With POWER8 outperforming Intel servers generally around 2X per core it eliminates this argument right here. But, since we are talking about licensing a product at the core level it is important to remember that POWER servers support sub-capacity licensing natively, without debate from Oracle.  Last and most importantly, IBM’s Power Hypervisor suite, called PowerVM manages the compute resources more efficiently where it scales the 2X performance per core advantage typically increasing it up to 4X, 8X, 12X, even 20X (your mileage will vary).  This isn’t a performance advantage as much as it is an efficiency statement.  I call it the “Total Cost of Efficiency” as it takes into account the TCA, Performance advantage & Hypervisor efficiency and depending on the discussion, years 2-5 maintenance which is TCO.  I have personally sized, architected and delivered these solutions to customers who have in turn realized these very savings.

Now the Call-to-Action!

If you believe VMware & Intel are a critical part of your business identity that make your products better then continue using them with your Oracle products. You will pay more (compared to POWER) due to lower performance & less efficiency and pay the Oracle tax.  If you view IT as an enabler to your bottom line and use the right tool for the job then give me a call as I can help you as I have helped dozens of others save $100K’s to $M’s with IBM server technology.   Oh, and for those last few sharpshooters who want to remark that IBM servers are more expensive go ahead and save your comment.  First, I’ll shut you down by comparing a proper IBM server with the class of Intel server that you present me.  Next, we won’t go the 1 for 1 server route. As I recently showed a customer a reduction of 24 x Dell servers with 596 cores or 298 Oracle licenses to 7 x POWER8 servers with 168 cores and Oracle Licenses. My 7 servers are far less expensive than your 24 servers  not to mention the infrastructure required to support it (power cables, LAN/SAN cables, switch ports, cooling, etc). What makes me different is I show you how it’s possible to save  significant money running Oracle on IBM servers. What makes Ciber different is we have an Oracle consulting practice to help you implement, migrate or optimize your environment.

Intel; the Great Charade

Last weekend I was reading a few blogs on Intel’s recent Broadwell chip.  The current offering is the EP variation.  I regularly read & enjoy articles at The Register, The Four Hundred, AnandTech and The Next Platform.  Working with Enterprise platforms for most of my career, I sometimes get critical (maybe sensitive) of x86 technologies.  After reading a few I was motivated to put together a table comparing the features and performance from Nehalem (Gainestown release) through the latest Broadwell-EP E5-26xx v4 chips.  Essentially the 2 socket systems.

I was just enjoying some political tweets when I saw a tweet by @TheRegister promoting

TheRegister_article_04272016

an article on The Next Platform by Timothy Prickett Morgan titled “Intel does the math on Broadwell server upgrades“.  Tim writes “It all comes down to the math …”.  He is right except the premise behind this statement is there is *more* value going with Broadwell-EP (ie E5-26** v4) chips vs previous models.  I am not saying the author is saying this beyond the information as he understand it or was given by Intel.

This prompted me to stop my political tweeting since Trump & Cruz are in my home state today to write this blog.  There has been a “Great Charade” played by Intel duping customers for years.  Setting aside any comparison of Intel server chips to processors from IBM or Oracle, I will just focus on Intel vs Intel over the last 8 years.

For the table below, I pulled data from Intel’s documentation, Wikipedia, from other The Next Platform articles such as “Xeon bang for the buck, Nehalem to Broadwell” and WCCFtech.

Intel_Neh-Broadwell_Comparison

The row labeled with “1” shows the Relative Performance score used from this The Next Platform article.  My methodology is to divide the Rel Perf score by the # max number of cores available with that processor generation to obtain its per core score labeled by the row with a “2”.  The row labeled with “3” shows the % of increase in cores from the previous generation.  The row labeled with “4” is where it gets interesting.  This shows what the score would be if using the Rel Perf per core score for the original Nehalem processor 0f .29 by the number of cores available in the current generation.  So, Ivy Bridge is 0.29 x 12 to obtain 3.48.  The The data in row labeled with “1” comes from Intel even thought I obtained it from The Next Platform article.  Why is this important?  The actual score for the Ivy Bridge processor of 3.73 is greater than the extrapolated score of 3.48.  Having a higher score for row 1 over row 4 is better for Intel performance.  However, when you look at the Haswell & Broadwell-EP processors, the actual score is below the Nehalem extrapolated score indicating there is a decline in per core performance.

Now that you understand the methodology, lets look at the results for each tick-tock.  The Nehalem processor was released 8 years ago with 4 cores and a Relative Performance rating of 1.16 or .29 per core. The next release was the Westmere-EP processor with 6 cores having a chip score of 1.98 or .33 per core.  Westmere was followed by Sandy Bridge-EP with a Rel Perf score of 2.55 for its 8 cores or .32 per core. After Sandy Bridge was the long awaited and much hyped Ivy Bridge delivering 12 cores for the EP model. Its Rel Perf score of 3.73 translates to .31 per core.  Notice the trend? After IB-EP was Haswell-EP with 18 cores delivering a 5.20 Rel Perf score or .29 per core leading us to the latest and greatest Intel offering; Broadwell-EP with 22 cores & a 6.34 Rel Perf score or .29 per core.

What does this mean? With each Tick-Tock or successive release, Intel touts magnificent performance yet many of the improvements and performance benefits tout the total socket capacity vs its per core capabilities.  There is nothing wrong with socket totals if this is whats required or your software is priced in this fashion.  However,  many enterprise ISV’s charge license & maintenance fee’s based on cores.  Some are based on the total number of cores in the server such as Oracle and others simply base it on the number of cores required.  Either way, the stronger the core the better.

Yet the data shows that per core performance peaked with Westmere-EP followed by Sandy Bridge-EP.  At best you could argue performance has been flat over the last 8 years with little hope for the two anticipated successors in SkyLake and CannonLake.  I extrapolated the data using the same methodology showing performance will be flat to regressing.  This of course is consistent with the Intel Exec VP & GM William Holt who said “The best pure technology improvements we can make will bring improvements in power consumption but will reduce speed“.  Intel began moving away from Moore’s Law to an economic and financial model to remain on a tick-tock schedule rather than taking the approach to build an improved chip; not just one with more capacity.  Intel marketing is forced to hype the capacity increases in the latest chips as performance when it was mainly due to the addition of cores.  Yes, there were micro-architecture improvements but that benefits the internal plumbing to maintain coherence & data flow through increased cores. Core counts increased by 50%, then 66% and now it is slowing down with Broadwell-EP as they run into another problem which Intel’s Holt points to as well…..maintaining performance while maintaining TDP.

All of this as Intel moved from PCIe2 to PCIe3, DDR3 to DDR4 memory, more memory channels per controller and enhancing the QPI bus.  What is more telling with Intel  is that clock speeds have not just stalled but decreased with the top end chips because Intel is unable to deliver high core counts with high frequencies. All models still deliver 2 threads per core,  per core L3 cache has modestly increased with Nehalem at 2 MB to 2.5 MB.

The data speaks for itself.  Intel has perpetrated this Great Charade convincing customers they have receive increased performance with each successive processor release when in fact they are buying flat performance, subsidizing Intel while increasing software licensing / maintenance costs.

Oracle’s at it again – stuffing a card up their sleeve!

Oracle continues its record of having 0 (i.e. ZERO) credibility.  How many times has Oracle been called out for publishing & making statements about competitors solutions that were just flat out wrong leading one to wonder if it is more than the standard competitor FUD and benchmark exaggerations but purposefully meant to mislead.

Take a recent Oracle blog post by @Brian-Oracle at https://blogs.oracle.com/BestPerf/entry/20160317_sparc_t7_1_oltp.  Oracle is hell bent to produce a TPC-C benchmark on a POWER8 server since IBM has not.  I do not work for IBM and have not heard any official reason but have heard they do not see TPC-C as a good benchmark of platform performance which is why you do not see any entries since 2013 by any vendor.

There is a Oracle Marketing troll who posts as @PlinkerTind for this El Reg article http://forums.theregister.co.uk/forum/1/2016/04/07/open_power_summit_power9/?post_received=2834982#c_2834982 who asks what is IBM scared of.  What he fails to disclose is that IBM would love to run a Oracle benchmark on POWER8 but the software license agreements state if the product is being used for benchmark purposes it requires the approval of the owning company.  Well, Oracle doesn’t permit it except for Oracle benchmarks such as Oracle EBS.  Why would Oracle not want to let IBM conduct a benchmark using Oracle DB? Because it would show what customers see who run it and what prospects wee who evaluate it that POWER controls software licensing; Oracle, DB2, EnterpriseDB, and every other product.  Fewer servers, fewer sockets, fewer cores, etc.
Back to the Oracle blog, author writes “On a per chip basis, the SPARC T7-1 server demonstrated nearly 5.5 times better performance compared to an IBM Power System S824 server”.  Let’s break it down using the authors “vernacular”.  A T7-1, although it is actually 8 x quad core chiplets and not a single 32 core chip as Oracle likes to claim,  yet this author along with the other Oracle marketing trolls refer to a 2 socket S824 & S822 as a 4 chip system – essentially for the Scale-Out servers, they call each socket two chips because of how IBM builds the chip using a Dual Chip Module (DCM) vs the Single Chip Module (SCM) in the Enterprise servers.  Now,  IBM who engineered the chips says it works functionally as 1 chip in 1 socket.   (Ref: IBM POWER8 S824 Redbook http://www.redbooks.ibm.com/redpapers/pdfs/redp5097.pdf).

So, the Oracle blog authors says the SPARC  32 core chip, by HIS words is 5.5X better performance than the POWER8 S824 server. Thus, 6 cores vs 32 cores or 5.33X.  6 cores because Oracle is comparing Chip to Chip.  Thus, they consider it reasonable and credible to compare their 32 core SPARC T7 chip vs a 6 core POWER8 chip.  That is so disingenuous.
Next, the author chose the 24 core S824 running at 3.52 GHz vs the 4.13 GHz T7-1. Why didn’t Oracle pick the 16 core S824 running at 4.15 GHz?  That way they could call it a 4 chip, 2 socket with 4 cores per chip system vs Oracle’s 32 core, single socket, single chip (that is really 8 x quad chiplets) system. Using the 16 core S824 would give the SPARC T7-1 server 2X more cores allowing for easier extrapolation?  Oracle did set the ASMI mode to favor performance but it is the 2nd best option to use (there is a better option but since Oracle doesn’t know the platform they either chose not to or didn’t know what to select). Even with this set, their is still a clock frequency discrepancy.  Now, I’m not hung up on the clock frequency because most Intel servers have lower clock frequency.  Unlike Intel which cannot run all of their cores at the higher clock frequency like POWER there are times when the clock frequency comparison being made is vastly different can’t be helped.  That’s not the case though with POWER.  IBM offers servers ranging from 3.0 to 4.35 Ghz making it easy for them to choose one that makes the comparisons as close as possible.

Next the author says “On a per core basis the SPARC T7-1 server demonstrated nearly 3% better performance per core compared to an IBM Power System S824 server.”. I’ll just refer to the above paragraph where the T7 clock frequency was running 17% higher.  Then he says “At the system level, the SPARC T7-1 server demonstrated nearly 1.4 times better performance compared to the IBM Power System S824 server.”. 24 cores times 1.4 = 33.6……hmmm. You tested a 1 socket vs a 2 socket server running with a 17% higher clock.  I point out the 1 vs 2 socket server because with POWER  processors I would expect a single socket to be slightly better performing than a 2 socket than a 4 socket than a > 4 socket server.  Taking into account other factors that do not require the memory & I/O provided by those extra sockets.  Again, if they want to test on a per core basis that can be done with any system. If they wanted to test on a per socket basis then do it with the T7-1 and even use the S824 but only use 12 cores.  Otherwise, compare a T7-2 vs a S824 which would be 64 vs 24 cores.

Read the comments for an article about POWER9 at AnandTech http://www.anandtech.com/show/10230/ibm-nvidia-and-wistron-develop-new-openpower-hpc-server-with-power8-cpus-nvlink.   Look for comments by @Brutalizer who is an Oracle Marketing troll.  He gets crushed by the the commenters as they rightly point out that Oracle ran the Oracle benchmark on the POWER8 server with no disclosure on the full details of how the server was configured; How many DIMMs were used?  Filling all 16 DIMM slots makes a difference than just 8 or 4 DIMMs since all 3 configuration options can achieve the configured 512 GB Ram. Although a few tunables were disclosed as if to demonstrate that Oracle made an effort to give the POWER server a fair shake, I question the ones used. I won’t disclose what I would have done differently as I like that Oracle looks like petty fools in their effort to show Oracle on POWER8 performance. If they were interested they would authorize IBM to run their own benchmark – or accept my POWER challenge where we run a customers workload on each of our servers.  Alas the cowards have yet to acknowledge it let alone accept it.

The lesson I hope readers learn from my blog is not that Oracle software and hardware products are bad, they are not at all.  However, they have this seemingly uncontrollable need to overstate and mislead customers by any effort to get customers to consider and god forbid buy their products.  Yes, in this poker game we all play there is bluffing and smart play but with most vendors it is within the rules.  With Oracle they always seem to have a card stuffed in their sleeve as they seem incapable of competing fairly making them resort to unscrupulous behavior such as the Oracle blog.

P.S.  I do not accept any of the results obtained by Oracle testing the S824.  If IBM conducted the same test running Oracle with AIX on POWER8 I am confident the results would make the S824 look far better than stated by Oracle.  One can only draw the conclusion that Oracle optimized their T7-1 to achieve the most favorable results.  IBM should have the same opportunity.  Of course, my Power challenge to Oracle is the real-world test using actual customer data.  Maybe this blog will get Oracle to man-up!

Power investments continue to pay off!

This article is to highlight the announcements IBM is making in their Fourth Quarter 2015 related to the Power platform.  This is one of the largest announcements in years that I can recall touching Linux, IBM i, AIX, virtualization, management, ISV’s and the platform itself.  Since I am not an IBMer with access to schedules there may be a few things that differ.

First,  you will want to register.  This is a virtual event which is convenient as you can not only register at anytime but also watch it at anytime online.  Register at https://engage.vevent.com/index.jsp?eid=556&seid=80414&code=Social_Tiles.

As a Business Partner I am glad to see that IBM is delivering on what they told us over the past several years.  They are taking their $B investment delivering useful and leading technologies  with Linux on Power as is needed but also with AIX and IBM i.  These latter two Power pillars are far more mature and do not require the technology enhancements nor the ISV adoption like Linux on Power (LoP) requires.  Stands to reason there will be more activity around the LoP space, not because that is the future and the others will diminish but  for what I mentioned, it is less mature relative to the enterprise AIX & IBM i markets.

This is the extensive list of features being announced this quarter.  I will add a reference section after the announcement(s) to allow you to get more information on each of the features.

AIX

  • AIX 7.2 – some really good features!
    • “Live Update” or apply AIX updates concurrently without requiring a reboot
    • RDSv3 over RoCE optimizes Oracle RAC performance using Oracle RDSv3 protocol with Mellanox Connect RoCE adapters (up to 40 Gb)
    • Workload optimization with Flash
    • Dynamic System Optimizer
    • BigFix Lifecycle for automated and simplified patching
  • New AIX Enterprise Edition packaging
  • AIX 6.1 Withdrawal from Marketing April 2016

IBM i

  • New IBM i v7.1 TR11
  • New IBM i v7.2 TR3
  • S822 expanded capabilities – supports IBM i
    • Requires VIOS for I/O

Virtualization – PowerVM

  • New VIOS release – v2.2.4 based on AIX 7
  • NovaLink architecture provides scalability features for OpenStack deployments
  • New SRIOV capabilities
  • Introducing vNIC Adapter – increases performance with SRIOV
  • Shared Storage Pool enhancements

HMC

  • New HMC model – CR9
  • New HMC version – 8.8.4
  • New virtual HMC offering – Run 8.8.4 in a VMware or RHEV VM (x86)

Power platform

  • New Power8 firmware release – 840 or 8.4 level
  • New PCIe adapters
  • PurePower enhancements
    •  IBM i support
    • vHMC support
    • PurePower Integrated Manager improvements
    • Order  both S822 & S822L with initial order

Management

  • New PowerVC 1.3 version – more management & OpenStack integration features
    • Advanced policy-based management
    • Supports MSPP
    • Expanded vSCSI & NPIV support for certain storage models
  • Manage Power servers using PowerVC with OpenStack with VMware’s vRealize

Security

  • PowerSC NERC Profile compliments existing PCI, DOD STIG, HIPPA, SOX-COBIT

High Availability

  • New PowerHA 7.2 version
    • Integrates Power Enterprise Pools as part of a PowerHA failover operation
    • Improved integration with LPM
  • Non-disruptive upgrade
  • Integrates with new AIX Live Update feature
  • New wizard to use GLVM for low cost mirroring option
  • Enhanced EMC SRDF support
  • Supported on AIX 6.1 TL9 and later
  • Supports Power6 and new servers

Linux

  • New Power Linux server models  – true price parity with Intel servers. Built on OpenPOWER
    • S822LC – up to 20 cores, 1 TB, 2 SFF HDD & 5 PCIe slot 2U server
    • S812LC – up to 10 cores, 1 TB, 14 LFF HDD & 4 PCIe slot 2U server
  • PowerKVM features
    • Dynamically add/remove cpu & memory resources from VM’s
    • Live Migration
  • IFL enhancements – IFL’s run IBM software in a Linux VM on 4 socket & larger Power servers with a 70 PVU vs 100 or 120

Performance

  • New CAPI offerings
  • New SSD offerings – Gen 4 drives, higher performance & capacities
  • 36 port EDR 100 Gb/s Infiniband Switch delivering latency as low as 130 ns

ISV & Software

  • New Linux ISV partnerships – More & more ISV’s are coming to IBM asking to be a part of the Power market revolution taking place
  • SAP HANA announcements
  • New BigInsights 4.1 features with Hadoop & Spark
  • PureApp now available with Power8 servers (announced July 2015)

Cloud

  • SoftLayer announcements – Linux on Power bare metal offerings
  • Power Enterprise Pool enhancements

The above list is fairly complete although lacking a lot of detail which is available online in the announcement letters or better yet contact your IBM Power Sales Specialist or Business Partner.  If your Business Partner is not proactively offering to keep you updated on these types of announcements you may want to reevaluate what value your Value Added Re-seller is providing and look for another.  Don’t settle for an order taker but a technology enabler.

IBM continues to deliver innovation, value, solutions and options to the “Good Enough” alternative with Intel where it has become obvious over their last 2 chip releases they are taking  customers for granted.  Hear how the performance of the Power8 processor gives  the equivalent of a 35 PVU vs Intel’s 70 PVU (this is a PowerMan example and not IBM itself). With IBM Software this is an immediate 50% reduction in licensing and maintenance costs.  Factor in the hypervisor efficiency and that should increase significantly.

Who doesn’t want more performance, more reliability for the same price as the competition? You can have it  your way with IBM Power8 & OpenPOWER.

 

Are you keeping score?

However, unlike Oracle and Intel, who in their own ways are both becoming very rigid and proprietary. IBM is going in the opposite direction by actually opening up their own technology as well as embracing the open source community.

Are you keeping up with all of the changes taking place with IBM’s Power portfolio over the last 2 years?  How could you as there have been so many changes to just about every area on the platform.  Just in case, let’s review some of the features and changes.  Now, since I don’t work at IBM, don’t have a magic 8 ball or a direct line to Doug Balog, IBM’s Power General Manager there is a chance I will get something wrong and I’m sure I’ll leave a few things out.  What I will do though, is not talk about any upcoming products – which between you and me – They will be freakin awesome!

  • Power8 processors support both Big Endian and Little Endian
  • Little Endian offerings are with Linux: RedHat 7.1, SUSE 12 & Ubuntu 14 & 15
  • Big Endian offerings are AIX, IBM i as well as Linux (SUSE 11 & RedHat 6.5 & 7.1)
  • PowerVM supports both BE and LE Operating Systems (concurrently)
  • IBM introduced an alternative hypervisor to PowerVM; an open source alternative based on KVM
  • PowerKVM on Scale-Out models support both BE and LE Linux
  • Still supports the use of VIOS for PowerVM environments with HMC enhancements
  • Uses Kimchi to manage PowerKVM environments
  • SR-IOV adapters (finally) available
  • Nice set of quad port Ethernet (2×10+2x1Gb) adapters available for AIX & Linux workloads
  • 40 Gb RoCE adapters
  • PowerVC based on OpenStack replaces Systems Director (portions of it)
  • PowerVC integrates with VMware’s vRealize allowing it to manage & provision Power Systems
  • Gen 2 of a Converged solution called PurePower using S822(L) servers + V7000 + Mellanox switches
  • 2X performance increase over Power7. Greatest increase that I recall from 1 gen to the next
  • 3X greater memory bandwidth for Scale-Out over Power7/7+ Entry servers
  • Just under 2X greater memory bandwidth for Power8 Enterprise vs Power7/7+ Enterprise servers
  • 3X greater I/O bandwidth for all Power8 servers over all Power7/7+ servers
  • 2X more L2 cache
  • Addition of L4 cache
  • SMT8 that is dynamic per VM unlike Intel with their static 2 way Hyperthreading
  • Up to 1 TB Ram per socket
  • Centaur buffer supports DDR3 & DDR4 memory
  • Scale-Out servers (S & L models) use Enterprise Memory just like the Enterprise servers
  • Significantly more Fault Isolation Registers & Checkers
  • Significant reliability enhancements to processor, cache, memory & I/O subsystems
  • Gen3 I/O Drawers with more PCIe slots per drawer
  • Gen3 PCIe slots (x8 & x16) in all servers
  • Available split backplane in ALL Scale-out servers – YEAH!
  • Significant increase in the number of internal Scale-Out server disk slots: 8 – 18 slots
  • 6 or 8 x 1.8″ disk slots (model dependent) using IBM’s Award Winning (Is it?) Easy Tier
  • Hot swap PCIe slots in ALL models
  • Scale-Out internal disk RAID cache increased from 175 MB in Power7 to 7 GB
  • PowerVM, PowerKVM and RHEV hypervisor options
  • Bare-Metal server option (certain models)
  • Enhanced Voltage Regulator Modules (VRM)
  • Enhanced EnergyScale
  • Some models lowered their software tier from Large to Medium and Medium to Small
  • Some servers received improved warranties
  • Power Enterprise Pools with Mobile Cores & Mobile Memory
  • Integrated Facility for Linux (IFL’s) on E850, E870 & E880
  • L model servers that use PowerVM is called PowerVM Linux Edition equivalent to Enterprise Edition
  • Supported with SAP HANA
  • All non-L models supported with EPIC
  • All Enterprise servers supported with EPIC
  • Delivering roughly 2X performance increase over Intel Haswell EP/EX processors
  • Power8 has 3X greater memory bandwidth than Intel Haswell EP memory in Lock-Step mode
  • Power8 has 2.5(ish) greater memory bandwidth than Intel Haswell EX memory in Lock-Step mode
  • Power8 has 2X greater L1 cache
  • Power8 has 2X greater L2 cache
  • Power8 has 2.5X greater L3 cache
  • Intel has no L4 cache
  • Intel has significantly fewer Checkers
  • Intel does not use anything similar to Fault Isolation Registers
  • Intel 1 & 2 socket servers do not support Machine Check Architecture (MCA)
  • Only Intel 4 socket & greater servers support Machine Check Architecture (MCA)
  • Intel rates their memory capabilities in performance mode which has limited RAS capability
  • For Intel to increase memory resiliency, must use lock-step mode which decreases performance
  • For Intel to increase memory resiliency, can use memory mirroring reducing memory capacity by 1/2
  • All Power8 servers have at least 1 Coherent Accelerator Processor Interface (CAPI)
  • Several CAPI solutions are available today with more coming (very soon – that’s the only hint)
  • Using IBM’s Advanced ToolChain delivers an enhanced SDK & GCC delivering increased results
  • IBM optimized software stack for Power8 delivering greater results per core
  • Optimized ISV stack for Power8 delivering greater results per core
  • ISV’s like MariaDB, WebFocus Express, Redis Labs, ColdFusion, EnterpriseDB, Magento
  • ISV’s like McObject, Veristorm, HelpSystems, SugarCRM, Zato, OpenPro and many more
  • Guaranteed utilization levels from 65 – 80% depending on the model
  • Backbone of Watson
  • Supports Apache Hadoop & Apache Spark
  • Supports BigInsights
  • Supports internal disks for Big Data
  • Supports superior BD storage option using Elastic Storage Server: S822L & GPFS Native Raid
  • Run DB2 10.5 with BLU Acceleration 2X faster than Intel
  • WebSphere & Cognos faster
  • SPSS faster and many more
  • 150+ members in OpenPower Foundation
  • Blue badged Power based solutions
  • Non-blue-badged Power based solutions available
  • PowerVP is awesome
  • Virtual HMC is coming (was a IBM SOD)

I tried to create an exhaustive list to make the point that IBM is investing, engineering and building game changing technology to help customers solve real solutions. However, unlike Oracle and Intel, who in their own ways  are both becoming very rigid and proprietary.  IBM is going in the opposite direction by actually opening up their own technology as well as embracing the open source community.  The list of ISV’s, the use of OpenStack, the use of a open source hypervisor and the most recent announcement whereby VMware’s vRealize will be able to provision & manage Power Systems is a testimony to the change taking place within IBM and with customers.  Expect more of the above and more beyond this. Expect it to blow your mind.

I may come back over time and enhance this list if I get froggy.  Of course, if you are a customer you could always invite me to speak with you and I would be happy to discuss any and all of this.

What would you like to see in addition to the extensive list above?