Case Study: Scaling Reservations for the World’s Largest Train System, China Railways Corporation

November 7, 2013 Stacey Schneider


The biggest annual movement of humans on the planet happens around the Chinese New Year, also known as the Spring Festival. According to China Daily, there are 34.88 million trips by air, 235 million trips by rail, and 2.85 billion road trips during the peak of China’s Spring Festival holiday period. Historically, rail travel has meant long lines and waits, and China Railways Corporation (CRC) began to sell tickets online to offer a more convenient method than purchases at stations, ticket offices, or by phone.

As use of and access to the online reservation system grew, travel-season traffic placed enough pressure on CRC's rail reservation systems to break their legacy RDBMS, warranting a new project to improve online performance and scalability. Holiday travel periods like Spring Festival pressure the system even further; during these spikes, the site becomes one of the most popular websites in China. Under such severe demand, travelers experienced outages, poor performance, booking errors, payment failures, and problems with ticket confirmations.

One of the first steps taken by Dr. Zhu Jiansheng, Vice Director of the China Academy of Railway Sciences, was to look for areas to boost performance. Back in 2011, Dr. Zhu began sponsorship of a new system based on the two known performance bottlenecks at the time:

  1. The relational database was overloaded to the point where it could handle neither the scale of incoming requests nor the level of reliability required to meet their SLAs.
  2. The UNIX servers' computational power was inadequate to meet the capacity requirements.

According to Dr. Zhu, “Traditional RDBMS and mainframe computing models just do not scale like a system built to run in memory across multiple nodes. Our website was proof of this, and trying to scale our legacy system was going to become very expensive.”

Solving Scale and Availability Problems with In-Memory Data Grids

Dr. Zhu’s team began looking at other solutions. Mainframes were found to have the same bottleneck issues as the RDBMS. In exploring in-memory data grids (IMDG), they found Pivotal GemFire, an IMDG with a proven track record for scaling some of the most challenging data problems in the world across financial services, airlines, e-commerce, and other industries. To perform an evaluation, Dr. Zhu and his team selected International Integrated Systems, Inc. (IISI). The IISI team had a strong track record of working with government organizations, developing transportation solutions, migrating legacy systems to cloud architectures, and working with Pivotal GemFire. They began with a pilot, believing GemFire would meet the performance, scale, and availability requirements as well as run on low-cost, commodity hardware.
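
To make the data-grid approach concrete, here is a minimal sketch of how an application tier might read and update ticket inventory through a GemFire client cache. The locator address, region name, key format, and value type are hypothetical stand-ins rather than details from CRC's system, and the imports follow the GemFire 7-era com.gemstone.gemfire package names.

```java
import com.gemstone.gemfire.cache.Region;
import com.gemstone.gemfire.cache.client.ClientCache;
import com.gemstone.gemfire.cache.client.ClientCacheFactory;
import com.gemstone.gemfire.cache.client.ClientRegionShortcut;

public class TicketLookupClient {
    public static void main(String[] args) {
        // Connect to the data grid through a locator (host and port are placeholders).
        ClientCache cache = new ClientCacheFactory()
                .addPoolLocator("locator.example.internal", 10334)
                .create();

        // A PROXY region holds no local data; every get/put goes to the grid,
        // which keeps the hot ticket data entirely in server memory.
        Region<String, Integer> seatsLeft = cache
                .<String, Integer>createClientRegionFactory(ClientRegionShortcut.PROXY)
                .create("SeatInventory");

        // Key format (train number + travel date) is purely illustrative.
        String key = "G1234:2013-09-30";
        Integer remaining = seatsLeft.get(key);
        if (remaining != null && remaining > 0) {
            seatsLeft.put(key, remaining - 1);   // reserve one seat
        }

        cache.close();
    }
}
```

A real reservation flow would need to guard the decrement with GemFire transactions or server-side function execution rather than a bare get/put, but the sketch shows the basic access pattern an online booking tier would use against the grid.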

The IISI team created a proof of concept that demonstrated several advantages with GemFire. The speed of ticket calculation improved 50 to 100 times, and response times held at 10-100 millisecond latencies as load increased. They could also see the ability to add capacity on demand while achieving near-linear scalability and high availability. The project team built a pilot in just two months, and four months later the new online system was fully deployed to all classes of passengers across 5,700 train stations.
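
Adding capacity on demand in GemFire typically means starting additional cache servers and rebalancing partitioned data onto them. The sketch below assumes a newly started server-side member that has already joined the cluster and again uses the GemFire 7-era package names; the article does not describe CRC's exact operational workflow, so treat this as illustrative only.

```java
import com.gemstone.gemfire.cache.Cache;
import com.gemstone.gemfire.cache.CacheFactory;
import com.gemstone.gemfire.cache.control.RebalanceOperation;
import com.gemstone.gemfire.cache.control.RebalanceResults;

public class AddCapacity {
    public static void main(String[] args) throws Exception {
        // Obtain the Cache of a newly started member that has joined the distributed system.
        Cache cache = new CacheFactory().create();

        // Redistribute partitioned-region buckets so the new member takes its share of the data.
        RebalanceOperation op = cache.getResourceManager()
                .createRebalanceFactory()
                .start();

        RebalanceResults results = op.getResults(); // blocks until rebalancing finishes
        System.out.println("Buckets transferred: "
                + results.getTotalBucketTransfersCompleted());
    }
}
```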

Scaling the Reservation System

The group in charge of online railway reservations has seen massive, unexpected adoption of the online system year over year and projects growth of as much as 50% per year. Serving 5,700 train stations, the website has booked an average of 2.5 million tickets per day.

In the process, the infrastructure changed drastically. Seventy-two UNIX systems and a relational database were replaced with 10 primary and 10 backup x86 servers, a much more cost-effective model that holds 2 terabytes, or one month, of ticket data in memory.
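
A primary/backup layout like this maps naturally onto GemFire partitioned regions with redundant copies: each data bucket has a primary on one server and a backup on another, so losing a machine does not lose data or availability. The region name and value type below are hypothetical, and the configuration is a minimal sketch rather than CRC's actual setup.

```java
import com.gemstone.gemfire.cache.Cache;
import com.gemstone.gemfire.cache.CacheFactory;
import com.gemstone.gemfire.cache.Region;
import com.gemstone.gemfire.cache.RegionShortcut;

public class TicketRegionServer {
    public static void main(String[] args) {
        // Each x86 server in the grid runs a cache member like this one.
        Cache cache = new CacheFactory().create();

        // PARTITION_REDUNDANT spreads buckets across all members and keeps one
        // redundant copy of each bucket on a different server, so every entry
        // lives in memory on two machines at once.
        Region<String, byte[]> tickets = cache
                .<String, byte[]>createRegionFactory(RegionShortcut.PARTITION_REDUNDANT)
                .create("SeatInventory");

        System.out.println("Hosting region " + tickets.getName()
                + " on member " + cache.getDistributedSystem().getDistributedMember());
    }
}
```

Across a 20-member cluster, primaries and their redundant copies end up spread over the fleet, which is how a month of ticket data can stay entirely in memory while still surviving the failure of individual servers.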

According to Dr. Zhu, “First, Pivotal GemFire offered proof in a realistic test environment. Then, the pilot was a success. Production saw severe, unexpected spikes in seasonal demand, and we took an iterative approach to deployment, overcoming a series of scale challenges. As seen in the most recent National Holiday for 2013, the system is operating with solid performance and uptime. Now, we have a reliable, economically sound production system that supports record volumes and has room to grow. This scale was achieved with 10-100 millisecond latency.” GemFire’s built-in high availability, redundancy, and failover mechanisms provide continuous uptime, and the product has exceeded all of CRC’s metrics in this area and helped them maintain their SLAs.

Holiday travel periods create peaks of 10 million tickets sold per day, 20 million passengers visiting the web site, 1.4 billion page views per day, and 40,000 visits per second, with up to 5 million of those tickets sold online per day at peak. Given that so many people rely on the system for travel, it is critical for the site to remain continuously available and to scale during peak times.


