PivotalR Improves the Scalability and Performance of In-Database Analytics

June 18, 2013 Paul M. Davis

image by George Montana Harkin, Noun Project

One of the greatest challenges while working with big datasets concerns the need to move information out of storage for analysis. This process can increase the chance of error and often forces practitioners to work with partial or incomplete samplings of the data. One of the key features of the Pivotal HD platform and HAWQ is the ability to directly work with data within a Hadoop cluster, no movement necessary. To this end, the recent announcement of PivotalR 0.1 extends the platform’s capabilities, allowing users of the statistical programming language R to perform in-database analytics without leaving the command line.

PivotalR improves the scalability and performance of in-database analytics by letting users explore and manipulate information in the database using the R interface. PivotalR handles the necessary SQL translation, and computation is done within the database. The result is faster queries and modeling, without requiring the user to move data or work with only a portion of all the available information.

Practitioners familiar with R syntax will be able to perform predictive analytics and interact with MADlib analytics function calls using the language that they are already familiar with. On the roadmap for PivotalR includes support for R visualizations using intelligent sampling, Chorus integration, and support for all existing MADlib algorithms.

The PivotalR 0.1 package is available for download on Github, along with documentation, example code, and the quick start guide. You can also learn more about PivotalR from this video walkthrough by Hai Qian of Pivotal’s Predictive Analytics Team and Woo Jae Jung of the Data Science Team.

About the Author

Biography

Pingpongapalooza

Helps Best non-paperclip solution for S3? There was a question about uploading to S3 on an iOS project back...

Bulletproof font-face syntax with SASS and Rails

The Bulletproof font-face syntax effectively gives you broad browser support for your fancy (or austere) we...

PivotalR Improves the Scalability and Performance of In-Database Analytics

About the Author

Previous

Next

PivotalR Improves the Scalability and Performance of In-Database Analytics

About the Author

Previous

Next

Related content in this Stream

Following the xz supply chain attack blog, explore security and trust in open source with VMware Tanzu's secure container solutions and proactive measures.

VMware Tanzu empowers Netflix accelerates its service evolution and boosts the capabilities of its development teams. Tanzu helps to provide them with the platform to run on and scale.

Unveil regulatory compliance ease with VMware Tanzu Spring Runtime! Elevate audits, adhere to FIPS & NIST standards, benefit IT, DevOps, and Auditors.

Uncover open source risks and the 'Zero CVE' myth with insights on continuous lifecycle management. Discover how VMware Tanzu supports diverse projects effectively.

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

This blog provides a summary of VMware Tanzu CloudHealth news and product updates for the month of April, 2024

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

How VMware Tanzu CloudHealth helps customers uncover spiraling AWS Extended Support charges.

VMware Tanzu enhances Spring development with simplified operations, accelerated innovation, seamless microservices transition, increased security, and effortless scaling.

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

This 7-part blog series provides a roadmap for architecting a data science platform using VMware Tanzu. We'll delve into the building blocks of a successful platform that drives data-driven insights.

Bitnami-packaged open source software is loved by developers for its ease of use, which enables developers to directly pull a Bitnami package and seamlessly start using it with little effort.

VMware Tanzu announces the General Availability of AWS Commitment Discount Recommendations, which provides recommendations for all reservable services in AWS through VMware Tanzu CloudHealth.

Introducing VMWare Tanzu Data Hub, a self-managed Database as a Service (DBaaS) Platform, providing enterprises a way to host their internal DBaaS offering for internal business users.

In the cloud-native landscape, MCAs drive seamless compliance integration. Their expertise ensures proactive security measures align with regulatory standards for sustained innovation & collaboration.