Data Science

Identifying patterns in big data to predict and influence outcomes gives businesses the foresight to get better at what they do.

  • The Eight-Fold Path of Data Science
  • Continuous Integration for Data Science

    This is a follow-up post to Test-Driven Development for Data Science and API First for Data Science, focusing on Continuous Integration. Last time we wrote about the importance of...
  • Data Science in Practice
  • Data Science How-To: Using Apache Spark for Sports Analytics

    In this post, Chris Rawles gives a hands-on tutorial for getting started with the recently released Spark 2.1 using data from the National Basketball Association (NBA).
  • Using Data Science to Detect Healthcare Fraud, Waste, and Abuse

    Health care spending reached $3.2 trillion in the U.S. in 2015, accounting for over 17% of GDP. Nobody is sure how much of that spending was wasted on fraudulent insurance claims and unnecessary...
  • How to Use Temporal Features in an MPP Database

    Learn how to create temporal features in an MPP database and how to set up the data for cross-validation and model building.
  • Transforming Your Company into a Data Science-Driven Enterprise
  • Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field
  • New Tools To Shape Data In Apache MADlib

    Apache MADlib includes new utilities that make it easier for data scientists to shape and transform data, and to evaluate the accuracy of predictive models. It is well known that data scientists...
  • Using Data Science for Cybersecurity
  • Designing for Highly Technical Products — Tim McCoy & Mike Long (38:23)

    Where does UX/Product design fit in a world of command line tools and machine learning algorithms? Hear Tim McCoy, Pivotal's Senior Director of UX, discuss how we go beyond the GUI in Pivotal BDS and...
  • Operationalizing PySpark Data Science Models on Pivotal Cloud Foundry

    In this post, Pivotal's Andreas Fleig and Dat Tran illustrate how to deploy and operationalize an Apache Spark-trained machine learning model in an application running on Pivotal Cloud Foundry.
  • Data Science for Connected Vehicles

    Hear about the techniques used by the data science team at Pivotal Software to create predictive maintenance applications for connected vehicles.
  • Support Vector Machines in Apache MADlib

    The new release of Apache MADlib 1.9 (incubating) includes support vector machines, which can be used for classification and regression tasks. SVM models have two particularly desirable features:...
  • Data Science How-To: Text Analytics-as-a-Service

    Pivotal Data Scientist Chris Rawles shares a useful example of how to operationalize a text analytics model and deploy it as a scalable, autonomous microservice on Cloud Foundry. This example...
  • The Apache MADlib Community (4:47)

    Learn how to become part of the Apache MADlib community.
  • The Origin of Apache MADlib (incubating) (5:10)

    Hear how the MADlib SQL machine learning open source project was originally started by Joe Hellerstein, the original founder.
  • Deep Dive into Apache MADlib 1.9.1 (36:33)

    Watch demos of the new features of Apache MADlib (incubating) 1.9.1, including support vector machines, predictive metrics, sessionization, pivoting, and improvements to path functions.
  • Path Functions in Apache MADlib

    The new release of Apache MADlib (incubating) 1.9 adds several new features, including path functions, which can be used to perform regular pattern matching over a sequence of rows, and then...
  • Data Science: Driven Software Product Innovation — Radhakrishnan, Ramanujam (51:44)

    In this webinar, learn how Pivotal's Regunathan Radhakrishnan and Srivatsan Ramanujam helped a large financial services company evolve one of its customer-facing applications to meet the changing beh