Accurately Building Genomic Cohorts at Scale with Delta Lake and Spark SQL
This is the second post in our “Genomic Analysis at Scale” series. In our first post, we explored a simple problem: how to provide real-time aggregates when sequencing large volumes of genomes. We...
Monitor Medical Device Data with Machine Learning using Delta Lake, Keras and...
On August 20th, our team hosted a live webinar—Automated Monitoring of Medical Device Data with Data Science—with Frank Austin Nothaft, PhD, Technical Director of Healthcare and Life Sciences, and...
Engineering population scale Genome-Wide Association Studies with Apache...
Try this notebook series in Databricks. The advent of genome-wide association studies (GWAS) in the late 2000s enabled scientists to begin to understand the causes of complex diseases such as diabetes...
Parallelizing SAIGE Across Hundreds of Cores
As population genetics datasets grow exponentially, it is becoming impractical to work with genetic data without leveraging Apache Spark. There are many ways to use Spark to derive novel insights into...
Introducing Glow: An Open-Source Toolkit for Large-Scale Genomic Analysis
The key to solving some of today’s most challenging medical problems lies in the analysis of genomics data. Understanding the impact of the minor changes in an individual’s genome on their overall...
Automating Digital Pathology Image Analysis with Machine Learning on Databricks
Check out our solution accelerator notebooks for automating digital pathology analysis or watch our on-demand webinar to learn more. With technological advancements in imaging and the availability of...
Building a Modern Clinical Health Data Lake with Delta Lake
The healthcare industry is one of the biggest producers of data. In fact, the average healthcare organization is sitting on nearly 9 petabytes of medical data. The rise of electronic health records...
Introducing GlowGR: An industrial-scale, ultra-fast and sensitive method for...
Today, we announce that we are making a new whole genome regression method available to the open source bioinformatics community as part of Project Glow. Large cohorts of individuals with paired...
Detecting At-risk Patients with Real World Data
With the rise of low cost genome sequencing and AI-enabled medical imaging, there has been substantial interest in precision medicine. In precision medicine, we aim to use data and AI to come up with...
Burning Through Electronic Health Records in Real Time With Smolder
In previous blogs, we looked at two separate workflows for working with patient data coming out of an electronic health record (EHR). In those workflows, we focused on a historical batch extract of EHR...