Channel: Frank Austin Nothaft – Databricks
Browsing latest articles
Browse All 23 View Live

Accurately Building Genomic Cohorts at Scale with Delta Lake and Spark SQL

This is the second post in our “Genomic Analysis at Scale” series. In our first post, we explored a simple problem: how to provide real-time aggregates when sequencing large volumes of genomes. We...

Monitor Medical Device Data with Machine Learning using Delta Lake, Keras and...

On August 20th, our team hosted a live webinar—Automated Monitoring of Medical Device Data with Data Science—with Frank Austin Nothaft, PhD, Technical Director of Healthcare and Life Sciences, and...

Engineering population scale Genome-Wide Association Studies with Apache...

Try this notebook series in Databricks. The advent of genome-wide association studies (GWAS) in the late 2000s enabled scientists to begin to understand the causes of complex diseases such as diabetes...

Parallelizing SAIGE Across Hundreds of Cores

As population genetics datasets grow exponentially, it is becoming impractical to work with genetic data without leveraging Apache Spark. There are many ways to use Spark to derive novel insights into...

Introducing Glow: An Open-Source Toolkit for Large-Scale Genomic Analysis

The key to solving some of today’s most challenging medical problems lies in the analysis of genomics data. Understanding the impact of the minor changes in an individual’s genome on their overall...

Automating Digital Pathology Image Analysis with Machine Learning on Databricks

Check out our solution accelerator notebooks for automating digital pathology analysis or watch our on-demand webinar to learn more. With technological advancements in imaging and the availability of...

Building a Modern Clinical Health Data Lake with Delta Lake

The healthcare industry is one of the biggest producers of data. In fact, the average healthcare organization is sitting on nearly 9 petabytes of medical data. The rise of electronic health records...

Introducing GlowGR: An industrial-scale, ultra-fast and sensitive method for...

Today, we announce that we are making a new whole genome regression method available to the open source bioinformatics community as part of Project Glow. Large cohorts of individuals with paired...

Detecting At-risk Patients with Real World Data

With the rise of low cost genome sequencing and AI-enabled medical imaging, there has been substantial interest in precision medicine. In precision medicine, we aim to use data and AI to come up with...

Burning Through Electronic Health Records in Real Time With Smolder

In previous blogs, we looked at two separate workflows for working with patient data coming out of an electronic health record (EHR). In those workflows, we focused on a historical batch extract of EHR...
