David Tulga is a Senior Software Engineer who most recently worked remotely for Iterative on the Python tool DataChain to accelerate Machine Learning and AI Training by providing an unstructured data ETL, analytics, and versioning framework. He also designed and implemented the parallel and distributed processing system included in the Iterative Studio version of DataChain. This work included integrating with cloud infrastructure on AWS and Google Cloud, databases and data warehouses such as PostgreSQL and ClickHouse, and the Iterative Studio React JavaScript frontend.
Previously from 2019 to 2022, he worked at Asimov
on laboratory records and sequence databases, Next-Generation Sequencing analysis pipelines, the cloud infrastructure powering those services, as well as designed and implemented the React JavaScript frontend used by laboratory scientists.
From 2017 to 2019 he worked for Freenome, on their large-scale genomic information database, integrations with machine learning,
and associated APIs and cloud infrastructure.
He also founded the open-source software company TransReality Technology, Inc. in 2017 where he developed
arcBlocks, a suite of C++ app development libraries,available on GitHub here.
From 2014 to 2016 he worked as a Software Engineer at Google on backend server optimization and analytics.
You can read more about this work and other projects on his LinkedIn profile.
He received his Bachelor of Science degree in Bioengineering
from UC Berkeley in 2009. During his time at
Berkeley, he was a Regents' and Chancellor's Scholar and an undergraduate
researcher in the Arkin Laboratory for
quantitative biology through 2008 and 2009. During the summer of 2007 he
participated on the 2007 Berkeley iGEM Team.
And during the summer of 2006 through the spring of
2007, he was an undergraduate researcher in Dr. Steven E. Brenner's
Computational Genomics Research Group.