5+ yrs. professional
“The scientific method is like a bicycle: you can go somewhere on it if you want to, but it’s not really the sort of thing you can just play around with.” - Terry Pratchett
About Me
Target audience: scientists, engineers, managers, and academic partners
Seeking a dynamic addition to your team? I bring 5 years of experience in industry biopharmaceutical R&D teams coupled with 10 years of research with Linux and data analysis with Python/Perl/R/Rust. Let’s elevate your projects together!
Do you consider cultural values, communication style, productivity, and dedication in your decision to hire new talent? Do you require a professional adept at communicating with engineers and scientists from diverse disciplines?
Do you need to know something more specific about my laboratory experience, software projects, data science experience, industry experience, or coding proficiencies? Use the links on the sidebar to find out more.
See the sidebar"s dropdown for more "About Me" details.
Who am I?
I’m Matt Ralston, a natural science expert, data science, scientific and cloud computing, and knowledge-economy professional. Outside of this, I’m a phenomenal cook for my parents, I play 4 instruments and I’m a self-taught guitarist for over 20 years, and I enjoy playing soccer for a recreational league in Delaware. I’ve played in bands, coffee shops, and as a street performer. In my previous life, I was a Undergraduate Research Fellow at the University of Delaware in the area of colorectal-cancer stem cell research.
Matt in a nutshell
Works in groups of engineers, analysts, and wet-bench laboratory professionals, team leads, associate directors, and alike. Listens to as many voices as possible but works independently quite well.
I just enjoy the journey and the process of collaboration as much at the technical know-how.
- has soft skills: style and brevity in communication
- track-record and wide expertise: engineering, chemistry, biology and genomics, bioinformatics, and Illumina WGS/WES/RNAseq/metagenomics
- agile methodology: strong documentation abilities while working independently
- system design mindset: design, benchmarking and testing
- cloud-agnostic: works with commodity cloud vendors using Docker containers
Life science specialty
So, I do wet-laboratory science as well… I’m effecient, effective and timely in the lab. My aseptic technique has led to less errors and better correlations between replicates and I’m used to handling sensitive materials during molecular biology methods all the way to organic synthesis. In the lab, I’m always cleaning something in the autoclave and I know where the imaging equipment is. I can perform single-gene targetted expression studies (sometimes referred to as qPCR or rt-PCR) on mRNA molecules but I also work with Illumina lab reagent kit technologies for multiplexed experiments..
Industry experience
Agile software development and scientific computing
12+ years of software development experience
I work in GNU/Linux and MacOS/OSX environments on biological sequence analysis, data science, and Illumina whole-genome, whole exome, and RNA-sequencing problems. I have 5 years of industry experience writing Python, Perl, R, Javascript, HTML/CSS, SQL, bash and recently, both web applications (FastAPI, Flask, Leptos, WASM) and safe-memory GUIs/TUIs with Python and Rust.
My projects have included technologies such as kubernetes code, Dockerized workflows, shell scripts for HPC environments and grid engines like UGE/SGE and SLURM. I can build MVC/MVVC web applications and I studied database systems (SQL) at the graduate level. I am talented in cloud computing techonlogies such as AWS (Amazon Web Services).
I’m a full-stack developer too: not just web application development… relational databases, commodity cloud vendors, Javascript/HTML/CSS, front-end frameworks and dashboarding (mostly jQuery, Vue.js, Alpine.js, TailwindCSS, D3.js, and SVG), REST-API design, and database normalizations. I wonder what your team team can do with this?
Specific stories and industry experiences are given below.
- Scripted automation tasks for Bristol-Myers Squibb in Perl and Python to optimize repetitive processes, reducing manual effort by ~ 40%.
- Developed robust backend systems and automations for about 10-20 bench-scientists using Python, NodeJS, and R so they didn’t have to manage spreadsheets and upload data to BMS archives by hand.
- Implemented cloud infrastructure on AWS for Bayer Crop Sciences: leveraged services such as EC2, S3, and Lambda for scalable and cost-effective solutions.
- Demonstrated excellence in laboratory skillsets with replicate correlation coefficients (Pearson on normalized count data) > 95% on Illumina HiSeq 2500 RNA-Seq libraries.
- Containerized applications and workflows using Docker and Kubernetes (k8s), ensuring consistency across development, testing, and production environments.
- Utilized Linux command line tools, including bash, grep, and sed, for data manipulation and system administration tasks.
- Designed and implemented interactive data visualizations (I built a genome browser and interactive data visualizations) using D3.js to enhance data-driven decision-making processes.
- Managed document preparation and typesetting using LaTeX for high-quality technical documentation.
Wet-bench Molecular Biology and Biochemistry
5+ years of published laboratory biochemistry and molecular biology expertise
I have worked in a variety of laboratory settings including analytical chemistry, organic synthesis, molecular biology and disease physiology research, translational oncology labs with patient tissue samples, extreme anaerobic and BSL-2 microbiolgy aseptic culture, and much more. In addition to laminar-flow hoods and GLP environments, I am familiar with a wide range of instrumentation and bioassay designs. I have worked with qRT-PCR, immunological assays, immunofluorescence, Northern/Western blots, HPLC, GC/LC-MS, UV-VIS and more. Please see the skills section for specific details and a list of known lab protocols.
I keep a meticulous laboratory notebook. I can re-run batches of samples if their quality is off (e.g. RIN score). Also, I deeply enjoy performing molecular assays at the bench, whether its genomics, microbiology, human or mammalian disease pathology, genetics, and physiology. I am detail oriented and very organized and I focus on replicate quality in-between samples and I understand instrument limitations, positive controls, and core goals of hypothesis testing and data modeling.
If you’re looking for a hybrid wet-bench and data scientist who can aliquot 1uL reagents to pipette instrument tolerance, or who can run a multi-factor experiment or bioassay from innoculation through sample preparation and instrumental analysis, I’m your guy.
Skills
Team skills
Culture:
I’ve received glowing recommendations from industry managers on the topics of productivity, code maintainance, and team behaviors; I truly enjoy delivering value on projects within teams of laboratory scientists, programmers, engineers, and managers on projects in wet-bench biology and bioinformatic research areas. I study algorithms, data-science, and discrete math in my spare time. In fact, you should check out the my /software page, the /research page, my life sciences message at /biosciences, my profile on LinkedIn, and my manager testimonials section and perhaps my publicly facing GitHub and /portfolio pages. You’ll also see I have expert familiarity with publishing first-author and collaborating author articles.
I write usable and effective documentation, I have excellent skills in literate programming with Rmarkdown, markdown, Pandoc, Infrastructure-as-code (IAC), and reproducibilty reports and analyses. I have a modest https://github.com/MatthewRalston with 3-5 open source genomics projects concerning sequence alignment conceptual understanding, molecular species and diversity, alignment-free algorithms for sequence similarity and assembly, correlation analyses, k-mer and minimizer expertise, and Illumina sequence data profiling. I am enthusiastic about my software engineering, my scientific and code development, and my Discovery and R&D processes.
Communication:
I’ve had great experiences with technical writing, science, engineering, and stats/probability/mathematics and I’ve delivered value and knowledge to others without losing touch. Your high-value team demands a well-written style as well as the know-how in your publications, meetings, and documentation. Clear and compelling written style is essential to today’s business successes, tomorrow’s publications, and your quarterly progress reports. Want to see more? Check out the publications or testimonials sections for more reasons why I’m a great fit for your business needs.
Closing remarks on team skills
Five years in industry bioinformatics and software engineering teams have taught me this: the knowledge-economy requires skills AND attention to detail, time-management, project management, and public speaking, in addition to the technical accumen needed to deliver value for teams.
With deep and well-rounded expertise in the natural sciences (biology and chemistry) and in data analytics, algorithms, mathematics, and statistics, I have affected several teams and their publications, screening methods, and more. My team-oriented style begins with a fundamentals-first approach to discussing expertimental or analytical concerns with your Principal Investigator. When the artifacts or biophysical aspects of a assay design require expert treatment and know-how, this backgroun lets me quickly integrate suggestions from team members into the analytical process (like filtering bad leads or integrating additional knowledge-bases for specific cases within large datasets).
Engineering skills
- Computer Science MSc. (MSc. in Comptuer Science, concentration in bioinformatics, 3.96 GPA, UD 2015)
In addition to my considerable training in analytical/physical/organic chemistries and molecular biology and genomics, I possess very strong skills in programming and data science. With a bioinformatics major in graduate school at Univ. of Delaware, I have formal training in advanced calculus, machine learning and statistics, and software engineering.
Additionally, I have practiced many skills in industry, leading efforts in Bristol-Myers Squibb company initiatives such as legacy software development and maintainance, reproducible research, dashboard development, software application development, database management and data version control, and programming language adoption.
Software development
-
Cloud and containers: Amazon Web Services (AWS), Docker and related container technologies,
containerd
, OCI, kubernetes/k8s -
Programming: Rust, Python/C-Python, R, R-Shiny, bioconductor and Rmarkdown/Quarto,
bash
, Javascript/NodeJS/Bun,emacs
, D3.js, Julia, Perl,awk
,grep
,sed
, LaTeX, Matlab, HTML5/CSS3, SQLite3/libSQL, MySQL/MariaDB, Oracle SQL, MS SQL Server, PostgreSQL, miniconda -
Relational Databases and noSQL technologies: MongoDB, Microsoft SQL Server, Oracle SQL, MariaDB/MySQL, SQLite3/libSQL, PostgreSQL, SQLalchemy and Object-relational-mapping (ORM), Rust ORMs,
sqlx
, rdflib, snapshotting -
Cloud Computing and Cloud Native: GCP, Azure, AWS, S3, Elastic Compute (EC2), CodeBuild/CodeDeploy, Elastic Container Repository (ECR), Elastic Container Service (ECS), Docker, IAM, kubernetes/k8s
-
Networking: Slurm, Sun Gride Engine (SGE/UGE), firewalld, iptables,
tmux
, nixos -
Systems Administration:
systemd
, Docker, Ansible/chef,tmux
,nginx
and Apache/httpd
web servers, Sun Grid Engine (SGE/UGE) and SLURM, bash programming, parallel
Statistics and data science
-
Statistics: p-values, Student t-test, Chi-Square test, Fisher’s Exact Test, Hypothesis testing, ANOVA/GLM/regression, linear algebra, limma, distribution fitting, R/Rstudio/Rshiny, Bioconductor, discrete/continuous probability, multivariate statistical analysis, regression, least-squares estimation, PCA, clustering, classification, normalization, regularization, variance reduction, naive Bayes, random forest, XGBoost, DESeq2
-
Dimensionality reduction: PCA/SVD, Uniform Manifold Approximation and Projection (UMAP), t-SNE, canonical correlation analysis,
-
Optimizers: Gradient descent, simplex, particle swarm, simulated annealing, self-organizing maps, others.
-
Kernels: kernel trick, linear programming, matrix math, linear algebra, abstract algebra, discrete math
-
Maching learning: perceptron, SVM, PCA, UMAP, t-SNE, Deep Neural Networks, Recurrent Neural Networks, and Artificial Intelligence (AI) recurrence relations, linear regression and formal OLS-family assumptions and caveats, SAS programming, R programming, Python programming
-
Data Science: Python programming,
numpy
,pandas
, Jupyter notebooks, Markdown, LaTeX/Pandoc, staticly-typed compiled languages (rustc
/cargo
,zig
, Typescript, Scala/Java/Clojure, C extensions in Python and R), RAII, CUDA, Cython, anaconda, apline linux, Docker, kubernetes/k8s,
Bioinformatics
-
Sequence alignment: Bowtie2, BWA, BLAST, Multiple-sequence alignment, Muscle/Clustal, Hidden Markov Models, E-values
-
Data processing: See data-science above.
-
Biostatistics: See Statistics above.
-
Genome/metagenomics assembly: Trinity, Velvet, SPAdes, metaSPAdes
-
Illumina data processing: Fastqc, Samtools, Picard, Bedtools
-
NCBI training: Pubmed, literature research, PFAM, KEGG, Sequence Read Archive
Wet laboratory skills
- Laboratory sciences BSc. (Chemistry department bachelors degree with specialization: BSc. Biochemistry, University of Delaware, 2012)
I have demonstrated competencies in many laboratory methods as well as soft skills related to team organization, and I have graduate-level course work and 5 years of industry experience and 7 years of laboratory work in the natural science areas covering [molecular biology, human and mammalian genomics and immortalized cell-line culture, mammalian physiology, microorganismal genomics (bacterial, viral), metagenomics and phylogenomics, biofuels and cell culture bioreactor,] process optimization, [novel methodologies and protocols, Illumina RNA-seq library preparation, qRT-PCR, RT-PCR, PAGE (Western blot, Northern blot), ELISA, Chromatography (TLC, HPLC), Mass Spectrometry (GC-MS, MS-MS, LC-MS), RNA quality analysis/RIN, Nanodrop, nucleic acid extraction, enzymatic treatment], Flow Cytometry/FACS-Aria, [light microscopy, fluorescence microscopy, immunohistochemistry (IHC), immunofluorescence (IF), gene knockdown, cloning,] bioassay development, and additionally [media formulation, contaminant testing and detection], [technical and software expertise], [GLP, ELN], and additional undergraduate lab and bioscience, [natural sciences, undergraduate mathematics, probability, statistics, molecular diversity], [data standards and reporting, technical writing, trends and conventions in biodata standards].
NOTE: Skills listed above in brackets indicate genuine industry experiences and/or experience at a professional level.
Additional skills
Communications and Documentation
I work independently with consistent email and Slack communications, and project updates and logging via Version Control and git
. My project management software is often plain text, also git-based, metrics, org-files for Pandoc compatibility, second-brain, markdown where possible, license documents, source files and configurations, Dockerfiles where possible, and I write LaTeX primarily in .Rmd
.
Committed to continuous learning and staying updated with the latest advancements in these fields, my breadth of knowledge allows me to model, program, and problem solve in engineering teams. With my natural science knowledge I can ask intelligent questions about variables and covariates, treatment conditions, and their effects in your experimental design. I write clearly in technical and team messaging.
I have stylistic Javascript and UI/UX ecosystem and preference set and functional and OOP style knowledge. I can write CSS/HTMX, and I can build web apps in any language. I’m very partial to having a static asset stack where possible, Nginx, and I like Github Pages and Jekyll CMS. Most of my content is Markdown, plain-text, and org-mode when it’s not software projects.
That being said, most of my technical reporting is LaTeX, Pandoc, and Rmarkdown for scientific documentation.
I use Sphinx and Markdown for most simple code-and-a-story literate programming.
I am no stranger to Flask, FastAPI, Dask, OpenMPI, LAPACK, Tableau, Jupyter, and other standard app-and-reporting codebase trends and ease-of-use and lifetime centric development.
I’ve worked in Agile teams and I am familiar with scrum, sprints, stories and other development team concepts.
I’ve used Slack, Skype, and Atlassian distributed-team software and I am consistent on issues tracking, reporting, kanban, application lifetime management suggestions, and of course version control and repository feng-shui.
GTD/Productivity mindset
Productivity is a modest topic for discussion and I find it enjoyable because you see others perspectives on engineering concerns, assumptions, testing needs, Blue-Green workflows, Red-team and Green-team development, and of course GTD methods when you’re with new colleagues and their perspectives.
Projects and time management
My workflow centers on the README.md
, software usage patterns and documentation, Sphinx, Oxygen, and Pandoc documentation, literate programming, org-mode, and Emacs.
I use the zed editor, PyCharm, Emacs and vi, tmux, and the bash shell for my development environment and I focus heavily on commit messsages, gitflow, rebase/merge/squash source tree design, and I use plain text and org-mode to create and manage my TODO lists and organize information in a Pandoc compatible way.
My task and time management largely revolves in Atlassian and Github kanban and issue tracking systems. I also can use Emacs functionality to perform clock-ins on tasks and git to manage versions of documentation and code, in source trees, to perform time estimates and prototype cost estimation.
I am flexible regarding your task and time tracking and coding requirements speicified by your org chart and PM guidelines.
Server side vs UI and UX preferences
I use Rmarkdown and Python for most of my direct data science stack. I use Python SQLAlchemy, Flask, and Rust ORMs and RDBMS technologies to create normalized data structures during ETL. My experience in scale-up and cloud deployment was largely driven by internal protocols and reporting, lift-and-shift deployments, cloud-native or vendor-specific cloud stacks such as AWS, GCP, and Azure. My ETL code is mostly Python, numpy, CUDA, and/or rust
where I get the time to develop robustly and comfortably at my knowledge of static type function signatures and handling types robustly with GPT4.0-o1-mini in zed
or Claude Sonnet 3.5+ mobile to assist with code when needed.
I use minimal software stacks and don’t do much UI development, but I’m no stranger to Javascript and CSS frameworks. I don’t use Vue.js or Next or Nuxt or MEAN or MERN in most of my app programming. I’m just not a front end guy, I like systems that allow me to look at large sets of variables in a flexible way w.r.t. number of samples and necessary EDA and diagnostic outputs.
I’m a backend programmer and ETL and Data Science are my most accomplished skillset in terms of what I can do. But often, you’ll want a ‘full-stack developer’ in the classical sense too.
Disclaimer and software licensing
I definitely have some expertise with modeling, data-processing, and visualization techniques such as R, Python, Javascript, Rust, Linux/bash, Docker, AWS/cloud and more. But It’s more than just websites, APIs, numbers and figures here; my strong fundamentals and data-driven design strategy have been employted towards diverse goals in industry: hypothesis generation, key-questions and KPIs, application or integration needs, and more, such that my products are reproducibile, documented, and clearly written. I’m adaptable and overcome obstacles with teams that work together whether in-person or remote.
I am familiar with open source standards, commercial licensing and procurement, software licenses, and EULAs.
And god willing I love working on multiplexed datasets in the lab and on the computer.
I have some conceptual familiarity with a lot of different methods from chemical analytical, bioassay, immunoassay, fluorimetry, MS, GC-MS, LC-MS, ELISA, and an intimate understanding of cell culture and laminar flow hoods.
My last publication featured Illumina of course, but the cherry on top was validations of my RNA sequencing with Northerns and I discovered novel lnc-RNAs, antisense RNAs, RNAi, and other species with strand-specific protocols.
Training
Univ. of Delaware B.Sc. Biochemistry (2012) and M.Sc. Bioinformatics and Computational Biology (3.96 GPA, 2014)
Undergraduate program
I developed laboratory fundamentals as a wet-bench laboratory biochemist and molecular biologist. I learned aseptic culture, some immunological, and fluorescent assay techniques, and RT-PCR in a translational cancer laboratory. The UD chemistry department emphasized other laboratory fundamentals such as HPLC, GC-MS, spectrometry, and organic synthesis.
Subects studied include organic chemistry, physical chemistry and thermodynamics, and biochemistry, molecular biology, genetics, physiology, microbiology, virology, physics, linear algebra, probability, calc I/II/III and others.
Graduate program
My thesis was hybrid wet-lab and dry-lab 2-year project developing the first strand-specific Illumina NGS map of the C. acetobutylicum transcriptome during solventogensis and sporulation stresses. Owing of course to the help of Drs. Papoutsakis and Venkataramanan, I investigated the whole-transcriptome response of a biofuel-producing bacterium to the biophysical stresses of solvent/biofuel accumulation and associated stress-response adaptations in anaerobic fermentation conditions. I formulated a fixed media, prepared the bioreactors and innocula, sampled across time points for small molecule characterization by HPLC, RNA-quality investigation, subsequent PCR-mediated Illumina HiSeq 2500 library prepration, and delivered my samples to the Delaware Biotech. Institute core for sequencing.
Afterwards, I developed both genome browser and interactive volcano-plot visualization web-applications for assessing differential expression of transcripts across the time-stress design.
Subjects included in my graduate program include metabolic engineering, bioinformatics, biostatistics, computational biology, database systems, algorithm topics, sequence alignment, Illumina NGS, biostatistics, Linux HPC training, web application development, Perl, Python and bourne shell (bash) programming.
Contact
Do you want to connect?
If you have a research question or you would like to discuss wet-bench, software, or bioinformatics please let me know.
If you are a recruiter please provide a location, time-frame to hire, and a full job description as plain-text in an email to my gmail.
If you are an interviewer and would like to schedule a phone or zoom interview, please copy and paste the following text into your email.
1
2
3
4
5
6
7
8
9
10
name: Jane Doe
email: jane.doe@example.com
company: Example dot com
department: Data engineering
subject: I'd like to contact you about the following opportunity
date: Date (- of opportunity)
time: time of interview
description: A description of the position or opportunity you'd like to disclose to me.
question: Key questions I should address in my response
kpi: What are your indicators of whether or not this is successful, like a successful interview shedule by a certain date. A fee range you'd like to remain within. What makes our relationship successful and how may I discuss details more
Testimonials
When asked for references, I’ve received the following responses from former managers, indicating their support and continued pen-pal relationship during my early career.
Anonymous, Associate Director at Bristol-Myers Squibb
Whatever you need, big guy.
Anonymous, former boss and mentor at BMS
Of course. Any time. You don’t even need to ask.
References
1
2
3
4
5
6
# boss, manager, and program director at Bristol-Myers Squibb
Mark Russo PhD | russomf@hotmail.com
# Committee member and Program Director at Univ. of Delaware
Cathy Wu PhD | wuc@udel.edu
# manager and microbial genomics cloud computing expert at Bayer Crop Sciences
Kyle Lambert | kjlambert@ucdavis.edu