Blog posts

2025

Migrant Workers Playbook

4 minute read

Published: March 01, 2025

Remember to document unsafe labor practices. The only way to properly report incidents including lost wages is to include:

Migrant Workers Playbook Es

5 minute read

Published: March 01, 2025

Recuerde documentar las prácticas laborales inseguras. La única manera de informar adecuadamente incidentes que incluyan salarios perdidos es incluir:

Debunking Coronavirus Conspiracy Theories

9 minute read

Published: February 18, 2025

I refer the reader to the primary source² for this discussion, a US House Oversight Committee hearing on the Coronavirus response featuring testimony from the former director of the U.S. Center for Disease Control and Prevention (CDC), Dr. Robert Redfield.

Thoughts on ML deployments, containerized workflows, and notebooks.

6 minute read

Published: February 06, 2025

This article hopes to bring the reader up to date (ca. 2017-2022) on modern cloud-native and scalable solutions for data science and natural science research application stacks using the Docker container standard for container specification (vs Singularity, Podman, or containerd containers that are equally valid). First I will provide a brief description of the goal of Docker containers. Next I’ll touch on the kubernetes architecture for distributed data processing and application service management. Finally, I’ll describe code repository, container registries, and Markdown/Rmarkdown/LaTeX documentation as it purtains to a service’s lifespan w.r.t. notebooks and documentation of custom services and their orchestration.

Minimizers A Bioinformatic Primer

less than 1 minute read

Published: January 27, 2025

2024

Benchmarking Aligners

1 minute read

Published: December 23, 2024

Short-read aligners make up the core of modern bioinformatics program technologies. The aligners are responsible for mapping (rather than aligning) short reads (typically produced by Illumina sequencers) to their most likely originating loci with gapped and ungapped, genomic and transcriptomic methods. For lists of both aligners and short-read mappers and their pros/cons, see the Wikipedia article.

Youth Falls In Night

less than 1 minute read

Published: December 12, 2024

I came to life from two unknown trees, Roots tangled deep in mystery’s seas. Yet arms reached out, so strong, so kind, To plant me anew, a love redesigned.

Roots And Wings

less than 1 minute read

Published: December 12, 2024

I came to life from two unknown trees, Roots tangled deep in mystery’s seas. Yet arms reached out, so strong, so kind, To plant me anew, a love redesigned.

Ruining Programmerhumor Memes 2

2 minute read

Published: December 10, 2024

monolithGang

What America Means To Me

5 minute read

Published: November 28, 2024

Where does this begin?

Basic Python 2

16 minute read

Published: November 20, 2024

Skip the description, let’s get to the code!

Probability And Markov Sequences

7 minute read

Published: November 18, 2024

The interesting macromolecules inside a cell are sequences

Modern Data Literacy Guide

6 minute read

Published: October 16, 2024

What is data journalism

Implementing D2 Metrics In Cython For Kmer Count Profile Distance

1 minute read

Published: October 12, 2024

D2 metrics to compare sequences from kmer frequency vectors

State Of The Cpu Market Servers And Workstations

6 minute read

Published: October 10, 2024

What is the state of the CPU market

consider server CPUs and workstations or small PC CPU architectures for high-performance compute.

Ricing In 2024

less than 1 minute read

Published: October 07, 2024

Customization is essential for maximizing visual appeal and ease of use of the computer system.

Linear Runtimes For Quasimapping And Alignment Free

1 minute read

Published: October 07, 2024

I want to discuss linear runtimes and what that means in alignment-free methods for bioinformatics and sequence alignments and quasi alignments. First, it is the splitting of the sequences, as they are read, into, let’s say, ‘a’ De Bruijn graph. This graph consists of the k-mers, their neighborhoods, and of course the walks or paths through the graph that constitute optimal criteria and local maxima of course for traversal and contig/walk/path maximization. Typically, a search through the De Bruijn structure may be Breadth-First to find optimal depths for traversal of the path through the De Bruijn structure, optimizing for creating some sequence. This leads to read collapse along the sequence unidirectionally (bidirectionally in a unidimensional space) along the sequence space.

Ricing In 2024

less than 1 minute read

Published: October 06, 2024

Productivity In Org Mode

4 minute read

Published: October 06, 2024

Goals

Industry Skillsets And Conformity Vs Academic Topics For Q3 2024

2 minute read

Published: August 24, 2024

What topics are currently under focus?

Network Attached Storage (NAS) vs storage server

6 minute read

Published: August 08, 2024

What is Network Attached Storage (NAS)?

Python programming environment

4 minute read

Published: July 29, 2024

Python

Productivity Setup

4 minute read

Published: July 28, 2024

Ubuntu-based Linux System

Kmer Database Format Part 3

7 minute read

Published: July 11, 2024

Summary

2022

Kmer Database Format Part 2

5 minute read

Published: June 22, 2022

Summary

2020

How do I become a bioinformatician?

22 minute read

Published: September 05, 2020

I see plenty of posts on /r/bioinformatics of students or mid-career professionals asking what it takes to become a bioinformatician. What skills should I have? What programming languages should I learn? What course should I take? Do I need a masters or PhD? Here is my answer to the question 'How do I become a bioinformatician?'

Sorting Out Modules in Python

11 minute read

Published: February 26, 2020

via Gfycat

Differentiation in Computational Research Environments

11 minute read

Published: February 16, 2020

Introduction

Benchmarking Python CLIs

4 minute read

Published: February 16, 2020

What is benchmarking and why do it?

Site Redesign

1 minute read

Published: February 12, 2020

Welcome!

2019

Kmer Database Format (Part 1.)

33 minute read

Published: October 26, 2019

The goal of this blog post is to introduce the concepts of k-mer subsequences and blocked GNU-zip file (.bgzf) and suggest that they be used together to form a new file specification for younger bioinformaticians. If I'm successful, the reader should have a basic understanding of common k-mer packages, my opinion on the algorithms and APIs, and the challenge of understanding advanced computer science and benchmarking concepts utilized by those packages/algorithms from the eyes of novice, beginner, and even intermediate bioinformatics students.

Views of the ‘Second Shift’ Phenomenon in Children - Domestic Egalitarianism

10 minute read

Published: October 23, 2019

Review notes

These are some notes I jotted down while reading a study from Psychological Science linked by /r/science Reddit at 2.3k updoots. This article had a somewhat clearer design and premise than a more popular post (8.8k currently) this morning in the same subreddit. PDF to the former article and the Springer link to the latter. I am reviewing the article to talk with a new acquaintance on the phone about feminism and gender equality.

Magika - A Sword Art Online fanfiction

43 minute read

Published: September 08, 2019

Officially, magic was a forbidden component of Sword Art. Magical skills, item stats, criticals, and ability leveling may have elements beyond the typical diagnostic expertise of the players. Yet, enchanted items, durability, and other components of the material realm of Sword Art have mechanics of randomness, high-level crafting, beyond the scope the typical player will invest into the game. Additionally, defensive and offensive techniques also have randomness and executional nuance beyond the scope of most players’ level of interest. The code of the game actively encourages this obfuscation to provide a simple experience for the players, and even goes so far as to encode a level of instinct that Kirito granted Cline in the West Field. The user interface and neurological link was indeed part of harnessing full potential in the battlefield, involving physical, mental, and psychological nuances. The West Field had reminded him of other MMO’s he had played online; a pointless grind for experience and loot, almost deliberately trivial in comparison to the dungeons and party mechanics. Sure the social components of the game were important, but you never know who’s going to fail to support your flank and ensure survival. Since Kirito was a solo player, it had always seemed easier to grind in the fields so he had something to bring to the table when he actually joined a party.

How to build a scientific calculator in Python - Part 1.

44 minute read

Published: June 15, 2019

About a 45 min read.

Top 10 - Productivity Tricks I Learned From Grad School

22 minute read

Published: May 29, 2019

Things that made me faster in grad school

Basic Skills in Computational Research Environments

35 minute read

Published: April 14, 2019

About a 25 minute read

2018

First time with Common Workflow Language (CWL)

8 minute read

Published: December 02, 2018

If Perl is Glorified Shell, Shell Scripts Are Dead. Long Live Shell Scripts

AWS S3 System Backup Tutorial

13 minute read

Published: November 07, 2018

This article is geared for novice to intermediate users of OSX and Linux, perhaps a 20min read.

Review of Ken Robinson’s Changing Education Paradigms

11 minute read

Published: October 01, 2018

In this post I’m going to try something a little bit different. I’m presenting at my alma matter this fall to talk about my research for Bristol Myers Squibb. As I’ve been reflecting on objective differences in project longevity, rigor, or collaborative style in preparation for some early-mid 20’s students, I revisited one of my favorite RSA Animate talks: Ken Robinson’s ‘Changing Education Paradigms’.

Ruining /r/programmerhumor memes

5 minute read

Published: July 05, 2018

Industry Science

3 minute read

Published: June 28, 2018

See the great thing about science and technology jobs is our transparency and focus on the problem, not on the finance. You could say that the tech sector is very scientific and rational. You see, we rationalize taking advantage of liberally licensed software.

Pro-Pharma Rant and the Nebula Network

3 minute read

Published: April 16, 2018

Hi everyone, have you ever wondered what would be a good idea for an app? Since I became a developer, now I have *some* of the tools to potentially create some basic web applications, command line tools, etc. For some developers who aren't creating regularly, it may seem like your work actually gets in the way of the creative process, and you might not get the time to make that killer app you really wanted to build.

Beginner Array/List Methods

17 minute read

Published: April 14, 2018

On Python Documentation with Sphinx

1 minute read

Published: March 20, 2018

I’ve been exploring the Python documentation system for ReadTheDocs with Sphinx. It could be that I’m dense, but the sphinx-quickstart command seems horribly divorced from most of the automagic documentation goals of the Sphinx project. After following several guides^{1, 2} I noticed that only an index.rst had been created in the _sources directory. Brandon’s guide suggests that adding your module to either your PYTHONPATH or to your conf.py before make html would lead to autodoc recognizing your package. I wasn’t able to get this to work on OSX Sierra using a virtualenv. After looking through more tutorials, I encountered some suggestions for the sphinx-apidoc command, without actual invocation details.

Hello World

less than 1 minute read

Published: March 17, 2018

Hello world, I am building my first blog for science and technical topics called Not Very Humerus (NVH). I’ll be adding periodically to this collection and focusing on the technical side of my journey for a bit. I’ll try to proceed chronologically with some retrospectives at the beginning of the blog to focus on my experiences in University.

Matt Ralston

Blog posts

2025

2024

monolithGang

Where does this begin?

The interesting macromolecules inside a cell are sequences

What is data journalism

D2 metrics to compare sequences from kmer frequency vectors

What is the state of the CPU market

Goals

What topics are currently under focus?

What is Network Attached Storage (NAS)?

Ubuntu-based Linux System

Summary

2022

Summary

2020

Introduction

What is benchmarking and why do it?

Welcome!

2019

Review notes

Things that made me faster in grad school

About a 25 minute read

2018

If Perl is Glorified Shell, Shell Scripts Are Dead. Long Live Shell Scripts

2015

Popovers