Software

Sinitcyn Lab Software Photo

In mass spectrometry, even the best experiments produce complex data, so “garbage in, garbage out” is always lurking in the background. So we spend our days sorting signals from noise with a smile.

DecoyGen

DecoyGen illustration

In MS-based proteomics, false discovery rates are typically estimated with target–decoy strategies, but common decoy generators (e.g., reversing or shuffling sequences) often produce unrealistic peptides and distort score distributions, undermining FDR confidence. We use protein language models to generate more realistic decoy peptides by masking and resampling amino acids, and validate them with target–decoy separability tests and MS-based shotgun proteomics data (DDA and DIA).

fdr target-decoy protein language models
View on GitHub

IgNovo

A Python-Rust package for de novo protein and peptide sequencing from mass spectrometry data using combination of deep learning and graph-based methods.

de novo sequencing top down proteomics antibody discovery
View on GitHub