Skip to content

Expression Matrix CSV-file

What is an expression matrix?

An expression matrix is a table that shows how much of each molecule (gene, protein, or metabolite) is found in each sample.

How to format the expression matrix before uploading to Omics Studio

Your file must be a CSV file with:

  • One sample per row
  • One molecule per column
  • Expression values as numbers, empty cells, NA, or NaN

Example table

📄 Expression Matrix (.csv)
SampleA0A1B0GTW7H3BV60O14717...
Sample 110.14101678284929.931865492523076.69890691314978...
Sample 29.735649734286538.0856590575154212.5898915913406...
Sample 311.502122605212610.79568441760078.75086505276381...
Sample 410.14101678284929.931865492523076.69890691314978...

Rules

Header (first row)

SampleA0A1B0GTW7H3BV60O14717...
  • The header is the very first line of the file. These contain column names (e.g. UniProt IDs)
  • There is no specific requriements for the first cell (A1). You can also leave it empty if preferred.
  • Every other column name must be a valid molecule ID (UniProt, Ensembl, CHEBI, or InChI).
  • Only one type of ID can be used per file — mixing ID types is not allowed.
  • No empty cells in the header.

First column

  • Contains sample names (e.g. Sample 1, Patient_A).
  • Each sample name must be unique and non-empty.
Sample
Patient A
Patient B
Patient C
...

Expression values

  • Each expression value must be a number or the text NA.
  • Empty cells and NA are all treated as NaN values.
  • Any other text (e.g. missing, abc) will cause an error.
10.1410167828492NA6.69890691314978...
9.735649734286538.0856590575154212.5898915913406...
11.502122605212610.7956844176007...
NaN9.931865492523076.69890691314978...

💡 Good to know

✅ You can use both types of decimal separators (e.g. 10.53 or 10,53).
❌ Thousand separators are not allowed!

Statistics CSV-file(s)