Skip to content

Statistics CSV-file ​

What is a statistics file?

A statistics file contains the results of comparisons between groups (e.g. treated vs control) for each molecule, such as fold changes and p-values.

How to format the statistics file before uploading to Omics Studio ​

Your file must be a CSV file with:

IDComparisonFold_changeLog2_Fold_changep_valueAdjusted_p_value...
  • The following columns are required (case-insensitive):

    • ID
    • Comparison
    • Fold Change
    • Log2 Fold change
    • P-value
    • Adjusted P-value
  • Optional: you can add extra columns (e.g., metadata, annotations, etc.)

Example table

šŸ“„ Statistics (.csv)
IDComparisonFold_changeLog2_Fold_changep_valueAdjusted_p_value...
A0A1B0GTW7Treated vs Control1.450.840.0110.044...
H3BV60Treated vs Control-0.84-0.250.0910.102...
O14717Treated vs Control0.22-2.180.9430.121...
P05067Treated vs Control1.120.170.0510.943...

How Molecule IDs and Comparisons must be paired ​

  • Only one type of molecule ID (UniProt, Ensembl, CHEBI, or InChI) can be used per file – mixing molecule ID types is not allowed.
  • Each row represents a unique combination of a molecule ID and a Comparison combination.
    • The same molecule ID (e.g., H3BV60) can appear in multiple comparisons – for example, X vs Y and X vs Z.
    • However, each molecule ID + comparison pair must be unique. That means you cannot have two rows with the same molecule ID for the same comparison (e.g., two entries for H3BV60 in X vs Y).

Example

šŸ“„ Statistics (.csv)
IDComparison...
A0A1B0GTW7Treated vs Control...
H3BV60Treated vs Control...
A0A1B0GTW7Unreated vs Treated...
H3BV60Untreated vs Treated...

āš ļø If uploading several statistics files

Make sure that there is no dublicates across files, where the same ID and Group Comparison is mentioned. E.g. If UniProt ID O14717 combined with comparison X vs Y is in several files, you will not be able to create a study (including both files).

Show example

Statistics CSV file 1

IDComparisonFold_changep_valueAdjusted_p_value
O14717X vs Y1.230.0040.015
H3BV60X vs Y-0.880.0410.089
A0A1B0GTW7X vs Z0.350.1200.200
Q8IZX4A vs B-1.120.0090.030

Statistics CSV file 2

IDComparisonFold_changep_valueAdjusted_p_value
O14717
(āŒ duplicate)
X vs Y
(āŒ duplicate)
1.190.0060.020
H3BV60Z vs X0.910.0330.075
A0A1B0GTW7
(āŒ duplicate)
X vs Z
(āŒ duplicate)
0.400.0980.170
Q9XYZ1A vs C-0.510.2100.310

How to fix it:
Remove the duplicates from either Statistics 1 or Statistics 2 before uploading again.

General overview of the columns ​

If in doubt how to the rules apply to the different mandatory columns, you can learn more in the following.

Molecule ID columns ​

  • Must contain molecule identifiers (e.g. UniProt, Ensembl, CHEBI, or InChI).
  • Only one ID type is allowed per file.
  • No empty cells are allowed in this column.
  • Each ID must be unique.

Example

ID
A0A1B0GTW7
H3BV60
O14717
...

Comparison columns ​

  • Each value must represent a unique pairwise comparison, formatted as:

Examples:

  • Treated vs Control
  • Sample_A vs Sample_B
  • You must add vs between the two.
  • Format is case-insensitive (x vs y = X vs Y).
  • The two parts must be different (e.g. X vs X is invalid).
  • No empty cells are allowed in this column.

Example

Comparison
Treated vs Control
ConditionA vs ConditionB
Group1 vs Group2

Numeric columns ​

  • The following columns must contain numeric values (or NA):
  • Fold_change
  • p_value
  • Adjusted_p_value
  • Empty cells and NA are treated as missing values.
  • Any other text (e.g. missing, null, none) will cause an error.

Example

Fold_changep_valueAdjusted_p_value
1.450.00250.011
-0.840.0450.091
0.22NANA

Additional columns ​

  • You may include as many extra columns (e.g. gene names, annotations, or statistics) as you like.
  • These are not part of the validation criteria in Omics Studio when uploading, but must still have a non-empty header.
  • Empty values are allowed in these columns.

šŸ’” Good to know

āœ… Underscores, spaces, and hyphens are ignored (p value, p-value, or p_value are all valid). āœ… Plural ā€œsā€ at the end is ignored (comparisons = comparison).
āœ… Decimal separators can be either . or ,.
āŒ Thousand separators are not allowed.
āŒ Empty headers or missing required columns will cause upload errors.