Further to our previous article on the new sequence listing standard ST.26 at WIPO, the implementation date of ST.26 has now been delayed until 1 July 2022.
This means that any PCT applications with a filing date as of 1 July 2022 (and not the priority date). This includes any divisional applications filed after 1 July 2022 (even though the parent application was filed before 1 July 2022).
WIPO ST.26 is an XML document (eXtensible Markup Language) with elements and attributes, which is very different from the text document with numeric identifiers currently required in WIPO ST.25. The change to WIPO ST.26 aims to provide a uniform format of sequence listing data across member states that is both human and machine-readable for automated validation and data exchange and can be searched in publically available databases.
One of the main changes of ST.26 is that the sequence listing must not include any sequences having fewer than ten specifically defined nucleotides, or fewer than four specifically defined amino acids.
A “specifically defined” nucleotide or amino acid is any nucleotide other than those represented by “n” or “x”. Only specifically defined residues count towards the new minimum length requirement in ST.26.
A summary of the main changes between ST.25 and ST.26 is set out in the table below.
| WIPO ST.25 (for PCT applications with a filing date prior to 1 January 2022) | WIPO ST.26 (for PCT applications with a filing date after 1 July 2022) |
|---|---|
| Not required to include: -D-amino acids -Linear portions of branched sequences -Nucleotide analogs | Must include: -D-amino acids -Linear portions of branched sequences -Nucleotide analogs |
| Feature keys (e.g. 5’ UTR) as annotation of sequences | Feature keys (e.g. 5’ UTR) and optional qualifiers as annotations of sequences (e.g. allele, function, gene, gene_synonym, map, note, standard_name, trans_splicing) |
| Permitted to include: -Sequences <10 specifically defined nucleotides -Sequences <4 specifically defined amino acids permitted | Not permitted to include: -Sequences <10 specifically defined nucleotides -Sequences <4 specifically defined amino acids permitted |
| Sequences identified as DNA, RNA or PRT | Sequences identified as DNA, RNA or AA Must include a mol_type qualifier to further describe the molecule. These are: For DNA: Genomic DNA, other DNA or unassigned DNA For RNA: Genomic RNA, mRNA, tRNA, rRNA, other RNA, transcribed RNA, viral cRNA, unassigned RNA For AA: Protein |
| Organism names: -Latin genus/species -Virus name -“artificial sequence” -“unknown” | Organism names: -Latin genus/species -Virus name -“synthetic construct” -“unidentified” |
| “u” represents uracil in nucleotide sequences | “t” represents uracil in RNA sequences and thymine in DNA sequences |
| Amino acid sequences represented by three letter abbreviations | Amino acid sequences represented by one letter abbreviations |
| “n” and “Xaa” variables must have a definition provided in a feature | Default value assumed for “n” and “X” variables with no definition required “n” has a default value of “any one of ‘a’, ‘c’, ‘g’, or ‘t/u’, “unknown” or “other” “X” has a default value of “any one of ‘A’, ‘R’, ‘N’, ‘D’, ‘C’, ‘Q’, ‘E’, ‘G’, ‘H’, ‘I’, ‘L’, ‘K’, ‘M’, ‘F’, ‘P’, ‘O’, ‘S’, ‘U’, ‘T’, ‘W’, ‘Y’, or ‘V’, “unknown” or “other”. |
| All priority information included | Only the earliest priority information included |
| All applicant and inventor names included | Only one applicant permitted Can also optionally include one inventor name |
WIPO has released a desktop tool to support the generation of ST.26 compliant sequence listing. Our team have spent considerable time familiarising themselves with the new standard. Therefore, your usual Forresters attorney will be ready to provide compliant sequence listings for our clients, when this comes into force in 2022.
If you have any questions about the changes to sequence listing format in PCT applications, please contact your usual Forresters attorney.