SEQXML
Updated 44 days ago
The SeqXML schema (XSD) defines the skeletal structure of the sequence files and allows one to set constraints for each type of data it contains: for example, one can limit a DNA sequence to consist only of {A,G,C,T,N}. If one then tries to import a DNA sequence containing a 'Z', this error will be detected automatically by any XML validator...
As with FASTA, a SeqXML file includes the gene or protein ID, a description and the sequence itself, but it also provides the option to add alternative identifiers. The content can be validated and the position is well defined making it easy to parse and process...
OrthoXML is designed broadly to allow the storage and comparison of orthology data from any ortholog database. It establishes a structure for describing orthology relationships while still allowing flexibility for database-specific information to be encapsulated in the same format.