Archive of effectors.org

Changes:
* Domains in version 5.2 were annotated using InterProScan 64.0 and Pfam 31.0
* Domains in version 5.1 were annotated using InterProScan 57.0 and Pfam 29.0
* Domains in version 5.0 were annotated using InterProScan 53.0 and Pfam 28.0
* The database format of effectors.org has changed since version 5.0

Eukaryotic-like domains:
Three tables store the eukaryotic-like domains in the effectors.org database. Two further tables store genome and protein metadata. All five tables are dumped as tab-delimited files.

euk_domain.txt (Frequencies of eukaryotic-like domains in genomes of non-pathogens)
Column 1: domainid (protein domain ID)
Column 2: description (protein domain description)
Column 3: nonpath_avg (average number of proteins encoding a domain in non-pathogens in original RefSeq/Genbank genome annotation)
Column 4: nonpath_stddev (standard deviation of the occurrence of the domain in non-pathogens in original RefSeq/Genbank genome annotation, taken from the SIMAP database)

euk_prediction.txt (Annotation of proteins with eukaryotic-like domains)
Column 1: gpid (genome project ID)
Column 2: accession (protein accession)
Column 3: domainid (protein domain ID)

euk_score.txt (Genome-specific scores of eukaryotic-like domains)
Column 1: domainid (protein domain ID)
Column 2: gpid (genome project ID)
Column 3: zscore (genome-specific Z-score = (number of proteins from this genome encoding a domain - nonpath_avg) / nonpath_stddev)

Additionally, the genome and protein tables provide relevant metadata from the genome repository:

genome.txt
Column 1: gpid (genome project ID)
Column 2: taxonomyid (NCBI taxonomy ID)
Column 3: refseq_accessions (NCBI RefSeq accessions)
Column 4: description (genome description)
Column 5: class (type of genome: pathogen, symbiont or non-pathogen)

protein.txt
Column 1: gpid (genome project ID)
Column 2: accession (protein accession from NCBI RefSeq)
Column 3: genenames (gene names from NCBI RefSeq)
Column 4: locustags (locus tags from NCBI RefSeq)
Column 5: description (protein description from NCBI RefSeq)

Pre-calculated Effective predictions:
All files are stored in their original format in the genomes.zip archive.
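
Example (illustrative only): the Z-score formula given for euk_score.txt can be recomputed from euk_domain.txt and euk_prediction.txt. The following Python sketch is not part of the effectors.org distribution. It assumes that the dumps have no header row and that each protein is counted once per genome and domain. The function names read_tsv, domain_counts and zscores are hypothetical. Values recomputed this way may differ from the stored zscore column if the database applies additional filtering.

```python
import csv
from collections import Counter


def read_tsv(lines):
    """Yield non-empty rows (lists of strings) from tab-delimited lines."""
    # QUOTE_NONE: treat quote characters in descriptions as plain text.
    for row in csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE):
        if row:
            yield row


def domain_counts(prediction_rows):
    """Count distinct proteins per (gpid, domainid) from euk_prediction.txt rows."""
    seen = set()
    for row in prediction_rows:
        gpid, accession, domainid = row[:3]
        seen.add((gpid, accession, domainid))  # assumption: count each protein once
    return Counter((gpid, domainid) for gpid, _accession, domainid in seen)


def zscores(domain_rows, prediction_rows):
    """Return {(domainid, gpid): zscore} using (count - nonpath_avg) / nonpath_stddev."""
    stats = {}
    for row in domain_rows:
        domainid, _description, avg, stddev = row[:4]
        stats[domainid] = (float(avg), float(stddev))

    scores = {}
    for (gpid, domainid), count in domain_counts(prediction_rows).items():
        if domainid not in stats:
            continue  # domain has no background statistics
        avg, stddev = stats[domainid]
        if stddev == 0:
            continue  # Z-score is undefined for zero standard deviation
        scores[(domainid, gpid)] = (count - avg) / stddev
    return scores
```

To apply it to the dump files, open them in text mode and pass the file objects through read_tsv, for example zscores(read_tsv(open("euk_domain.txt", newline="")), read_tsv(open("euk_prediction.txt", newline=""))). Genomes in which a domain does not occur at all have no rows in euk_prediction.txt and therefore do not appear in the result.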