Gene Athe_1775 details


Gene Information

Locus tag: Athe_1775
Symbol:
ID: 7408562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1849673
End bp: 1850998
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 37%
IMG OID: 643716152
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_002573641
Protein GI: 222529759
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
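
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1849673
END_BP = 1850998
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 1850998 - 1849673 + 1 gives 1326 bp, and 1326 / 3 - 1 gives 441 aa, matching the listed lengths.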


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGGTGA CAGAGATTAT TAGGAAAAAA CGTGACGGAG AGATACTGAG CAAAGAAGAG 
CTTGAATTTA TTGTAAACGG ATATGTAAAA GGCGAAATTC CAGACTATCA GATGTCTGCT
TTCTTGATGG CTATATATTT TAGAGGGATG TCAAAAGATG AGCTTGTAGA ACTTACCATG
CTCATGGCAA AATCAGGTAA GATGGTAGAC CTGAGCAGTA TAGAGGGTAT CAAGGTTGAT
AAACACTCAA GTGGTGGTAT TGCTGATACA ACAACCCTAG TCTTAATACC ACTGGCAGCA
TCTTGCGGTG TAAAGGTTGC AAAGATGTCA GGACGAGGGC TTTCTCACAC AGGTGGGACA
ATTGACAAGT TGGAGTCAAT TCCAGGTTTT AAAACAGAGC TTTCAGAAGA TGAGTTTATA
AAAGCAGTAA ATAAAGTTGG TGCGGCAATT GTTGGGCAGT CAGAAAGCCT TGTTCCTGCA
GACAAAAAGA TATATGCTTT AAGAGATGTC ACAGGTACAG TTGAATCTAT ACCACTTATA
GCATCTTCCA TCATGAGCAA AAAGATTGCA GCTGGTAGTG ATAAGATAAT ACTTGATGTC
AAGTTTGGCA AAGGTGCTTT CATGAAAGAG TATGAAAAGG CAAAAGAGCT TGCTAATACT
ATGGTTGAGA TTGGAACTTT GGCAGGAAGA GAAACAGTGG CATATGTTAC AGATATGAAT
CAGCCACTTG GTCTTATGAT TGGCAACGCT CTTGAGGTTA TTGAAGCAAT AGAGGTCTTA
AAAGGAAGAG GACATGAAGA TTTGAAGAAT CTTTGTATTG AATTTGCATC TGAGATGATG
ATTATGGCTG GAGTTGAAAA GGAGAAAAAA TTAGCACAAG AAAGAGCAAT TGAGAGCATT
GAAAAAGGGC ATGCTCTCAA AAAATTTAGA GAAATTATAA AAAACCAGGG TGGAAATCCT
GAAATAGTTG ACAATTATTC ATTGCTGCCA CAAGCCAAAT ATATTTATGA ACTAAAATGC
GACGAAGATA TGTATATTAA AGATATTGAT GCTCTCAAAC TTGGGCTTTG CGCACTAAAA
CTTGGAGCAG GAAGACAAAG AAAAGAAGAC AAGATTGACT ACGCAGTTGG AATTCAACTT
TTTGGTAAAA TAGGCGACAA AATAGCCAAG AATATGCCGT TTGCTAAAAT CTATGCAAAT
GATGAAAAAA GGGTTGAAGA AGCCATCTCA GATGTGAAAA CTGCATTTGA GTTTTCAAAA
GTACCTGTTC CAAAAAGAAA AGTAATATTT GCAAAGATAA CAAAAGATAA TGTTTTTGAA
TTTTAA
 
Protein sequence
MLVTEIIRKK RDGEILSKEE LEFIVNGYVK GEIPDYQMSA FLMAIYFRGM SKDELVELTM 
LMAKSGKMVD LSSIEGIKVD KHSSGGIADT TTLVLIPLAA SCGVKVAKMS GRGLSHTGGT
IDKLESIPGF KTELSEDEFI KAVNKVGAAI VGQSESLVPA DKKIYALRDV TGTVESIPLI
ASSIMSKKIA AGSDKIILDV KFGKGAFMKE YEKAKELANT MVEIGTLAGR ETVAYVTDMN
QPLGLMIGNA LEVIEAIEVL KGRGHEDLKN LCIEFASEMM IMAGVEKEKK LAQERAIESI
EKGHALKKFR EIIKNQGGNP EIVDNYSLLP QAKYIYELKC DEDMYIKDID ALKLGLCALK
LGAGRQRKED KIDYAVGIQL FGKIGDKIAK NMPFAKIYAN DEKRVEEAIS DVKTAFEFSK
VPVPKRKVIF AKITKDNVFE F
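
The gene sequence begins with GTG while the protein sequence begins with M. That is consistent with translation table 11, in which GTG is among the codons that can serve as a start codon and is then read as methionine. The sketch below checks only the first 30 nucleotides against the first 10 residues. The codon dictionary is deliberately partial (just the codons needed here), and the function name is hypothetical.

```python
# Partial codon table: only the codons that occur in the first 30 nt,
# plus ATG and TTG so the start-codon branch is safe to exercise.
CODONS = {
    "GTG": "V", "CTG": "L", "ACA": "T", "GAG": "E",
    "ATT": "I", "AGG": "R", "AAA": "K", "ATG": "M", "TTG": "L",
}

# Start codons handled by this sketch (table 11 permits these among others).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading the first codon as M if it is a start codon."""
    seq = "".join(dna.split()).upper()          # drop the 10-nt block spacing
    usable = len(seq) - len(seq) % 3            # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[c] for c in codons]      # KeyError for codons not in the partial table
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First three 10-nt blocks of the gene sequence listed above.
print(translate_prefix("GTGCTGGTGA CAGAGATTAT TAGGAAAAAA"))  # MLVTEIIRKK
```

For a full translation, a complete codon table (for example from an established bioinformatics library) would be needed; this fragment only illustrates the start-codon convention.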