Gene Dtox_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2054 
Symbol 
ID8429036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2232285 
End bp2233601 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content44% 
IMG OID645034375 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_003191506 
Protein GI258515284 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATGT ATGATATTAT TCTGAAAAAA AGACGCGGCC TGGTATTGAC AGCAGAAGAG 
ATTAATTTCT TCATTGAGCA GTACAGCAGG GATAAAATAC CGGATTACCA GGCTGCAGCA
CTGCTTATGG CAATATTTTT TCGTGGTTTG GATGCTGAGG AAACAGCAGC CTTAACTCTG
GCAATGGCTA ATTCAGGTGA CCGGGCGGAT CTGTCTTCTA TACCGGGACT GAAGGTAGAT
AAACACAGCA CCGGAGGGGT AGGAGACAAA ACTACATTGG TCTTAATTCC AATGGTTGCC
GCTGCAGGAG TTCCTGTAGC CAAAATGTCG GGGAGAGGCC TGGGACATAC GGGTGGGACA
ATTGATAAAT TAGAGTCAAT ATCAGGATTT CGGGTTAATC TTGATCCAGA GGAATTTATT
TCTCAGGTCA ATAACATTAA AGCCGCGGTA GTAGCTCAAA CCGGGCAGCT TGCCCCGGCG
GATAAGAAAC TTTATGCTCT GAGGGATGTT ACAGCAACAG TAGACAGTAT CCCTCTAATA
GCTTCCAGTG TAATGAGTAA GAAAATTGCC GCCGGAGCGG ATGCTATTGT GCTGGATGTA
AAAACCGGTT GCGGAGCTTT TATGAGAGAA ACAGAAGATG CTTTTAAACT GGCACGCACT
ATGGTATCCA TAGGAAAGAG GGTAAACCTG CCTACAGTGG CCTTGATAAC GGATATGGAT
CAACCTTTAG GTAATGCAGT AGGTAATGCC TTGGAGGTCA AAGAGGCTAT ATTAACCTTG
CAGGGCAAGG GACCGTCTGA CCTGGAGGAA CTTTGTCTGG CTCTGGGCAG CCAAATGCTT
CTGGCGGCTA AGAAGGTTAA AACAGATAAA GAGGGGCGGC AATTGCTGTT AGAGCTATTG
AAAAACGGTA AAGCGCTGCA AAAATTTAAA GACATTATAT CTGCCCAGGG TGGACAGGTT
GAGGTTTTTG ATAACCCGGA ACTTTTATCA AAGGCAGACT TAATAAAAAG TGTCAAAGCA
TCCAGTGATG GCTATATTTT AGGAATTCAC GCAGAAATGA TTGGCAATGC GGCTATGCTC
TCCGGGGCAG GCAGAGAAAC AAAAGATGCT GAAGTGGATC TCAGGGCGGG AATAGTTCTT
CATAAAAAAA TTGGTGATAA AATATCTGCC GGGGATACCC TGGCGGTTTT GTATACAAAT
AGACCTGAGA AGGAAGCGGA AATAATTAGA ATTATTCAAG AGGCATTTAT TTCAGGTTCA
CAAAAACCTT TTCTGCCTCC ATTAATACAT GGAATTGTAA AACCGGGGGA TGTGTGA
 
Protein sequence
MRMYDIILKK RRGLVLTAEE INFFIEQYSR DKIPDYQAAA LLMAIFFRGL DAEETAALTL 
AMANSGDRAD LSSIPGLKVD KHSTGGVGDK TTLVLIPMVA AAGVPVAKMS GRGLGHTGGT
IDKLESISGF RVNLDPEEFI SQVNNIKAAV VAQTGQLAPA DKKLYALRDV TATVDSIPLI
ASSVMSKKIA AGADAIVLDV KTGCGAFMRE TEDAFKLART MVSIGKRVNL PTVALITDMD
QPLGNAVGNA LEVKEAILTL QGKGPSDLEE LCLALGSQML LAAKKVKTDK EGRQLLLELL
KNGKALQKFK DIISAQGGQV EVFDNPELLS KADLIKSVKA SSDGYILGIH AEMIGNAAML
SGAGRETKDA EVDLRAGIVL HKKIGDKISA GDTLAVLYTN RPEKEAEIIR IIQEAFISGS
QKPFLPPLIH GIVKPGDV