Gene Information |
Locus tag | Dtox_2054 |
Symbol | |
ID | 8429036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2232285 |
End bp | 2233601 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645034375 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_003191506 |
Protein GI | 258515284 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
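
The length fields in this record are consistent with one another, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based inclusive coordinates (an assumption, though it matches the reported 1317 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record above.
start_bp = 2232285
end_bp = 2233601

def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

def protein_length(length_bp):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1317
print(protein_length(length_bp))  # 438
```

Both results agree with the Gene Length and Protein Length fields listed above.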
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATGT ATGATATTAT TCTGAAAAAA AGACGCGGCC TGGTATTGAC AGCAGAAGAG ATTAATTTCT TCATTGAGCA GTACAGCAGG GATAAAATAC CGGATTACCA GGCTGCAGCA CTGCTTATGG CAATATTTTT TCGTGGTTTG GATGCTGAGG AAACAGCAGC CTTAACTCTG GCAATGGCTA ATTCAGGTGA CCGGGCGGAT CTGTCTTCTA TACCGGGACT GAAGGTAGAT AAACACAGCA CCGGAGGGGT AGGAGACAAA ACTACATTGG TCTTAATTCC AATGGTTGCC GCTGCAGGAG TTCCTGTAGC CAAAATGTCG GGGAGAGGCC TGGGACATAC GGGTGGGACA ATTGATAAAT TAGAGTCAAT ATCAGGATTT CGGGTTAATC TTGATCCAGA GGAATTTATT TCTCAGGTCA ATAACATTAA AGCCGCGGTA GTAGCTCAAA CCGGGCAGCT TGCCCCGGCG GATAAGAAAC TTTATGCTCT GAGGGATGTT ACAGCAACAG TAGACAGTAT CCCTCTAATA GCTTCCAGTG TAATGAGTAA GAAAATTGCC GCCGGAGCGG ATGCTATTGT GCTGGATGTA AAAACCGGTT GCGGAGCTTT TATGAGAGAA ACAGAAGATG CTTTTAAACT GGCACGCACT ATGGTATCCA TAGGAAAGAG GGTAAACCTG CCTACAGTGG CCTTGATAAC GGATATGGAT CAACCTTTAG GTAATGCAGT AGGTAATGCC TTGGAGGTCA AAGAGGCTAT ATTAACCTTG CAGGGCAAGG GACCGTCTGA CCTGGAGGAA CTTTGTCTGG CTCTGGGCAG CCAAATGCTT CTGGCGGCTA AGAAGGTTAA AACAGATAAA GAGGGGCGGC AATTGCTGTT AGAGCTATTG AAAAACGGTA AAGCGCTGCA AAAATTTAAA GACATTATAT CTGCCCAGGG TGGACAGGTT GAGGTTTTTG ATAACCCGGA ACTTTTATCA AAGGCAGACT TAATAAAAAG TGTCAAAGCA TCCAGTGATG GCTATATTTT AGGAATTCAC GCAGAAATGA TTGGCAATGC GGCTATGCTC TCCGGGGCAG GCAGAGAAAC AAAAGATGCT GAAGTGGATC TCAGGGCGGG AATAGTTCTT CATAAAAAAA TTGGTGATAA AATATCTGCC GGGGATACCC TGGCGGTTTT GTATACAAAT AGACCTGAGA AGGAAGCGGA AATAATTAGA ATTATTCAAG AGGCATTTAT TTCAGGTTCA CAAAAACCTT TTCTGCCTCC ATTAATACAT GGAATTGTAA AACCGGGGGA TGTGTGA
|
Protein sequence | MRMYDIILKK RRGLVLTAEE INFFIEQYSR DKIPDYQAAA LLMAIFFRGL DAEETAALTL AMANSGDRAD LSSIPGLKVD KHSTGGVGDK TTLVLIPMVA AAGVPVAKMS GRGLGHTGGT IDKLESISGF RVNLDPEEFI SQVNNIKAAV VAQTGQLAPA DKKLYALRDV TATVDSIPLI ASSVMSKKIA AGADAIVLDV KTGCGAFMRE TEDAFKLART MVSIGKRVNL PTVALITDMD QPLGNAVGNA LEVKEAILTL QGKGPSDLEE LCLALGSQML LAAKKVKTDK EGRQLLLELL KNGKALQKFK DIISAQGGQV EVFDNPELLS KADLIKSVKA SSDGYILGIH AEMIGNAAML SGAGRETKDA EVDLRAGIVL HKKIGDKISA GDTLAVLYTN RPEKEAEIIR IIQEAFISGS QKPFLPPLIH GIVKPGDV
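
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be normalized before any computation. The following is a minimal sketch, with hypothetical helper names `clean_sequence` and `gc_percent`, that strips the whitespace and computes a GC percentage. It is shown on the first 30 bases of the gene sequence only; applied to the full sequence it should come out near the 44% GC content reported in the record, but that figure is not re-derived here.

```python
def clean_sequence(raw):
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()

def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)

# First three 10-base blocks of the gene sequence, as displayed in the record.
first_30 = "ATGAGAATGT ATGATATTAT TCTGAAAAAA"
print(len(clean_sequence(first_30)))  # 30
print(gc_percent(first_30))           # 20.0
```

The same `clean_sequence` helper can be applied to the protein sequence before counting residues.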