Gene Dred_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_1101 
Symbol 
ID4956968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp1172197 
End bp1173513 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content48% 
IMG OID640180271 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_001112461 
Protein GI134298965 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATGG TGGACATTAT CCTGAAAAAA AGACGGGGCT TGGAACTTAC CTCGGAAGAA 
ATAGAATTTT TTATTCAGGG TTATACCAAG GGAGAAGTAC CGGACTATCA AGCGGCAGCC
TTTCTAATGG CAGCCTTTTT CCAGGGACTA ACCCCCAGGG AAACGGGAGA TTTAACTATG
TCCATGGCTA GATCCGGTGA TCAGGCGGAC CTTTCATCGA TTCCCGGCAT AAAGGTCGAT
AAGCATAGTA CAGGTGGCGT TGGGGATAAA GTTACGTTAA TTCTCGGACC CTTGGTTGCG
GCTGCGGGCA TACCGGTGGC TAAGATGTCC GGTCGGGGCT TGGGACATAC CGGAGGCACC
ATTGATAAAC TGGAATCCAT TCCAGGCTTC CAAGTAACCA TGGATAACCA AAATTTTTTG
CAACAGGTAA AACGAGTAAA ATTGGCCGTT GTGGCCCAAA CCGGTCATTT AGCTCCGGCA
GATAAGAAAC TCTATGCATT ACGGGATGTT ACGGCCACTG TGGACAGTAT TCCCTTTATT
GCTGCCTCGG TCATGAGTAA AAAGATAGCC GCCGGGGCGG ATGCCATCGT ATTGGATGTT
AAGGTGGGCA GTGGTGCATT CATGAAAAAC TCAGAGGATG CCTTTTCCTT AGCCAGAACT
ATGGTGGAAA TTGGCACCAG CGTAGGGCGG CAAACGGTGG CCCTGGTTAC CGACATGGAT
CAACCCTTGG GTTTTGCCAT TGGCAATGCG TTGGAGGTAA AAGAAGCCAT CGAAACCCTA
AGGGGCAACG GGCCGGCTGA TTTACGGGAG CTATGCATTT ATCTGGGCAC CGAGATGCTA
AAACTGGCTG GCATAGCAGA GGATGAGTTA GTAGCCCGCA GAAAATTAGA AGAGCTCTTA
AGTAATGGTG GCGCCCTCAA TAAATTTAAG GAGCTTATTG AGGCCCAGGG CGGTGATCCT
GAAGTTGTGG AGAATCCCGA TCGATTACCG GGGGCCTCTA GTGTATATCC TGTGATATCA
GATATAGAGG GATATGTAAG GGAAATACAG TCTGAGCAGG TTGGCGTCGT TGCCATGTGG
CTGGGGGCTG GCAGGGCCAC TAAGGAATCG GTGATCGACC TAGGTGTGGG TGTGGTCTTA
AAGAAAAAGG TCGGTGATTA TGTAAAGAAA GGTGAGGTTA TCGCTGATTT ACATGTTAAT
GAAAACAAGG AAATTGCCAA GGTTGCAGAC CTGCTAAGGA AGGCCTATGT TTTACAAAGG
GAACCAGTCG TGGCCAAGGA AATTTTACTG GGTAAGGTAA CGAAGGAAAG TATATAA
 
Protein sequence
MRMVDIILKK RRGLELTSEE IEFFIQGYTK GEVPDYQAAA FLMAAFFQGL TPRETGDLTM 
SMARSGDQAD LSSIPGIKVD KHSTGGVGDK VTLILGPLVA AAGIPVAKMS GRGLGHTGGT
IDKLESIPGF QVTMDNQNFL QQVKRVKLAV VAQTGHLAPA DKKLYALRDV TATVDSIPFI
AASVMSKKIA AGADAIVLDV KVGSGAFMKN SEDAFSLART MVEIGTSVGR QTVALVTDMD
QPLGFAIGNA LEVKEAIETL RGNGPADLRE LCIYLGTEML KLAGIAEDEL VARRKLEELL
SNGGALNKFK ELIEAQGGDP EVVENPDRLP GASSVYPVIS DIEGYVREIQ SEQVGVVAMW
LGAGRATKES VIDLGVGVVL KKKVGDYVKK GEVIADLHVN ENKEIAKVAD LLRKAYVLQR
EPVVAKEILL GKVTKESI