Gene Dtox_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0343 
Symbol 
ID8427278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp355018 
End bp356631 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content44% 
IMG OID645032741 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003189919 
Protein GI258513697 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGCAA AACAAAAATG GCTCAGGCAG TTAATTACAC TATTACTTAT GCTGGGCCTA 
ACAGTGGTTG TGCTGTCAGG TTGTGGCACA AACCGGGCTG TCGAGAAAAA TGCCGGCGGG
TCAGGTGAAG TTGCTGTTTA TACTATCGCC GATTCAACGG GTGACTGGGG TTTTCCTTCT
CCTTATACTC ACTATAACCG GGGACCTGGC TACGTAAGAA TGAGTTTTCT CTTTGACACA
CTGGTGTGGA AAAACGATCG GGAGTATCTG CCTGGTCTGG CCGAAAAATG GCAATACCTG
ACGCAGGAAA ATGCTTATTT GTTTAACCTG CAAAAAAATG TTACCTGGCA TGACGGGGAA
AAATTTACTT CCGGAGATGT GCTGTTTACG TATAATTATG TTAAAGCCCA CCCTTATCAA
TGGGCCGATG TCGGCATGAT TAAGAAAATC GAGGCTTTGG ACGATTATAC TGTAAAGATG
TATTTAAACA AACCCTATGC TCCGTTCTTG GATACCGTGG TTGGCAGCAT GCCCATTTTG
CCCGGGCATA TTTGGAAAAA TGTGCAAAAT CCAATGCAGT TCCAGAAGGA GGAGGCATTA
ATCGGGACCG GTCCTTATAA GCTGTTGGAT TACAATAAAG AGCAAGGTAC ATACCTATAT
GAGGCCTATG ATAATTATTA CCTGGGAAAA CCCCGGGTGA AGCAGTTAAA GTTCATTAAA
ATTAGCAATG AAATGGTGGG GAATGCTTTA AAACAGAAAC AGGCTGACGC GGCGCAAGTT
CCGCCGGAAC TGGCCAGTCA AATGGAAAAA GAAGGATTTA ATATTTTAAA AGGTTCTCAC
GATTCGGTAG TCAAAATACA AATAAACCAC CGGAAAGAGC CCCTGTCCAA TAAAGAATTC
CGGCAGGCGC TGGCTTATGC CGTAAACCGC CAGGAACTGT TGGATACTAC CCTGCGGGGT
TATGGTCTGG TAGGCAATCC GGGCTTGGTG CCGCCGGATA ACAGCTGGTA TAATCCTCAA
GTGGAACAAT ACTCTTATAA CCCGGTCAAA ACGGGGGAAA TACTTGCCAA ACTGGGATAT
GTTAAAAAAG GAATGTATTT TGCGAAGGAC GGAAAACCGC TGGAACTGGA GCTTTTAATC
AGTGGGGCAG GTTCAGCTAA TACTCCGGGA GTGCGCCAGG GCGAAATGAT TAAGGAGCAG
TTGGAAAAGG CGGGCATAAA GGTAAATTTG CGCAGCCTGG ATCCCAAGAC ACTCGACAGC
ATGGTGGGAG AATGGAAATT TGATCTGGCT TTAATCAGTC ACGGCGGAAT GGGCGGGGAA
CCTAAAGTAT TAAACACAAT GATTACAGAT AAAAGCTTTA ATAGTGCCAG GTATCTGAAA
AGTGAAGAAC TCAACAGTCT TTTGCAGCAG CAATTGGAGA AAATAAACCA GCAGGAGCGT
AGAAAACTAA TCAATAGAAT TCAGGAAATT TATGCTCAGG AAATGCCTTC TTTACCTCTT
TATTATCCTA GCAGCTATTG GGTTTATGAT AATCAGGTGA AACTTTTTTA TACTAAACAA
GGTATTGGTA TCGGCGTTCC AATTCCTGCT AACAAAATGT CTTTTGTGAA ATAA
 
Protein sequence
MDAKQKWLRQ LITLLLMLGL TVVVLSGCGT NRAVEKNAGG SGEVAVYTIA DSTGDWGFPS 
PYTHYNRGPG YVRMSFLFDT LVWKNDREYL PGLAEKWQYL TQENAYLFNL QKNVTWHDGE
KFTSGDVLFT YNYVKAHPYQ WADVGMIKKI EALDDYTVKM YLNKPYAPFL DTVVGSMPIL
PGHIWKNVQN PMQFQKEEAL IGTGPYKLLD YNKEQGTYLY EAYDNYYLGK PRVKQLKFIK
ISNEMVGNAL KQKQADAAQV PPELASQMEK EGFNILKGSH DSVVKIQINH RKEPLSNKEF
RQALAYAVNR QELLDTTLRG YGLVGNPGLV PPDNSWYNPQ VEQYSYNPVK TGEILAKLGY
VKKGMYFAKD GKPLELELLI SGAGSANTPG VRQGEMIKEQ LEKAGIKVNL RSLDPKTLDS
MVGEWKFDLA LISHGGMGGE PKVLNTMITD KSFNSARYLK SEELNSLLQQ QLEKINQQER
RKLINRIQEI YAQEMPSLPL YYPSSYWVYD NQVKLFYTKQ GIGIGVPIPA NKMSFVK