Gene Dtox_3570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3570 
Symbol 
ID8430576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3769017 
End bp3770537 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content41% 
IMG OID645035798 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003192905 
Protein GI258516683 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGCTT TAGGCTTTTT TTCCTGGAAT CTAAAGCTAG GCGGCATACC TGTGCTCAGT 
CTCCTGGACT CGCATCAAAC AGTTGTCTAT GCTTACACCT CAGAAGCTCA GACACTGGAC
CCGGCACTGG CCAAAGATCC TCAATCAGCC AAGCTAATTA GCTGTATATA TGAAGGACTG
GTACGTTATA AACCTGGAAC TCTCGATATT GAACCTGCGT TAGCTTCAGA TTGGAAAATA
TCTAAAGACG GCCTTACCTA CACTTTTAAT CTTAAAAAAA ATATTTTATT CCACGACGGT
ACACCTTTTA GGGCCCAGGC TGTTAAATAC AGTATTGAGC GTCAAACTAA AAAGGGAATA
GCAAGCTATT ATTACGATCT GGTTTTCGGG CCCTTAAGGG AGGTAAAGGT AATTGATGAT
TATACGGTAA AGCTGGTCCT TTCTAAACCC TATGCTCCAC TCCTGCGGAA TCTGGCTATG
CCCTATGCCG CTCCCGTTGT TTCACCTTCG GCAGCAGAGA AATATGGTTC GGCCTTTGGA
GTCAATCCTA CGGGCACTGG ACCCTACATC TTACTGGATT GGGATAAAGA TATTTCGATC
AGCCTGGCAG TTAACCCGTT TTATCGCGAA CCGGGACCCG AAGTTAAAAA GCTTATATTT
AAGGTGATAC CACAAACTGG CCAGCGTCTA ACAGCGCTAT TAAATGGTGA TGTCCACCTG
GCAGACGGTT TTACACAGCT AAATACCTTG TTATTCAAGC AAAGGAATAT ACCTTACCAG
ACTACTGTCA GCAATGATGT TAGCTACCTG GGTTTTTACG TTAACAAGCC ACCCTTTAAT
AATATTAAAC TTCGCCGTGC AATAAGCATG GCTATTAACC GCGGTGAAAA AGGAGATACC
GGCCAATTAG CAATAAATAA TGAAAACATT TTACCTCCCG GCGTTTGTGG TTTTGAGAAA
AACTTTAAAC TATACGGCTA TGACCCTGAA AAATCAATTC AGATACTTAA TGACACAGTT
GGCAATGGCT TTTCCTTTGA GCTGATTACT TATCTGGATC AAAGACCTTA TAACGACACA
GGCGGTGAGC CACTGGCCAA ATTTATCCAA GATCAGTTGG CACAGGTAAA TATAAAAGTT
AGAATAAAAA CATACAATTG GGAAAAGTAC AAGAGAGCAC TGCTTAACCA GGAGGGTGAT
GCTTTCCTGT ACGGATGGAT TAGTGACAGC GGAGACCCAG ATGATTTTTT ATACCATCAA
TTCGGCGGTA AACTAAATGA GAGCGGTTTA AACATCTTCC ACTATAATAA TGAAGCTGTT
AATTCGCTGC TGGAGGAAGG ACGTTCTTCT TGCGATTTAG AGCAAAGACA GAATATTTAT
GCCAGAGTAC AAAGACTTAT CTTATCAGAC GCCCCCCTTG TACCCCTTAA TCACGGCATT
AATATTTTAG CTGTGTCACC TGTAATTAAA AATTGTATAT TACAAAGAAA CGGTACCTGT
TTTTTAAACA GACTCAACTA A
 
Protein sequence
MFALGFFSWN LKLGGIPVLS LLDSHQTVVY AYTSEAQTLD PALAKDPQSA KLISCIYEGL 
VRYKPGTLDI EPALASDWKI SKDGLTYTFN LKKNILFHDG TPFRAQAVKY SIERQTKKGI
ASYYYDLVFG PLREVKVIDD YTVKLVLSKP YAPLLRNLAM PYAAPVVSPS AAEKYGSAFG
VNPTGTGPYI LLDWDKDISI SLAVNPFYRE PGPEVKKLIF KVIPQTGQRL TALLNGDVHL
ADGFTQLNTL LFKQRNIPYQ TTVSNDVSYL GFYVNKPPFN NIKLRRAISM AINRGEKGDT
GQLAINNENI LPPGVCGFEK NFKLYGYDPE KSIQILNDTV GNGFSFELIT YLDQRPYNDT
GGEPLAKFIQ DQLAQVNIKV RIKTYNWEKY KRALLNQEGD AFLYGWISDS GDPDDFLYHQ
FGGKLNESGL NIFHYNNEAV NSLLEEGRSS CDLEQRQNIY ARVQRLILSD APLVPLNHGI
NILAVSPVIK NCILQRNGTC FLNRLN