Gene Dtox_3958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3958 
Symbol 
ID8430973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4142715 
End bp4144025 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content53% 
IMG OID645036176 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003193274 
Protein GI258517052 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000179604 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000917586 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTTGCCGG TTATATTTTT TGTAGCCTTT CTTCTCACCG GTTGTCCGGA TAAGGGTGAA 
CAGAAACCTG TTAGCCAGAA GGTAAAAATA GGAGTCTGTT TAGCTGATAT GGAGAGGGAC
GGGAATCGAA TTATAAAAAA AATTATGACC GAGAGGGCTC AAAAGGAAGG GATAAGTTTT
ACCTGGCAGG ATGCCGGCAA TGATGCCGAA AAACAGCAAG AGCAAATAAA TGAGTTAGTA
AAGAAAAAGG TTAAGGCTGT TATCCTGCAG GCAGTTGACC CTTCAGCAGG GCCGGCTATA
GCCCGCACTT TAACGGAAAA AGGAATTAAA ACAGTGGCTC TCGAAACACT GCCTGCCAAT
ACGCCGTTGG ATGCTTATGT AGCTTCCGAT CATGGCAGGG TGGGAGAACT GCAGGCCCGC
TATGCTTTAG AGCAGTCTGC CGCCGGCGGG AAAACAGAGG GCTTGCGTAT ACTGGTGCTG
CAGGGTGATC CCTCCGACCT GGCCGCGCGT GAAATTACCG CTGCTAATGA AGCAGTCTTA
AAAGGAGATA AGCGGGTAGC CGATTTGAAG GTGCAGAGCA ATTTGCGGGC TGATCCGGGC
CTGGCCCGGG TCGCGGTTGA AAACACTCTG GCTGCCGGTA AATTGGATGC GGTGCTGGCC
AATGACAGCA GACTGGCTAT GGCTGCGGTA CAGGTGCTGA CAGAAAGAAA CTTAGCGGCT
CAGGTGGTAA CGGTAGGCGT GGGTGCGGAC AGTAAATCAG TCCGGGGTAT TGCTTCCGGC
CAGCATGACG CTGAGGTGGA CAATTCGCCG GAGCTGCTGG CCCAGTACAT AACTGACGCG
GCGGTTGAAC TGGCCAAAAC AGGCCACTGG CAGTATACGG GCCATGTCCC CAACGGTCAA
TATGATGTGG CGACAAAAAT AACCCCGGTA CGCTTAATCA CCATGAAGGA TCTTTATTTA
TTGGGTGAGA GGGCAAGCGG TTCGGGTGGC GGAAATGAGC AGAACGATAA GTCTGATCAG
CAAAGCGGCG GCTCAAGCCA CTCAGGAGAA AAACAGGGTG GCGGTGAGCA GGGCGGCAGC
GGGGAAAATT CGGGTGACCA GGGTGGTTCC GGTGGGCAGA GCCAATCGGG TGCTCAAGGG
CAGTCCGGCA AACGAAAGAC TACTCTAAAA ATAACCACTC AGGACGGTAA AACCATGGAG
GTAGAATTAA ACGGTGAGAT AAAGTCCATC GAAAGCAAAG ACGGCGGCGG CCGGCAGAGT
TCGGGTGACA GCCAGCAGTC AGGCGGAGCC AATGTGAGCA GCGGAGGTTA G
 
Protein sequence
MLPVIFFVAF LLTGCPDKGE QKPVSQKVKI GVCLADMERD GNRIIKKIMT ERAQKEGISF 
TWQDAGNDAE KQQEQINELV KKKVKAVILQ AVDPSAGPAI ARTLTEKGIK TVALETLPAN
TPLDAYVASD HGRVGELQAR YALEQSAAGG KTEGLRILVL QGDPSDLAAR EITAANEAVL
KGDKRVADLK VQSNLRADPG LARVAVENTL AAGKLDAVLA NDSRLAMAAV QVLTERNLAA
QVVTVGVGAD SKSVRGIASG QHDAEVDNSP ELLAQYITDA AVELAKTGHW QYTGHVPNGQ
YDVATKITPV RLITMKDLYL LGERASGSGG GNEQNDKSDQ QSGGSSHSGE KQGGGEQGGS
GENSGDQGGS GGQSQSGAQG QSGKRKTTLK ITTQDGKTME VELNGEIKSI ESKDGGGRQS
SGDSQQSGGA NVSSGG