Gene Dtox_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2042 
Symbol 
ID8429024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2221849 
End bp2222970 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content37% 
IMG OID645034363 
ProductThiamin pyrophosphokinase catalytic region 
Protein accessionYP_003191494 
Protein GI258515272 
COG category[S] Function unknown 
COG ID[COG4825] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.978362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTATA AAGGTATAGC TAGGATTGAT AAAAGAACTA AAAATTTAGT TAAACGCCTA 
ATATCATCCG ATATTGCTAT AATTGATCAC AAAGATCTTG ACGAAGTGGC TGCCCAATCT
CTTTTGGAAA CAAAGGTACG TATTGTAGTT AATGCCTCAC ACTCATTAAG TGAGGATTAT
CCTAATCCAG GTCCGCTTGT CCTGGTAGGT TCCGGCGTGC ATTTAATTGA TAATGCAGGT
AAAGAAATCA TGTCAGCCAT TTCTGAAGGA CAGGAAATTG AAATTGTTGA GAACCGGATA
TTGCTAAACG GGGAGCTAAT TGCTGAGGGT AAGTTATTAA GTATAGATTA TATAAAAGAA
AAAATGTTAG AAACACAAAA ACATATTAAC AGAGTGTTGT CAAAGTTTGT ACAAAACACA
CTTGAATATG CGCAAAATGA AGTAGGTATG ATTCTTGGTG AAGTTGAAGT ACCTGAGACC
AGAACGGTTT TTAAAAATAA GCATACACTG ATTGTTGTTA GAGGGAAAAA CTATAAAGAA
GATTTAAATG CCATAACATC TTATATTAAT GAAGTTAAGC CTGTTCTGGT AGCGGTTGAC
GGCGGGGCAG ATGCTTTAAT GGAATTTGGT TATCAACCTG ATGTAATTAT TGGTGATATG
GATAGTATCA GTGACAAAAT GCTGCGATGC GGGGCTGAAT TAATAGTACA TGCCTACCCT
AACGGCAAGG CGCCCGGTTT AGAGAGATTA AATGAATTGG GTTTGTCTGC CTTGGTTTTT
CCTGCTCCTG GAACCAGTGA AGATATAGCC ATGCTTTTAG CTTATGAAAA AGGTACTGAT
TTAATAGTAG CGGTAGGAAC ACATTCCAAC ATGTATGATT TTTTAGAAAA AGGACGAAAA
GGAATGTCCA GCACATTTCT TGTTAGATTA AAGGTCGGTT CTGTATTAGT TGATGCCAAA
GGTGTCAGCC AGCTTTATAA AAGTAATATT AAGGTTCGCT ATTTAGCGCA GATTATTCTG
GCTGCACTGC TGCCATTTAC TATTGTTCTG GTAATTTCTC CTACCACAAG AGAATTACTG
CGTTTATTAT ATATTCAGTT CCGGCTAATA TTGGGGATAT AA
 
Protein sequence
MYYKGIARID KRTKNLVKRL ISSDIAIIDH KDLDEVAAQS LLETKVRIVV NASHSLSEDY 
PNPGPLVLVG SGVHLIDNAG KEIMSAISEG QEIEIVENRI LLNGELIAEG KLLSIDYIKE
KMLETQKHIN RVLSKFVQNT LEYAQNEVGM ILGEVEVPET RTVFKNKHTL IVVRGKNYKE
DLNAITSYIN EVKPVLVAVD GGADALMEFG YQPDVIIGDM DSISDKMLRC GAELIVHAYP
NGKAPGLERL NELGLSALVF PAPGTSEDIA MLLAYEKGTD LIVAVGTHSN MYDFLEKGRK
GMSSTFLVRL KVGSVLVDAK GVSQLYKSNI KVRYLAQIIL AALLPFTIVL VISPTTRELL
RLLYIQFRLI LGI