Gene Dtox_1596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1596 
Symbol 
ID8428560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1673079 
End bp1674377 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content49% 
IMG OID645033929 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003191078 
Protein GI258514856 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTTT TAGAGGGTGC GACAAAAGGA ATTACAACAG AGCAAATAGA AGCTGCTGCA 
GCCGCTGAAG GGGTAGCTGC AGCATTGATC AGACAAGGAA TTAATGATGG CACTATCGTG
GTACCCTGCA ACCCTTCGCA CAAAGGACTG AAGCCTATGG CAATCGGTAA GGGGCTTCGA
ACAAAAGTAA GTGCCAGTAT CGGGCTGGAC AGCAAGGATA GCACTGTGGC TTATGAACTG
GAAAAGCTGC AGACAGCTTT GACCGCCGGG ACTCATGCGA TCATGGATCT CAGTATTGCC
GGAGATATGG ATGGTGCTCG CAGGGCTATT TTGTTAGAAT CGCCGGTGCC TGTAGGTACC
CTGCCTCTAT ACCAGACGGT AGCCGAAGCC AGTCGGAAAT ACGGCTCTGC ATTGAAAATG
ACACCGGAAC AGATGCTCGA GGTTATAGAA CGTCAAGCAT CTGATGGTGT GGATTTCATG
GCACTGCATT GTGCAACTAC CTTTGAAACT ATTGAACGTG CTAAAAATGA AGGTCGCATA
GACCCTTTGG TTAGCTATGG CGGTTCACAT ATCATAGGCT GGATGGTTTA TCATAAAAAG
GAAAACCCTC TCTATGAAAA CTATGACCGC ATTCTGGAGA TCGCGAAGAA ATATAGTGTG
ACACTTAGCC TGGCGGATGG CATGCGCCCG GGCTGCTTGG CGGATTCCTT GGACGGAGCC
CAGGTGCAGG AGCTGATCAT GTTAGGCGAG TTAGTAGATC GGGCTCGTAA AGCCGGTGTG
CAAATTATGA TCAAAGGGCC CGGACATATG CCGCTTAATC ATATTAAGGA TACCATGACC
TTGCAGAAAA GCCTTTGCAA GGGAGCGCCT TACTTTGTTT TCGGTCCTTT ATTGACTGAC
CTTGCTGTCG GTTACGATCA CATCAATGCA GCCATTGGCG GGGCTATCAG CAGCTGGTAT
GGTACTGAAT TCCTATGTTA CGTTACCCCT GCCGAGCATA TTGGCAATCC CGATGTTTCA
CAGGTGCGTC AGGGCGTCAT TGCCGCCCGT ATTGCCGCTC AGGCCGGAGA CCTGGCCAAG
GGCATGCCGG AAGCGATTCA GTGGGAGCTG GATATGTCCA ATGCCCGCAG AGATTTAAAA
TGGACGGAAC AGATCAGACT GGCCATCGAT CCCGAATATG CCGAATATAT CCGGAAGACC
AGAAATGATG GCGAGATTTC TACCTGTGCC ATGTGCGGTA AGTTCTGCGC TATGAAAATA
ATTGCCGAAC ACTTACATTT TGACAAGCAC AGCTGTTGA
 
Protein sequence
MTFLEGATKG ITTEQIEAAA AAEGVAAALI RQGINDGTIV VPCNPSHKGL KPMAIGKGLR 
TKVSASIGLD SKDSTVAYEL EKLQTALTAG THAIMDLSIA GDMDGARRAI LLESPVPVGT
LPLYQTVAEA SRKYGSALKM TPEQMLEVIE RQASDGVDFM ALHCATTFET IERAKNEGRI
DPLVSYGGSH IIGWMVYHKK ENPLYENYDR ILEIAKKYSV TLSLADGMRP GCLADSLDGA
QVQELIMLGE LVDRARKAGV QIMIKGPGHM PLNHIKDTMT LQKSLCKGAP YFVFGPLLTD
LAVGYDHINA AIGGAISSWY GTEFLCYVTP AEHIGNPDVS QVRQGVIAAR IAAQAGDLAK
GMPEAIQWEL DMSNARRDLK WTEQIRLAID PEYAEYIRKT RNDGEISTCA MCGKFCAMKI
IAEHLHFDKH SC