Gene Dtox_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1684 
Symbol 
ID8428650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1771802 
End bp1773097 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content50% 
IMG OID645034017 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003191164 
Protein GI258514942 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000247869 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATTATA CTACTCAAAT GGACGCTGCC CGTCAAGGAA TTGTCACCGG GGAAATGGAA 
GAGGTGGCTC GCAAGGAGTT AATGGATGTA TCAGTTTTGC GGGAACTGAT CGCGGAGGGT
AAGGTGGTTA TACCCGCCAA TAAAAATCAT ACTTCCCTAA AAGCTTGCGG GATTGGGCAG
GGCTTAAAAA CAAAAATTAA CGTTAATCTA GGTGTCTCCA AAGACTGCTG CAGTATTGAG
TCTGAGATGG AAAAGGTTCG GCGTGCCATT GAACTGCAGG CGGATGCCAT TATGGATCTT
AGCTGTTACG GAAAAACCGA AGAATTTAGG CGCAGACTGG TGGAAATATC ACCGGCGGCT
GTTGGCACCG TGCCTGTCTA TGATGCTGTC GGTTTTTACG ATAAGGAATT GAAAGAAATA
ACAGCCGGGG AGTTTCTGGG AGTGGCTGAG AAACATGCCC AGGACGGGGT TGACTTTATG
ACCGTACACG CAGGGATTAA TCGGGAGACT GCCGGCCGGT TCAAGAAAAA CCCGCGTTTG
ACCAATATTG TTTCCAGAGG TGGGGCATTG CTTTATGCCT GGATGGAACT TAATGATCGA
GAAAACCCTT TCTTTGAGTA TTATGACGAA CTTCTGGATA TTTTCAGAAA GTATGATGTC
ACTATCAGCC TGGGTGATGC CTGCCGGCCG GGCAGTATTA AAGATGCTAC TGATGCCAGC
CAGATTCAGG AATTGATTAT TCTTGGTGAA TTGACTAAGC GGGCCTGGGA GAAAGATGTT
CAGGTGATGA TAGAAGGACC CGGTCATATG GCTTTAAACG AAATTGTCCC GAACATGCTT
TTGGAGAAGA AGTTATGCCA CGGTGCTCCT TTTTACGTCC TGGGACCGCT GGTTACCGAT
GTAGCTCCCG GTTACGACCA TATCACCAGT GCCATCGGCG GGGCCATCGC TGCTGCCAAT
GGGGCGGATT TCCTCTGTTA TGTAACTCCG GCGGAGCACC TGCGGCTGCC CACCCTGGAA
GATATGAAAG AGGGTATCAT CGCCTCCCGT ATTGCTGCCC ACGCGGCCGA CATAGCCAAG
GGAGTTCCCT GTGCCAGGCA GTGGGATGAT AATATGAGCG AAGCCAGGCG CAATTTGGAC
TGGCAGAGAA TGTTTGAGTT GGCCCTGGAT CCGGAGAAGG CCAGGAACTA CAGGTCACAA
TCCCAGCCTG AGAACGAGGA CACCTGCACC ATGTGCGGCA AAATGTGTGC TGTACGTAAT
ATGAATAAGG TGTTGGACGG GTCGGAGCCT ATTTAG
 
Protein sequence
MNYTTQMDAA RQGIVTGEME EVARKELMDV SVLRELIAEG KVVIPANKNH TSLKACGIGQ 
GLKTKINVNL GVSKDCCSIE SEMEKVRRAI ELQADAIMDL SCYGKTEEFR RRLVEISPAA
VGTVPVYDAV GFYDKELKEI TAGEFLGVAE KHAQDGVDFM TVHAGINRET AGRFKKNPRL
TNIVSRGGAL LYAWMELNDR ENPFFEYYDE LLDIFRKYDV TISLGDACRP GSIKDATDAS
QIQELIILGE LTKRAWEKDV QVMIEGPGHM ALNEIVPNML LEKKLCHGAP FYVLGPLVTD
VAPGYDHITS AIGGAIAAAN GADFLCYVTP AEHLRLPTLE DMKEGIIASR IAAHAADIAK
GVPCARQWDD NMSEARRNLD WQRMFELALD PEKARNYRSQ SQPENEDTCT MCGKMCAVRN
MNKVLDGSEP I