Gene Dtox_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1643 
Symbol 
ID8428609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1722538 
End bp1723641 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content44% 
IMG OID645033976 
ProductRadical SAM domain protein 
Protein accessionYP_003191123 
Protein GI258514901 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000555203 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000328368 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAGGGC TATTTCTTAA ATCGGGTTTA GAGGATATTG CAGAAAGAGT ATTGACAGGT 
AAAAGGCTTT CTTTTAATGA TGGGGTGAGG CTTTTAAAGT CACCGGATAT ATTAACCATA
GGCTATCTGG CCAGTCATAT CTGCCGTCAA AAGAATGGTG ACAGGGTGCA TTTCATTGTT
AACCGCCATA TTAATCATAC TAATATATGT GCCAATAGGT GTAAGTTTTG CGCTTTTGGC
AGGGATGCCG GGGATAAAGG TGCCTATGTC TTGTCACTGG ATGAAATAGA AGCCAAGGCG
CTGGCTTCCC GGAATGAAAA TATTTCTGAG ATCCATATCG TCGGCGGTTT GAACCCTGAC
TTAAAACTGG ATTATTATGT GGAAGCATTA AAGAGAGTAA AGCGAGTATT GCCGGGGGTT
ATTATTCAAT CTTTTACTGC TGTTGAGGTT GCCTATCTGG CCGAGCAGCA TAATATGTCC
CTGCAGGAAG TCTTGATTAC ACTAAAAGAG GCGGGGTTGG ATTCACTTCC GGGCGGGGGG
GCAGAAGTAT TTGCTCCGAG AGTTCGAGAC CTCGTGTGTG AGAAAAAAAT TAGCGGGGAG
CACTGGCTGG CAGTCCACGA AGCAGCGCAC GGCATAGGCA TGAGAACCAA CGCCACTATG
TTGTACGGTC ATGTTGAGAC TATTGAGGAA AGAGTGGATC ATTTAATCAA ACTTCGCGAT
TTACAAGACA GTACAGGAGG TTTTTTAACC TTTATACCAC TTGCTTTTCA TCCTAAGAAC
ACTCCTATGG AAGCCATGGG TTTAGCCAGG TCAACAGGAT ACGATGATTT AAAGGTATTA
GCAGTCTCCA GATTATTACT GGATAACTTC GACCATATTA AAGCATACTG GCTGATGATC
GGGCCTAAAC TGGCTCAAGT TTCACTGGCC TTTGGGGTAG ATGATTTAGA TGGTACAGTG
GTTGAGGAAC AAATAGCTCA TGACGCCGGA GCAGACACGG AACAATATAT GTCCAAAAAA
AATTTGATTA ATATGATAAA GGCGGCCGGG AGAATTCCGG TAGAGCGGGA CACCCTGTAC
AACACCATTA GGGAGGGTTT CTAG
 
Protein sequence
MEGLFLKSGL EDIAERVLTG KRLSFNDGVR LLKSPDILTI GYLASHICRQ KNGDRVHFIV 
NRHINHTNIC ANRCKFCAFG RDAGDKGAYV LSLDEIEAKA LASRNENISE IHIVGGLNPD
LKLDYYVEAL KRVKRVLPGV IIQSFTAVEV AYLAEQHNMS LQEVLITLKE AGLDSLPGGG
AEVFAPRVRD LVCEKKISGE HWLAVHEAAH GIGMRTNATM LYGHVETIEE RVDHLIKLRD
LQDSTGGFLT FIPLAFHPKN TPMEAMGLAR STGYDDLKVL AVSRLLLDNF DHIKAYWLMI
GPKLAQVSLA FGVDDLDGTV VEEQIAHDAG ADTEQYMSKK NLINMIKAAG RIPVERDTLY
NTIREGF