Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1681 |
Symbol | |
ID | 8428647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1769228 |
End bp | 1770328 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645034014 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003191161 |
Protein GI | 258514939 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000164161 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTTTT ACGGAGAACT CCAAAGGTAT GAGAACTTTG ATTTTGAACT CTTTTTTAAA CAAGTTACTG ACGCCGGGAT TAAAAGAATT ATAGCCCAGC ACCGCCTTAA TGAAAGAGAT TACCTGGCGC TGCTGTCTCC CCGGGCGGAA AATTTTCTGG AGGAAATGGC TCAAAAAGCT CACCGCCTTA CTGTTCAGCA CTTTGGCAGG GTAATTTTCC TTTTTACTCC CATGTACCTG GCCAACTATT GTGTCAATCA GTGTGTGTAC TGCGGGTTTC AGGTTCATAA TAGGCTGGAA AGAAAGAAAC TTTCTCCCGC CGAAGTGGAA AAGGAAGCTA AAATTATTGC AGCTACCGGT CTAAAGCATA TACTCATTCT TACCGGAGAG TCCAGGCAGG AATCCCCGGT TTCCTATATC AGAGATTGTG TCGAGGTGCT GAAAAAATAT TTTACTTCTG TCAGCATAGA AATTTATCCA CTGGAAGAAG ACGAGTATGC CGAGCTTATT GCTGCCGGGG TGGACGGTTT GACTATGTAC CAGGAGGTAT ATAACGAGGA GGTTTATGCC GAACTGCATC CGGGTGGGCC GAAACGAAAT TACCGCTTCC GGTTGGATGC TCCGGAGCGG GCCTGCCGGG CAGGAGTGAG GACAGTTAAT GTGGGCGCCT TACTGGGACT GCATGACTGG CGAAGCGAGG CTTTTTTCAC AGGTCTGCAT GCTGATTATC TCCAGAAAAA TTTTACGGAT GTTGAGGTCA GCATATCGCC GCCGCGGATG CGCCCTCACC TGGGGGGCTT TCAACCCAGA GTTGAAGTGA GCGATCAAAA CCTGGTGCAG TACCTACTGG CCTTCCGGCT CTTTATGCCG CGCGGCGGTA TTACTGTTTC CACCAGAGAG AGGGCAGAAT TGCGGGATCA TCTTGTGCGG CTGGGCGCGA CCAAAATGTC GGCCGGTTCT TGTACTGCTG TGGGTGGGCG GTCTGATCAG GAATCCACCG GCCAGTTTGA GATATCTGAT GAGCGCAATG TGGTGGAGAT GGCGGACATG CTTTACTCTG TTGGTTACCA GCCGGTCTAT AAAGATTGGC AGTCGTTTTG A
|
Protein sequence | MSFYGELQRY ENFDFELFFK QVTDAGIKRI IAQHRLNERD YLALLSPRAE NFLEEMAQKA HRLTVQHFGR VIFLFTPMYL ANYCVNQCVY CGFQVHNRLE RKKLSPAEVE KEAKIIAATG LKHILILTGE SRQESPVSYI RDCVEVLKKY FTSVSIEIYP LEEDEYAELI AAGVDGLTMY QEVYNEEVYA ELHPGGPKRN YRFRLDAPER ACRAGVRTVN VGALLGLHDW RSEAFFTGLH ADYLQKNFTD VEVSISPPRM RPHLGGFQPR VEVSDQNLVQ YLLAFRLFMP RGGITVSTRE RAELRDHLVR LGATKMSAGS CTAVGGRSDQ ESTGQFEISD ERNVVEMADM LYSVGYQPVY KDWQSF
|
| |