Gene Information |
Locus tag | Ccel_3104 |
Symbol | thiH |
ID | 7311701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3640231 |
End bp | 3641643 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643610008 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002507376 |
Protein GI | 220930467 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
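
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon (consistent with the gene sequence in this record ending in TAA). The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Ccel_3104, thiH).
length = gene_length_bp(3640231, 3641643)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1413 bp) and Protein Length (470 aa). The strand field does not affect this calculation, since it changes only the orientation of the feature and not its extent.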
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAACA GTAAATCAAA AAAGGCTGAA GACTTTATTA ACGACGAGGA GATATTAGAA ACGTTGGAGT ATGCCCGTAG GAACAAGGAG AATATGTCAC TGATAGAAGA TATTCTTAAA AAGGCAGCTG AGTACAAAGG ATTAAGCTAT AGGGAAGCAG CAGTATTATT GGAATGTGAG CTTGACGAGG TTAAAGAAAA AGTGTTCGGT CTTGCAGAGC ATATTAAAAA GAAATTCTAT GGAAACAGGA TAGTAATGTT TGCACCTCTT TATCTTTCGA ACTACTGTGT AAATGAGTGC AGATACTGCC CTTACCATGG TTCCAACAAG CATATTTCAA GAAAACAGCT GTCACAGGAG GATATAGTCA GGGAGGTTAT GGCCTTGCAG GATATGGGTC ACAAACGACT TGCCCTTGAA ACGGGAGAGG ACCCGGAGAA CTGTCCTATA GAATATGTAT TGGAAAGTAT AAAAACAATT TACGGAATAA AGCATAAGAA CGGTGCAATC CGCCGTGTCA ATGTGAATAT TGCCGCTACA ACCATTGAAA ATTACAAAAA ACTCAAAGAT GCGGGAATAG GAACATATAT ACTGTTTCAG GAAACTTACC ATAAGCCCAC ATACGAGTAC CTGCACCCCA AGGGTCCTAA GCACAATTAT GCTTATCATA CTGAAGCCAT GGACAGAGCA ATGGAGGGTG GAATTGATGA TGTAGGACTT GGTGTCCTCT TTGGTCTGAA CCTTTACAAG TATGATTTTG TAGGTCTGCT CATGCACGCA AAGCATTTGG AGGATGCAAT GGGAGTTGGG CCTCATACAA TCAGTGTACC ACGTATAAGG CCGGCTGATG ATGTGGATTT GAAAGAATAT TCAAATGCAA TACCTGACTC TATATTTGAA AAAATTGTAG CTATACTTCG TATAGCGGTA CCATACACAG GTATAATCAT GTCCACAAGA GAATCAGAAA AGACCCGTGG GGAATGTCTC AAATTGGGTG TTTCTCAAAT TAGCGGAGGA TCATCAACAA GTGTGGGCGG TTATGTAGAA AAAGAAGCAG AGAATTCTGC ACAGTTCGAA GTTAACGATA CAAGAACCAT GGACGAAGTA GTTAACTGGC TCCTGACATT GGGGTATATT CCAAGCTTCT GTACAGCATG CTACCGGGAA GGTCGAACAG GGGACAGATT TATGAGACTT GTTAAAAGCG GTGCAATTGC ACAGGTTTGT CATCCCAATG CAATTATGAC ATTAAAGGAA TATCTGGAAG ACTATGCATC GGAAGATACA AGAGCAAAAG GTGAGAAAAT GATAGAAAAA GAAGTGGAGC TACTGCAAAA CAGCGATGTT AAAAGAATCG TTAAAGAACA TTTAAGTGAC CTCCATGAGG GTAAGAGGGA TTTCAGGTTC TAA
|
Protein sequence | MYNSKSKKAE DFINDEEILE TLEYARRNKE NMSLIEDILK KAAEYKGLSY REAAVLLECE LDEVKEKVFG LAEHIKKKFY GNRIVMFAPL YLSNYCVNEC RYCPYHGSNK HISRKQLSQE DIVREVMALQ DMGHKRLALE TGEDPENCPI EYVLESIKTI YGIKHKNGAI RRVNVNIAAT TIENYKKLKD AGIGTYILFQ ETYHKPTYEY LHPKGPKHNY AYHTEAMDRA MEGGIDDVGL GVLFGLNLYK YDFVGLLMHA KHLEDAMGVG PHTISVPRIR PADDVDLKEY SNAIPDSIFE KIVAILRIAV PYTGIIMSTR ESEKTRGECL KLGVSQISGG SSTSVGGYVE KEAENSAQFE VNDTRTMDEV VNWLLTLGYI PSFCTACYRE GRTGDRFMRL VKSGAIAQVC HPNAIMTLKE YLEDYASEDT RAKGEKMIEK EVELLQNSDV KRIVKEHLSD LHEGKRDFRF