Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_2082 |
Symbol | thiH |
ID | 5744088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2568774 |
End bp | 2570192 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641293179 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001559189 |
Protein GI | 160880221 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.388177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAATA AACAATCAAA AAAGGCAGAA GAATTTATTT CTAATGAGGA AATCTTAGAA ACATTAGAAT ATGCAGAAAA GAATAAACAT AATGAAGAAT TAATCGATGA GATATTAAAT AAAGCAAGAC TAAAAAAAGG ATTATCCCAC CGCGAAGCAG CTGTTCTTTT AGACTGTGAT ATTCCAGAGA AAAATGAAGA AATCTATGCT TTAGCGAAAC AACTTAAGGA AGATTTTTAT GGTAATCGTA TTGTTATGTT TGCTCCATTA TATCTTTCAA ACTATTGTAT CAACGGATGT GTATACTGTC CATATCATTT AAAAAATAAA CACATAGCTA GAAAGAAATT AACACAAGAG GAAATCCGAG AAGAAGTTAT TGCACTTCAA GATATGGGTC ATAAAAGATT AGCCTTAGAA ACTGGGGAAG ATCCAATCAA TAACCCTATT GAATATATCT TAGAGAGTAT AAACACCATT TACTCTATCA AACATAAGAA TGGAGCAATT AGACGAGTTA ATGTAAATAT TGCTGCTACT ACTGTAGAAA ACTATAAAAA GCTTCATGAT GCTGGCATTG GTACTTATAT CTTGTTTCAA GAGACCTATC ACAAAGAAAG TTATGAAGCC CTTCACCCAA CTGGTCCTAA GCATGATTAT GCTTATCATA CAGAGGCAAT GGATCGTGCG ATGCAAGGTG GTATTGATGA TGTAGGCCTT GGTGTTTTAT TCGGCTTAGA GCGCTATCGC TATGAGTTTG CTGGTCTTTT AATGCATGCA GAACATCTTG AAGAAGTATA CGGTGTGGGA CCTCATACTA TAAGTGTTCC ACGAATCCGT CCAGCTGATG ATATCGATCC AAATAGCTTT TCCAATGGCA TCAATGATGA TGTATTTGCT AAAATTGTTG CTTGTATTCG TATCTCGGTT CCTTATACCG GCATGATTGT ATCTACCAGA GAAAGTAAAA AAACTCGTGA ACGTGTGTTA CAACTTGGAG TATCTCAGAT TAGCGGAGGT TCCAAAACAA GTGTTGGAGG TTATGTTCAT TCAGAAGAAG AGGATGATAA ATCTGAACAG TTTGATGTTA TCGACCAGCG TCCATTAGGG CAAGTCGTAA AATGGTTAAT GGAACTTGGA TTTATCCCAA GTTTTTGTAC TGCATGTTAC AGAGAAGGTC GTACCGGAGA TCGTTTTATG AGTCTTTGTA AGAGTGGACA AATCGCTAAC TGCTGCCTTC CAAATGCACT AATGACGTTA AAGGAATTCT TAATGGATTA TGCAGATGAG GAAACTAGAG AAGTAGGTAA TCAACTCATT GAGACAGAAT TAGCAAAGAT TCCAAATGAA AAAGTAAAAC AAATTGCTAA GGATAATTTA ATGTCAATCA CTCTTGGTTC AAGAGATTTT CGTTTCTAA
|
Protein sequence | MYNKQSKKAE EFISNEEILE TLEYAEKNKH NEELIDEILN KARLKKGLSH REAAVLLDCD IPEKNEEIYA LAKQLKEDFY GNRIVMFAPL YLSNYCINGC VYCPYHLKNK HIARKKLTQE EIREEVIALQ DMGHKRLALE TGEDPINNPI EYILESINTI YSIKHKNGAI RRVNVNIAAT TVENYKKLHD AGIGTYILFQ ETYHKESYEA LHPTGPKHDY AYHTEAMDRA MQGGIDDVGL GVLFGLERYR YEFAGLLMHA EHLEEVYGVG PHTISVPRIR PADDIDPNSF SNGINDDVFA KIVACIRISV PYTGMIVSTR ESKKTRERVL QLGVSQISGG SKTSVGGYVH SEEEDDKSEQ FDVIDQRPLG QVVKWLMELG FIPSFCTACY REGRTGDRFM SLCKSGQIAN CCLPNALMTL KEFLMDYADE ETREVGNQLI ETELAKIPNE KVKQIAKDNL MSITLGSRDF RF
|
| |