Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0244 |
Symbol | thiH |
ID | 4184995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 293903 |
End bp | 295018 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 638070254 |
Product | thiamine biosynthesis protein |
Protein accession | YP_676876 |
Protein GI | 110636669 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.913346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAGT TTAAAGAAAT CTTTGATCAG TATTCCTGGG ATGAAGTATA TGCTTCTATC TACGCTAAAA AAGCGCTGGA TGTAGAACGT GCCTTGTCTA AAGAGAATCT GGATCTGGAA GATTTCAAAG CATTGGTTTC TCCCGCTGCC GCTGCGTATT TGCCTGCTAT GGCCGAACGC AGTCATCAGC GTACCTTACA GCGCTTCGGT AAAACGATGC AGATGTATGT GCCTTTGTAT CTTTCGAACG AGTGCCAGAA CATTTGTACT TACTGTGGTT TCAGCATGGA TAATAAACTG CTGCGTAAAA CCTTAAAGGA CGAAGAGATT ATCCGTGAAG CAAAAGCCAT TAAAGAAATG GGTTTTGATC ACGTGCTGCT GGTAACGGGT GAAGCGAATC AGATGGTTGG CGTGCCGTAT TTAAAACATG CGATTGAACT ATTGCGTCCA TACTTTGCAC AGATCTCGAT TGAAGTGCAG CCGCTGGATG AAGATGAATA TAAAACACTG ATTGATGCCG GAGCTTACGC CGTATTGGTG TATCAGGAAA CGTATCATCA GGAAGAATAT AAAACACATC ATCCAAAAGG AAAGAAATCA AATTTCTATT ACCGTCTGGA TACGCCGGAC CGTGCGGCAC GCGCAGGTGT AGATAAGTTG GGCCTGGGTG TATTAATCGG TCTGGAAGAC TGGCGGGTAG ATAGTTTCTT TACAGCACTT CACTTGAATT ACTTAGAGAA ACAATACTGG CAAACGAAAT ATTCCCTGTC GTTTCCCCGC TTGCGTCCGT ATGTGGGTAA CACCGAACCG AAAGTAATTA TGAACGATCG CGAACTGGTG CAATTGATTT GCGCGTACCG TTTGTTCGAT CAGGAATTAG AACTATCCAT TTCGACACGC GAAACCGAAG CGTTCCGGAA TCACATTATA AAATTAGGCA TCACGTCCAT AAGCGCTGGC TCAAAAACAA ATCCGGGCGG CTATGTGGTA GAGAAAGAAT CGCTTGAACA GTTCGAGATC TCCGACGACC GCACTCCACA ACAGATAGCA ACAATGCTGA AAGGCGCGGG CTACGAACCG GTGTGGAAGG ATTGGGCCCA GGCGTATGAT GTGTAA
|
Protein sequence | MSQFKEIFDQ YSWDEVYASI YAKKALDVER ALSKENLDLE DFKALVSPAA AAYLPAMAER SHQRTLQRFG KTMQMYVPLY LSNECQNICT YCGFSMDNKL LRKTLKDEEI IREAKAIKEM GFDHVLLVTG EANQMVGVPY LKHAIELLRP YFAQISIEVQ PLDEDEYKTL IDAGAYAVLV YQETYHQEEY KTHHPKGKKS NFYYRLDTPD RAARAGVDKL GLGVLIGLED WRVDSFFTAL HLNYLEKQYW QTKYSLSFPR LRPYVGNTEP KVIMNDRELV QLICAYRLFD QELELSISTR ETEAFRNHII KLGITSISAG SKTNPGGYVV EKESLEQFEI SDDRTPQQIA TMLKGAGYEP VWKDWAQAYD V
|
| |