Gene CHU_0244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_0244 
SymbolthiH 
ID4184995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp293903 
End bp295018 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content45% 
IMG OID638070254 
Productthiamine biosynthesis protein 
Protein accessionYP_676876 
Protein GI110636669 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.913346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGT TTAAAGAAAT CTTTGATCAG TATTCCTGGG ATGAAGTATA TGCTTCTATC 
TACGCTAAAA AAGCGCTGGA TGTAGAACGT GCCTTGTCTA AAGAGAATCT GGATCTGGAA
GATTTCAAAG CATTGGTTTC TCCCGCTGCC GCTGCGTATT TGCCTGCTAT GGCCGAACGC
AGTCATCAGC GTACCTTACA GCGCTTCGGT AAAACGATGC AGATGTATGT GCCTTTGTAT
CTTTCGAACG AGTGCCAGAA CATTTGTACT TACTGTGGTT TCAGCATGGA TAATAAACTG
CTGCGTAAAA CCTTAAAGGA CGAAGAGATT ATCCGTGAAG CAAAAGCCAT TAAAGAAATG
GGTTTTGATC ACGTGCTGCT GGTAACGGGT GAAGCGAATC AGATGGTTGG CGTGCCGTAT
TTAAAACATG CGATTGAACT ATTGCGTCCA TACTTTGCAC AGATCTCGAT TGAAGTGCAG
CCGCTGGATG AAGATGAATA TAAAACACTG ATTGATGCCG GAGCTTACGC CGTATTGGTG
TATCAGGAAA CGTATCATCA GGAAGAATAT AAAACACATC ATCCAAAAGG AAAGAAATCA
AATTTCTATT ACCGTCTGGA TACGCCGGAC CGTGCGGCAC GCGCAGGTGT AGATAAGTTG
GGCCTGGGTG TATTAATCGG TCTGGAAGAC TGGCGGGTAG ATAGTTTCTT TACAGCACTT
CACTTGAATT ACTTAGAGAA ACAATACTGG CAAACGAAAT ATTCCCTGTC GTTTCCCCGC
TTGCGTCCGT ATGTGGGTAA CACCGAACCG AAAGTAATTA TGAACGATCG CGAACTGGTG
CAATTGATTT GCGCGTACCG TTTGTTCGAT CAGGAATTAG AACTATCCAT TTCGACACGC
GAAACCGAAG CGTTCCGGAA TCACATTATA AAATTAGGCA TCACGTCCAT AAGCGCTGGC
TCAAAAACAA ATCCGGGCGG CTATGTGGTA GAGAAAGAAT CGCTTGAACA GTTCGAGATC
TCCGACGACC GCACTCCACA ACAGATAGCA ACAATGCTGA AAGGCGCGGG CTACGAACCG
GTGTGGAAGG ATTGGGCCCA GGCGTATGAT GTGTAA
 
Protein sequence
MSQFKEIFDQ YSWDEVYASI YAKKALDVER ALSKENLDLE DFKALVSPAA AAYLPAMAER 
SHQRTLQRFG KTMQMYVPLY LSNECQNICT YCGFSMDNKL LRKTLKDEEI IREAKAIKEM
GFDHVLLVTG EANQMVGVPY LKHAIELLRP YFAQISIEVQ PLDEDEYKTL IDAGAYAVLV
YQETYHQEEY KTHHPKGKKS NFYYRLDTPD RAARAGVDKL GLGVLIGLED WRVDSFFTAL
HLNYLEKQYW QTKYSLSFPR LRPYVGNTEP KVIMNDRELV QLICAYRLFD QELELSISTR
ETEAFRNHII KLGITSISAG SKTNPGGYVV EKESLEQFEI SDDRTPQQIA TMLKGAGYEP
VWKDWAQAYD V