Gene Cphy_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2082 
SymbolthiH 
ID5744088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2568774 
End bp2570192 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content35% 
IMG OID641293179 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001559189 
Protein GI160880221 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.388177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAATA AACAATCAAA AAAGGCAGAA GAATTTATTT CTAATGAGGA AATCTTAGAA 
ACATTAGAAT ATGCAGAAAA GAATAAACAT AATGAAGAAT TAATCGATGA GATATTAAAT
AAAGCAAGAC TAAAAAAAGG ATTATCCCAC CGCGAAGCAG CTGTTCTTTT AGACTGTGAT
ATTCCAGAGA AAAATGAAGA AATCTATGCT TTAGCGAAAC AACTTAAGGA AGATTTTTAT
GGTAATCGTA TTGTTATGTT TGCTCCATTA TATCTTTCAA ACTATTGTAT CAACGGATGT
GTATACTGTC CATATCATTT AAAAAATAAA CACATAGCTA GAAAGAAATT AACACAAGAG
GAAATCCGAG AAGAAGTTAT TGCACTTCAA GATATGGGTC ATAAAAGATT AGCCTTAGAA
ACTGGGGAAG ATCCAATCAA TAACCCTATT GAATATATCT TAGAGAGTAT AAACACCATT
TACTCTATCA AACATAAGAA TGGAGCAATT AGACGAGTTA ATGTAAATAT TGCTGCTACT
ACTGTAGAAA ACTATAAAAA GCTTCATGAT GCTGGCATTG GTACTTATAT CTTGTTTCAA
GAGACCTATC ACAAAGAAAG TTATGAAGCC CTTCACCCAA CTGGTCCTAA GCATGATTAT
GCTTATCATA CAGAGGCAAT GGATCGTGCG ATGCAAGGTG GTATTGATGA TGTAGGCCTT
GGTGTTTTAT TCGGCTTAGA GCGCTATCGC TATGAGTTTG CTGGTCTTTT AATGCATGCA
GAACATCTTG AAGAAGTATA CGGTGTGGGA CCTCATACTA TAAGTGTTCC ACGAATCCGT
CCAGCTGATG ATATCGATCC AAATAGCTTT TCCAATGGCA TCAATGATGA TGTATTTGCT
AAAATTGTTG CTTGTATTCG TATCTCGGTT CCTTATACCG GCATGATTGT ATCTACCAGA
GAAAGTAAAA AAACTCGTGA ACGTGTGTTA CAACTTGGAG TATCTCAGAT TAGCGGAGGT
TCCAAAACAA GTGTTGGAGG TTATGTTCAT TCAGAAGAAG AGGATGATAA ATCTGAACAG
TTTGATGTTA TCGACCAGCG TCCATTAGGG CAAGTCGTAA AATGGTTAAT GGAACTTGGA
TTTATCCCAA GTTTTTGTAC TGCATGTTAC AGAGAAGGTC GTACCGGAGA TCGTTTTATG
AGTCTTTGTA AGAGTGGACA AATCGCTAAC TGCTGCCTTC CAAATGCACT AATGACGTTA
AAGGAATTCT TAATGGATTA TGCAGATGAG GAAACTAGAG AAGTAGGTAA TCAACTCATT
GAGACAGAAT TAGCAAAGAT TCCAAATGAA AAAGTAAAAC AAATTGCTAA GGATAATTTA
ATGTCAATCA CTCTTGGTTC AAGAGATTTT CGTTTCTAA
 
Protein sequence
MYNKQSKKAE EFISNEEILE TLEYAEKNKH NEELIDEILN KARLKKGLSH REAAVLLDCD 
IPEKNEEIYA LAKQLKEDFY GNRIVMFAPL YLSNYCINGC VYCPYHLKNK HIARKKLTQE
EIREEVIALQ DMGHKRLALE TGEDPINNPI EYILESINTI YSIKHKNGAI RRVNVNIAAT
TVENYKKLHD AGIGTYILFQ ETYHKESYEA LHPTGPKHDY AYHTEAMDRA MQGGIDDVGL
GVLFGLERYR YEFAGLLMHA EHLEEVYGVG PHTISVPRIR PADDIDPNSF SNGINDDVFA
KIVACIRISV PYTGMIVSTR ESKKTRERVL QLGVSQISGG SKTSVGGYVH SEEEDDKSEQ
FDVIDQRPLG QVVKWLMELG FIPSFCTACY REGRTGDRFM SLCKSGQIAN CCLPNALMTL
KEFLMDYADE ETREVGNQLI ETELAKIPNE KVKQIAKDNL MSITLGSRDF RF