Gene CPR_1570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1570 
SymbolthiH 
ID4204108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1761165 
End bp1762268 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content31% 
IMG OID642566121 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_698886 
Protein GI110801691 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.25811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTTTTT ATGATGTAGT AGAAAAATAT AGAGATTTTG ATTTTTATGG ATATTTTGAT 
TCTGTAAAAA AGGAAGATGT GTTAAGAAGT ATTTATGAAA GAAATAAGAG GCCAGAGGAT
TTACTTAATT TAATATCTCC TATGGGAGAA TTAGTTTTAG AAGAAATGGC TCAAGAAGCA
AGAAATCTCT CCTTAAAATA TTTTGGAAGA ACAATATTAT TATATACACC TATGTATATC
TCGAATTATT GTGTAAATAA GTGTTCATAT TGTGGGTATA ATGTAGAAAA TAAAATATGT
AGGAAAAAAT TAAATCAAGA AGAAATAGAA AAAGAGGGGG AAGCTATTTC AAAGGAGGGA
TTTAAACATA TTCTAATATT AACAGGAGAA AGTGAATATC ATACTCCAGT AGAGTATATA
GAAAAGAGTA TTAAAACTTT GAAAGGGAAA TTTCCTTCAA TAACCATTGA AATATACCCA
ATGACAGAAG AGGGATATAA AAAAGTGGTA GAAGCAGGTG CTGAAGGGCT TACTGTATAT
CAAGAGACCT ATGATGAAAA GGTATATGAT AGGGTTCACG TGGCTGGTCC AAAGAAAAAT
TATAAATTCA GATTAGAAGC TCCAGAGAGA GGAGCAGAAG CTGGAATGAG AAGCATAAGT
ATAGGAGCCT TATTAGGATT AGCTGATTTT AGAATAGATG CCTTCTTTAC AGCAATGCAT
GGAAAATATT TAAGGGATAA GTATCCTCAT ATAGATATAA GTTATTCAGT TCCAAGAATA
AGACACTGCG AAGGAGGGCT TAAAAAGTTA AATGAAGTTT ATGATAGGGA ACTAGTTCAA
ATACTTTTAG CCTATAGACT ATTTGATCCC CAAGGAGGAA TAAATATATC TACTAGAGAA
GGAAAGGATT TTAGAAGAAA TTTAATTCCC TTAGGAGTGA GTAAAATCAG TGCTGGAGTT
TCAACTGAGG TTGGAGGCCA TTCTTTAAAA GAAAAAGGTA CAAGTCAATT TGATATAAAT
GATGAAAGTT CTGTAAGTGA AGTTAAGGAA TTAATAAAAA GTGAAGGTTA TCAACCTATA
TTTAAGGATT GGCATAGATT TTAA
 
Protein sequence
MSFYDVVEKY RDFDFYGYFD SVKKEDVLRS IYERNKRPED LLNLISPMGE LVLEEMAQEA 
RNLSLKYFGR TILLYTPMYI SNYCVNKCSY CGYNVENKIC RKKLNQEEIE KEGEAISKEG
FKHILILTGE SEYHTPVEYI EKSIKTLKGK FPSITIEIYP MTEEGYKKVV EAGAEGLTVY
QETYDEKVYD RVHVAGPKKN YKFRLEAPER GAEAGMRSIS IGALLGLADF RIDAFFTAMH
GKYLRDKYPH IDISYSVPRI RHCEGGLKKL NEVYDRELVQ ILLAYRLFDP QGGINISTRE
GKDFRRNLIP LGVSKISAGV STEVGGHSLK EKGTSQFDIN DESSVSEVKE LIKSEGYQPI
FKDWHRF