Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1570 |
Symbol | thiH |
ID | 4204108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1761165 |
End bp | 1762268 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642566121 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_698886 |
Protein GI | 110801691 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.25811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTTTTT ATGATGTAGT AGAAAAATAT AGAGATTTTG ATTTTTATGG ATATTTTGAT TCTGTAAAAA AGGAAGATGT GTTAAGAAGT ATTTATGAAA GAAATAAGAG GCCAGAGGAT TTACTTAATT TAATATCTCC TATGGGAGAA TTAGTTTTAG AAGAAATGGC TCAAGAAGCA AGAAATCTCT CCTTAAAATA TTTTGGAAGA ACAATATTAT TATATACACC TATGTATATC TCGAATTATT GTGTAAATAA GTGTTCATAT TGTGGGTATA ATGTAGAAAA TAAAATATGT AGGAAAAAAT TAAATCAAGA AGAAATAGAA AAAGAGGGGG AAGCTATTTC AAAGGAGGGA TTTAAACATA TTCTAATATT AACAGGAGAA AGTGAATATC ATACTCCAGT AGAGTATATA GAAAAGAGTA TTAAAACTTT GAAAGGGAAA TTTCCTTCAA TAACCATTGA AATATACCCA ATGACAGAAG AGGGATATAA AAAAGTGGTA GAAGCAGGTG CTGAAGGGCT TACTGTATAT CAAGAGACCT ATGATGAAAA GGTATATGAT AGGGTTCACG TGGCTGGTCC AAAGAAAAAT TATAAATTCA GATTAGAAGC TCCAGAGAGA GGAGCAGAAG CTGGAATGAG AAGCATAAGT ATAGGAGCCT TATTAGGATT AGCTGATTTT AGAATAGATG CCTTCTTTAC AGCAATGCAT GGAAAATATT TAAGGGATAA GTATCCTCAT ATAGATATAA GTTATTCAGT TCCAAGAATA AGACACTGCG AAGGAGGGCT TAAAAAGTTA AATGAAGTTT ATGATAGGGA ACTAGTTCAA ATACTTTTAG CCTATAGACT ATTTGATCCC CAAGGAGGAA TAAATATATC TACTAGAGAA GGAAAGGATT TTAGAAGAAA TTTAATTCCC TTAGGAGTGA GTAAAATCAG TGCTGGAGTT TCAACTGAGG TTGGAGGCCA TTCTTTAAAA GAAAAAGGTA CAAGTCAATT TGATATAAAT GATGAAAGTT CTGTAAGTGA AGTTAAGGAA TTAATAAAAA GTGAAGGTTA TCAACCTATA TTTAAGGATT GGCATAGATT TTAA
|
Protein sequence | MSFYDVVEKY RDFDFYGYFD SVKKEDVLRS IYERNKRPED LLNLISPMGE LVLEEMAQEA RNLSLKYFGR TILLYTPMYI SNYCVNKCSY CGYNVENKIC RKKLNQEEIE KEGEAISKEG FKHILILTGE SEYHTPVEYI EKSIKTLKGK FPSITIEIYP MTEEGYKKVV EAGAEGLTVY QETYDEKVYD RVHVAGPKKN YKFRLEAPER GAEAGMRSIS IGALLGLADF RIDAFFTAMH GKYLRDKYPH IDISYSVPRI RHCEGGLKKL NEVYDRELVQ ILLAYRLFDP QGGINISTRE GKDFRRNLIP LGVSKISAGV STEVGGHSLK EKGTSQFDIN DESSVSEVKE LIKSEGYQPI FKDWHRF
|
| |