Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1853 |
Symbol | thiH |
ID | 4203863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 2089404 |
End bp | 2090507 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638082723 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_696287 |
Protein GI | 110800869 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.287617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTTTTT ATGATGTAGT AGAAAAATAT AGAGATTTTG ATTTTTATGG ATATTTTGAT TCTGTAAAAA AGGAAGATGT ATTAAGAAGT ATTTATGAAA GAAATAAGAG ACCAGAGGAT TTACTTAATT TAATATCTCC TATGGGAGAA GAGCTTTTAG AAGAAATGGC TCAAGAAGCA AGAAATCTCT CTTTAAAATA TTTTGGAAGA ACAATATTAC TTTATACACC TATGTATATC TCAAATTATT GTGTAAATAA GTGTTCATAT TGTGGGTATA ATGTAGAAAA TAAAATATGT AGGAAAAAAT TAAATCAAGA AGAAATAGAA AAAGAGGGGA AAGCTATTTC AAAGGATGGA TTTAAACATA TTCTAATATT AACAGGGGAA AGTGAATATC ATACTCCAGT AGAGTATATA GAGGAGAGCA TTAAAACTTT GAAAGAGAAA TTTCCTTCCA TAACTATTGA AATATATCCA ATGACAGAAG AGGGATATAA AAAGGTGGTA GAAGCAGGTG CTGAAGGGCT TACTGTATAT CAAGAGACCT ATGATGAAAA AGTATATGAT AGGGTTCATG TGGCTGGTCC AAAGAAAAAT TATAAATTCA GATTAGAAGC TCCAGAGAGA GGAGCAGAAG CTGGAATGAG AAGCATAAGT ATAGGAGCCT TATTAGGATT AGCTGATTTT AGAATAGATG CCTTCTTTAC AGCAATGCAT GGAAAATATT TAAGAGATAA GTATCCTCAT ATAGATATAA GTTATTCGGT GCCAAGAATA AGACCCTGTG AAGGAGGGCT TAAAAAGTTA AATGAAGTTG ATGATAGGGA ACTAGTACAA ATACTTTTAG CCTATAGACT ATTTGATCCT CAAGGAGGAA TAAATATATC TACTAGAGAA GGAAAGGATT TTAGAAGAAA TTTAATTCCT TTAGGAGTAA GTAAAATTAG TGCTGGAGTT TCTACTGAAG TTGGAGGCCA TTCTTTAAAA GAAAAAGGTA CAAGTCAATT TGATATAAAT GATGAAAGTT CTGTAAGTGA AGTTAAGGAA TTAATAAAAA GTCAAGGTTA TCAACCTATA TTTAAGGATT GGCATAGATT TTAA
|
Protein sequence | MSFYDVVEKY RDFDFYGYFD SVKKEDVLRS IYERNKRPED LLNLISPMGE ELLEEMAQEA RNLSLKYFGR TILLYTPMYI SNYCVNKCSY CGYNVENKIC RKKLNQEEIE KEGKAISKDG FKHILILTGE SEYHTPVEYI EESIKTLKEK FPSITIEIYP MTEEGYKKVV EAGAEGLTVY QETYDEKVYD RVHVAGPKKN YKFRLEAPER GAEAGMRSIS IGALLGLADF RIDAFFTAMH GKYLRDKYPH IDISYSVPRI RPCEGGLKKL NEVDDRELVQ ILLAYRLFDP QGGINISTRE GKDFRRNLIP LGVSKISAGV STEVGGHSLK EKGTSQFDIN DESSVSEVKE LIKSQGYQPI FKDWHRF
|
| |