Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0668 |
Symbol | thiC |
ID | 4206296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 786586 |
End bp | 787896 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642565228 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_697995 |
Protein GI | 110803617 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATA CAACTCAAAT GGATGCTGCT AAAAAAGGAA TAATAACAAA GGAAATGCAA GTAGTTGCAG AAAAAGAAGG AATTAATATT GAAACTTTAA TGAATTTAAT GGCTGAAGGA AAAATTGTAA TACCAGCTAA TAAAAATCAT AAAAGTATAA GTGCAGAGGG TGTTGGACAA GGATTAAAAA CTAAAATAAA TGTTAACCTA GGAATTTCAA AGGATTGTGC CAATATAGAA TTAGAGTTAG AAAAAGTTAA AAAAGCAATA GATATGAATG CAGAATCTAT AATGGATTTA AGTAATTATG GTAAAACTTA TGATTTTAGA AAAAGACTTG TAGAAGTTTC TACGGCTATG ATAGGAACTG TACCAATGTA TGATGTAGTA GGTTTCTATG ATAAAGAACT TAAAGATATA ACTGTTGATG AATTTTTTGA TGTTGTAGAA AAACATGCAA AGGATGGAGT TGACTTTGTT ACTATACATG CTGGATTAAA TAGAGAAACA ATTGAAACTT TTAGAAGAAA TAAAAGACTT ACTAATATAG TTTCTAGGGG AGGATCTCTT CTTTTTGCAT GGATGGAATT AAATAATAGA GAAAATCCTT TTTATGAATA TTTTGATAGA TTATTAGATA TATGTGAAAA GTATGATTTA ACTTTAAGTT TAGGGGATGC TTGTAGACCA GGTTCAATAG CTGATGCAAC TGATGCTGTA CAAATCAAAG AATTAATTAC CCTTGGAGAA CTAACAAAAA GAGCTTGGGA AAGAAATGTA CAAGTAATAA TAGAGGGACC AGGTCATATG GCAATGAATG AAATTGAAGC TAATGTTTTA TTAGAGAAAA AATTATGCCA TGGAGCACCA TTTTATGTTT TAGGACCAAT AGTAACTGAT ATTGCACCAG GATATGATCA TATAACAAGT GCTATAGGAG GGGCTATGGC GGCTTCTTAT GGAGCAGATT TTCTTTGTTA TGTAACACCA GCAGAACATT TAAGACTTCC TAATTTAGAG GATGTAAGGG AAGGAATAGT TGCCACAAAG ATAGCGGCTC ATGCAGCTGA TATAGCAAAA GGAATTTCAG GGGCAAGAGA TATAGATAAT AAAATGAGTG ATGCTAGGAA AAGACTAGAT TGGGACGAGA TGTTTTCTTT AGCAATAGAT AGTGAAAAAG CAATTAGATA CAGAAAAGAA TCTACTCCTG AACATAAAGA TAGTTGTACA ATGTGTGGAA AAATGTGCTC TATAAGAAAT ATGAATAAGA TTCTAGAAGG GAAGGATATA AACCTTTTAA GAGAAGACTA A
|
Protein sequence | MNYTTQMDAA KKGIITKEMQ VVAEKEGINI ETLMNLMAEG KIVIPANKNH KSISAEGVGQ GLKTKINVNL GISKDCANIE LELEKVKKAI DMNAESIMDL SNYGKTYDFR KRLVEVSTAM IGTVPMYDVV GFYDKELKDI TVDEFFDVVE KHAKDGVDFV TIHAGLNRET IETFRRNKRL TNIVSRGGSL LFAWMELNNR ENPFYEYFDR LLDICEKYDL TLSLGDACRP GSIADATDAV QIKELITLGE LTKRAWERNV QVIIEGPGHM AMNEIEANVL LEKKLCHGAP FYVLGPIVTD IAPGYDHITS AIGGAMAASY GADFLCYVTP AEHLRLPNLE DVREGIVATK IAAHAADIAK GISGARDIDN KMSDARKRLD WDEMFSLAID SEKAIRYRKE STPEHKDSCT MCGKMCSIRN MNKILEGKDI NLLRED
|
| |