Gene CPR_0668 details

Gene Information

Locus tag: CPR_0668
Symbol: thiC
ID: 4206296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 786586
End bp: 787896
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 31%
IMG OID: 642565228
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_697995
Protein GI: 110803617
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
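
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 786586
END_BP = 787896
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1311
print(expected_protein_length(GENE_LENGTH_BP))  # 436
```

Both results agree with the reported Gene Length and Protein Length, which is consistent with the two assumptions stated above.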


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTATA CAACTCAAAT GGATGCTGCT AAAAAAGGAA TAATAACAAA GGAAATGCAA 
GTAGTTGCAG AAAAAGAAGG AATTAATATT GAAACTTTAA TGAATTTAAT GGCTGAAGGA
AAAATTGTAA TACCAGCTAA TAAAAATCAT AAAAGTATAA GTGCAGAGGG TGTTGGACAA
GGATTAAAAA CTAAAATAAA TGTTAACCTA GGAATTTCAA AGGATTGTGC CAATATAGAA
TTAGAGTTAG AAAAAGTTAA AAAAGCAATA GATATGAATG CAGAATCTAT AATGGATTTA
AGTAATTATG GTAAAACTTA TGATTTTAGA AAAAGACTTG TAGAAGTTTC TACGGCTATG
ATAGGAACTG TACCAATGTA TGATGTAGTA GGTTTCTATG ATAAAGAACT TAAAGATATA
ACTGTTGATG AATTTTTTGA TGTTGTAGAA AAACATGCAA AGGATGGAGT TGACTTTGTT
ACTATACATG CTGGATTAAA TAGAGAAACA ATTGAAACTT TTAGAAGAAA TAAAAGACTT
ACTAATATAG TTTCTAGGGG AGGATCTCTT CTTTTTGCAT GGATGGAATT AAATAATAGA
GAAAATCCTT TTTATGAATA TTTTGATAGA TTATTAGATA TATGTGAAAA GTATGATTTA
ACTTTAAGTT TAGGGGATGC TTGTAGACCA GGTTCAATAG CTGATGCAAC TGATGCTGTA
CAAATCAAAG AATTAATTAC CCTTGGAGAA CTAACAAAAA GAGCTTGGGA AAGAAATGTA
CAAGTAATAA TAGAGGGACC AGGTCATATG GCAATGAATG AAATTGAAGC TAATGTTTTA
TTAGAGAAAA AATTATGCCA TGGAGCACCA TTTTATGTTT TAGGACCAAT AGTAACTGAT
ATTGCACCAG GATATGATCA TATAACAAGT GCTATAGGAG GGGCTATGGC GGCTTCTTAT
GGAGCAGATT TTCTTTGTTA TGTAACACCA GCAGAACATT TAAGACTTCC TAATTTAGAG
GATGTAAGGG AAGGAATAGT TGCCACAAAG ATAGCGGCTC ATGCAGCTGA TATAGCAAAA
GGAATTTCAG GGGCAAGAGA TATAGATAAT AAAATGAGTG ATGCTAGGAA AAGACTAGAT
TGGGACGAGA TGTTTTCTTT AGCAATAGAT AGTGAAAAAG CAATTAGATA CAGAAAAGAA
TCTACTCCTG AACATAAAGA TAGTTGTACA ATGTGTGGAA AAATGTGCTC TATAAGAAAT
ATGAATAAGA TTCTAGAAGG GAAGGATATA AACCTTTTAA GAGAAGACTA A
 
Protein sequence
MNYTTQMDAA KKGIITKEMQ VVAEKEGINI ETLMNLMAEG KIVIPANKNH KSISAEGVGQ 
GLKTKINVNL GISKDCANIE LELEKVKKAI DMNAESIMDL SNYGKTYDFR KRLVEVSTAM
IGTVPMYDVV GFYDKELKDI TVDEFFDVVE KHAKDGVDFV TIHAGLNRET IETFRRNKRL
TNIVSRGGSL LFAWMELNNR ENPFYEYFDR LLDICEKYDL TLSLGDACRP GSIADATDAV
QIKELITLGE LTKRAWERNV QVIIEGPGHM AMNEIEANVL LEKKLCHGAP FYVLGPIVTD
IAPGYDHITS AIGGAMAASY GADFLCYVTP AEHLRLPNLE DVREGIVATK IAAHAADIAK
GISGARDIDN KMSDARKRLD WDEMFSLAID SEKAIRYRKE STPEHKDSCT MCGKMCSIRN
MNKILEGKDI NLLRED
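
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count over the gene sequence. The following is a rough sketch of both calculations, not the database's own method. It assumes the sequence is given in the coding orientation, that the space-separated blocks of ten characters are purely cosmetic, and that the sequence contains only A, C, G and T. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the standard codon assignments are used here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGAATTATA CAACTCAAAT GGATGCTGCT"
print(translate(first_blocks))  # MNYTTQMDAA
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the listed protein sequence and the reported GC content.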