Gene CPF_0670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0670 
SymbolthiC 
ID4201374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp797635 
End bp798945 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content32% 
IMG OID638081555 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_695123 
Protein GI110798961 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATA CAACTCAAAT GGATGCTGCT AAAAAAGGAA TAATAACAAA GGAAATGCAA 
GTAGTTTCAG AAAAAGAAGG AATTAATATT GAAACTTTAA TGAATTTAAT GGCTGAAGGA
AAAATTGTAA TACCAGCTAA TAAAAATCAT AAAAGTATAA GTGCAGAAGG TGTTGGACAA
GGGTTAAGAA CTAAAATAAA TGTTAACCTA GGAATTTCAA AGGACTGTGC CAATATAGAA
TTAGAGTTAG AAAAAGTTAA AAAAGCAATA GATATGAATG CAGAATCTAT AATGGATTTA
AGTAATTATG GTAAAACTTA TGATTTTAGA AAAAGACTTG TAGAAGTTTC TACGGCTATG
ATAGGAACTG TACCAATGTA TGATGTAGTA GGTTTCTATG ATAAAGAGCT TAAAGATATA
ACTGTTGATG AATTTTTTGA AGTTGTAGAA AAACATGCAA AGGATGGAGT TGACTTTGTT
ACTATACATG CTGGATTAAA TAGAGAAACA ATTGAAACTT TTAGAAGAAA TAAAAGACTT
ACTAATATAG TTTCTAGAGG AGGATCTCTT CTTTTTGCAT GGATGGAATT AAATAATAGA
GAAAATCCTT TCTATGAATA TTTTGATAGA TTATTAGATA TATGTGAAAA GTATGATTTA
ACTTTAAGTT TAGGGGATGC TTGTAGACCA GGTTCAATAG CTGATGCAAC TGATGCTGTA
CAAATCAAAG AATTAATTAC TCTTGGAGAG CTAACAAAAA GAGCTTGGGA AAGAAATGTA
CAAGTAATAA TAGAGGGTCC AGGGCATATG GCAATGAATG AAATAGAAGC TAATGTTTTA
TTAGAGAAAA AATTATGCCA TGGAGCACCA TTTTATGTTT TAGGACCAAT AGTAACTGAT
ATTGCACCAG GATATGATCA TATAACAAGT GCTATAGGAG GGGCTATGGC AGCTTCTTAT
GGAGCAGATT TTCTTTGTTA TGTAACACCA GCAGAACATT TAAGACTTCC TAATTTAGAG
GATGTAAGGG AAGGAATAGT TGCCACAAAG ATAGCGGCTC ATGCAGCTGA CATAGCAAAA
GGAATTTCTG GGGCAAGGGA CATAGATAAT AAAATGAGTG ATGCTAGGAA AAGACTAGAT
TGGGACGAGA TGTTTTCTTT AGCTATAGAT AGTGAAAAAG CCATTAGATA TAGAAAAGAA
TCTACTCCTG AACATAAAGA TAGTTGTACA ATGTGTGGAA AAATGTGCTC TATAAGAAAT
ATGAATAAGA TTCTAGAAGG AAAGGATATA AATCTTTTAA GAGAAGACTA A
 
Protein sequence
MNYTTQMDAA KKGIITKEMQ VVSEKEGINI ETLMNLMAEG KIVIPANKNH KSISAEGVGQ 
GLRTKINVNL GISKDCANIE LELEKVKKAI DMNAESIMDL SNYGKTYDFR KRLVEVSTAM
IGTVPMYDVV GFYDKELKDI TVDEFFEVVE KHAKDGVDFV TIHAGLNRET IETFRRNKRL
TNIVSRGGSL LFAWMELNNR ENPFYEYFDR LLDICEKYDL TLSLGDACRP GSIADATDAV
QIKELITLGE LTKRAWERNV QVIIEGPGHM AMNEIEANVL LEKKLCHGAP FYVLGPIVTD
IAPGYDHITS AIGGAMAASY GADFLCYVTP AEHLRLPNLE DVREGIVATK IAAHAADIAK
GISGARDIDN KMSDARKRLD WDEMFSLAID SEKAIRYRKE STPEHKDSCT MCGKMCSIRN
MNKILEGKDI NLLRED