Gene CPR_0591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0591 
Symbol 
ID4204235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp704680 
End bp706401 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content30% 
IMG OID642565151 
Productcell wall binding repeat-containing protein/zinc carboxypeptidase family protein 
Protein accessionYP_697918 
Protein GI110802994 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000159659 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCTT CAAAACGAAA AGTTTTAAAA TATGTTATAT TAGGTAGTAT TTTAAGTGCA 
CAATTTTTAA CATTTAATAG TATTAAAGCT AATGAACTAA AGAAAATAAA TAACTCGGAT
AATATTAATG ACATAGTGTT AAAAAGTGAT AATTTAAATG AGAAAAAACA TATTGGATGG
ATAGAGCAAG ATGGAAAATG GTATTATATT AATTCAGATG GAAGTAAACA TACTGGTTGG
TTAAATTTAA ATAATACATA TTATTATTTA GATCCATTTA ATGGAGAAAT GCAAACTGGT
ATGAAAGAAA TCAATGGATA TAAATATTAC TTAGATAATA GTGGAGCTAT ACAAACCGGA
TGGGTTAAAT ATAATGGACA ATATTATTTC TTTGGATCAG ATGGAGCAAT GAGAACTGGC
TGGATAAATG ATGGATGGAC AGATTATTAC CTAAAGAAAG ATGGGACAAT CTACAAAGGT
TGGTTAGATG ATGGTCTTAA TAAATATTTT ATGGACGAAA ATGGTCAAAT GAGAAGAGGT
TGGGTTAAAT ATAATGGAGA ATATTATTTC TTTGGATCAG ATGGAGCGAT GAGAACTGGC
TGGATAAATG ATGGATGGAC AGATTATTAC CTAAAGAAAG ATGGGACAAT CTACAAAGGT
TGGTTAGATG ATGGTCTTAA TAAATACTTC ATGGATGAAA ATGGTCAAAT GAGAAGAGGT
TGGGTTAAAT ATAATGGAGA ATATTATTTC TTTGGACAAG ACGGAGCAAT GAGAACTGGT
TGGATAAATG ATGGATATGC ATATTATTTT ATGAATAGTA ATGGACAAAT TACTAAAGGT
TGGTTTACTG AAGATAATAA TAAATATTAT TTAGGTAATG ATGGTGTAAT GAGAATAGGT
TGGCAAGAGA TAGATAATAA TTGGTATTAT TTTGAACAAT CTGGATTTAT GGCAAGGAAT
AAAATAATAG AAGATTGGTA TGTAAACAGT AATGGGGTTG GTGAAAGATA TATAGAAAAG
TCTACATATG GAAAAAGTGG ACGTGGTAGA GAACTTGAAT ACTATAAAGT TGGAAGTGGA
AAAAAAGTTT TATTTGCTGT TTTTGGAGTT CATGGTTATG AGGATGCTTG GAATGGAGAT
GCTCAAGAAT TATATTTAAT AGCTCAAAAA GCATATAATA ATTTAGTTAA ACAGTATGAA
GTGGGAACAA ATGCAGATGA CTTGAGTGAA TGGAGTGTTT ATTTAATTCC GTCAGCTAAT
CCTGATGGTA GGATTGATGG TTGGACAAAT TATGGTCCTG GAAGAACTAC TATAACAACA
AAAAATGACA TAAATAGATC TTTTCCAATA GGTTTTAGAC CTTACTATAG TGCTAGAAAT
TATACTGGAG ATAGTTATTT AGGATCTCCA GAGGCAAAAG CTTTATATAA TTTTATAAAT
AAGACTATGG ATGGAGCTAC TGAAAAGATA TTATTAGATG TTCATGGATG GGAAGATAAG
ACAATAGGAG ATAGTAATAT AGCAAGCTAT TTTGACAAAG AGTTTGGCTT TAGAAATATA
CCTAAATACC CGGGGGGATT TGTAATAACT TATGGAAATG CAATTGGTGC TAAATCAGTA
TTAGTCGAAT TACCATTTCC AAAATCACAT GAAGATATAT TAAGAAGAGA TTTTTCAGGA
AAATTTTCAA GAGCTTTATT AAATATACTG TTAAATAATT AA
 
Protein sequence
MQASKRKVLK YVILGSILSA QFLTFNSIKA NELKKINNSD NINDIVLKSD NLNEKKHIGW 
IEQDGKWYYI NSDGSKHTGW LNLNNTYYYL DPFNGEMQTG MKEINGYKYY LDNSGAIQTG
WVKYNGQYYF FGSDGAMRTG WINDGWTDYY LKKDGTIYKG WLDDGLNKYF MDENGQMRRG
WVKYNGEYYF FGSDGAMRTG WINDGWTDYY LKKDGTIYKG WLDDGLNKYF MDENGQMRRG
WVKYNGEYYF FGQDGAMRTG WINDGYAYYF MNSNGQITKG WFTEDNNKYY LGNDGVMRIG
WQEIDNNWYY FEQSGFMARN KIIEDWYVNS NGVGERYIEK STYGKSGRGR ELEYYKVGSG
KKVLFAVFGV HGYEDAWNGD AQELYLIAQK AYNNLVKQYE VGTNADDLSE WSVYLIPSAN
PDGRIDGWTN YGPGRTTITT KNDINRSFPI GFRPYYSARN YTGDSYLGSP EAKALYNFIN
KTMDGATEKI LLDVHGWEDK TIGDSNIASY FDKEFGFRNI PKYPGGFVIT YGNAIGAKSV
LVELPFPKSH EDILRRDFSG KFSRALLNIL LNN