Gene CPR_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0833 
SymbolcloSI 
ID4206086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp962323 
End bp963891 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content30% 
IMG OID642565392 
Productalpha-clostripain 
Protein accessionYP_698158 
Protein GI110803836 
COG category 
COG ID 
TIGRFAM ID[TIGR02806] clostripain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00458898 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAGA AAAAGTTATC ATTATTGATG GCTACTATTA CCATTGGTAG CGTTCTATTA 
GGAGGAGCTT CAACAGTAAG TGCTGCTCCA AGACAAAAGC ATAAAACTGT ATCAGAGAAA
ATTAAAGAGG CTGAAAAGAC TGAAGGTGAT AAAAAACTAA CGGTTATGGT ATATGCTGAT
TGTGATAATA ACCTAGAAGA GTACATACTT AATGATATTG AAGAAATGAA AGAGGGTTAT
AAAAATAATC CTAATCTAAA TATTGTAGTA TTAGTTGATA GAATTCCTGG TTATTCTAAT
GATTCAAAAG TGTTAGGTTC TAATTTTGAA GACACAAGAC TTTATAAAAT AGGTGAAAAT
TCTGCTGAAA GAATTAGTGG GAAAAGTGAA TTCCCAGAGA TAACTACTAC AAGTAATTAT
GAAGCTAACA TGGGAGATGC AAATACTTTA AAAAAATTCA TTAAATTCTG TAAAAAGAAC
TATGAAGCTG ATAAGTATAT GTTAATCATG TCTAACCATG GTGGTGGGGC TAAGGATGAT
AAAGATAGAG CTAGTACTGT AAATAAGGCA ATATGCTGGG ATGATAGTAA CAACAAAGAT
TGTCTTTATA CTGGAGAAAT TTCAGATGTT TTAACTAAGG ATGAATCTGT TGATGTTTTA
GTATTTGATG CTTGTTTAAT GGGAACTTCA GAGGTAGCTT ATCAATATAG ACCTAATAAT
GGAAGTTTTG AAGCTAAAAC ACTTGTAGCA TCAGCACCAG TTGTTTGGGG AAATGGATAT
CCTTATGATA AAATATTTTC TCGTTTAAAA TCTACTAAAG GTGACAATGG AGAAGTTGAT
TCTACTTTAG GAGGAAAGGA AAAAATATTT GAGCCATCAC TTGTTACAAA TAATGAACTT
GGTGCCTTAT TTGTAGAAGA ACAACGTGAT TCTGTAAATA GCTATGGAGT AACAGATCAA
CAATTAAGTT GCTATGATCT TTCAAAAATT GAAACGGTTA AAAAATCAGT TGATGCTTTA
GCTAGAAATT TAAGCAAAAA TAATAAAAAA GATGCTATTG AAAACTTAAG AGGTACAGGA
AAAAATGCAC CAACTATGCA CTATTTTAAG AATTATGATG AATATGAATG GATTGAGTAT
CCATACTTTG ATTTATATGA TTTATGTGAA AAAATTAGCT TAAGTAATGA ATTTGATGAA
ACAACTAAAA AATTATCAAA GAATGTTATG AAAAATGTTG ACCAATTAAT TTTATATTCA
TTTGCAGGTA ATGACTTTAA AGGCTTCAAA GAAGGAAAAA ATGGTATAAG TATTTTCCTT
CCTGATGGTA ATAGAAATTA TTATGATCAA TATTCAGGAC AAGTAATACC ACATTGGGCT
ATTCAAAGAT GGTACAATCC TCTAGACACT AATGCTTATA GATTAAGAAG TGGATATGGA
AAACTAGCAT GGTGTAAAGA TGGATTAGAT CCAAAAATAA ATAAAGTTGG TAACTGGTTT
GAACTTTTAG ATTCTTGGTT TGATAAAGAT AATACTTCAC TTGGTGGATA TAATAGATAT
AGATATTAA
 
Protein sequence
MFKKKLSLLM ATITIGSVLL GGASTVSAAP RQKHKTVSEK IKEAEKTEGD KKLTVMVYAD 
CDNNLEEYIL NDIEEMKEGY KNNPNLNIVV LVDRIPGYSN DSKVLGSNFE DTRLYKIGEN
SAERISGKSE FPEITTTSNY EANMGDANTL KKFIKFCKKN YEADKYMLIM SNHGGGAKDD
KDRASTVNKA ICWDDSNNKD CLYTGEISDV LTKDESVDVL VFDACLMGTS EVAYQYRPNN
GSFEAKTLVA SAPVVWGNGY PYDKIFSRLK STKGDNGEVD STLGGKEKIF EPSLVTNNEL
GALFVEEQRD SVNSYGVTDQ QLSCYDLSKI ETVKKSVDAL ARNLSKNNKK DAIENLRGTG
KNAPTMHYFK NYDEYEWIEY PYFDLYDLCE KISLSNEFDE TTKKLSKNVM KNVDQLILYS
FAGNDFKGFK EGKNGISIFL PDGNRNYYDQ YSGQVIPHWA IQRWYNPLDT NAYRLRSGYG
KLAWCKDGLD PKINKVGNWF ELLDSWFDKD NTSLGGYNRY RY