Gene CPR_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2649 
Symbol 
ID4206208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2874915 
End bp2876774 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content32% 
IMG OID642567197 
ProductATP-dependent protease 
Protein accessionYP_699884 
Protein GI110803724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease
[TIGR02903] ATP-dependent protease, Lon family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTG AACTTGGAGT AGAATCTCAA GTTGAAGCAC TAAAAGATAT AATAAATAAT 
ATATTAGACG AAGGTGCATT TAGAGCGAGA GTTATAAGAT TTAAAGTACA AAATTATATA
AATTCAACTG ATCCTTATGA AAGACTTTAT GGATTAAGTA AAATTGTTTC TGAAGGAAAG
GGATTAAGTG AAGTTCCAAC TGAAGAAACT ATAAATGAAG CTTTAGAAGA TGTTTGTGCT
ATGATATCAG ATGCTATTGC TAGAAGATAT GTTCAAAATA AAATAGAAAA AGAAGTTGAA
CAATTCTTAA TGGAAAAGCA AGAAAAGTAT GTTGATGAAC TTAGAGTAAA CATAATGAAA
AAGAAAAAAG GTCCAGAAAA TGCTAAGACA GAGAAAAAGC TTGAGGAACT TGAAGAACTA
GATGAGAGAG TTCCAAATAA GAATATAATG TCTTTATTAA GACCTGATTC ATTTGATGAG
GTAGTTGGTC AAGAGAGAGC TGTTAAGTCA CTTCTTTCAA AACTAGCTTC ACCATATCCT
CAACATATAA TACTTTATGG ACCTCCAGGG GTTGGTAAAA CAACAGCTGC TAGAATTGCT
CTAGAAACAG CTAAGAAATT AAAATCAACT CCATTTGATG ATAGATCAAA ATTCATAGAG
GTTAATGGTA CAACTTTAAG ATGGGATCCA AGAGAAATCA CAAACCCACT TTTAGGTTCA
GTACATGATC CAATATATCA AGGTAGCAAA AGAGACTTAG CTGAAATAGG AGTTCCAGAA
CCAAAACCAG GTTTAGTTAC TGAAGCTCAC GGTGGTATAT TATTTATAGA TGAAATTGGA
GAATTAGATG AAATACTTCA AAATAAACTT TTAAAAGTTT TAGAAGATAA GAGAGTTGAA
TTCTCATCAT CTTACTATGA TCCAGATGAT GAAAATACAC CTAAATATAT AAAATATCTT
TTTGATAAGG GAGCTCCAGC AGATTTTGTT CTAATAGGAG CAACTACTAG AGAGCCGGGA
GAAATAAATC CTGCTTTACG TTCAAGATGT ACAGAGGTTT ATTTTGAACC ACTATCATCA
AGAGATATTG AAAAGATAGT ATTAAATGCA GCTAAGAAGC TTAATGTTAA GCTTGAAGAA
GGTTTAGAAA AGAAAATAGC TTCTTACACT ATAGAAGGTA GAAGAGCTGT AAATATATTA
GCAGATGCTT ATGGTCATGC TATTTATGGC TTAGAGGGAG AAGTTCCAGA AGACTTAGAA
ATAACTTCAA AGGATTTAAA TGAAGTTGTA AGCATAGGAA GATTTACTCC GTATGAAATA
CTAGAAAATT TAGAGGAAAA AGAAGTAGGT CATGTTTATG GACTTGGAGT TTCAGGATTC
TTAGGCTCAA CAATAGAGAT TGAAGCCACT GCTTTTAAAG CTAAGAAAAA GGGTGCTGGA
AAAATAAGAT TCAATGATAC TGCTGGTTCA ATGGCTAAGG ATTCTGTATT TAATGCTGCA
TCTGTAATAA AAAGGCTAAC TGATAAGGAT ATAAATGATT ATGATATACA TGTTAATGTA
ATTGGTGGAG GAAAGATAGA TGGACCATCT GCTGGGGCCG CTATTACAAT ATGTATAATG
AGTGCTTTAT TAGAAAAGCC AATAAGACAA GACTTAGCTA TAACTGGAGA GATTTCTTTA
AGAGGAAAGA TTAAGCCAGT TGGAGGTATA TTTGAAAAAA TATACGGAGC TAGAAGAAAG
GGAATTAAGT TAGTAACTGT TCCTAAAGAT AATGAAAATG AAATCCCTAA AGGATTAGAA
GATATAGAAG TTAAAGCTAT AAGTTCTATA GAAGAGCTTA TGGAAATAGC TTTTAATTAA
 
Protein sequence
MNSELGVESQ VEALKDIINN ILDEGAFRAR VIRFKVQNYI NSTDPYERLY GLSKIVSEGK 
GLSEVPTEET INEALEDVCA MISDAIARRY VQNKIEKEVE QFLMEKQEKY VDELRVNIMK
KKKGPENAKT EKKLEELEEL DERVPNKNIM SLLRPDSFDE VVGQERAVKS LLSKLASPYP
QHIILYGPPG VGKTTAARIA LETAKKLKST PFDDRSKFIE VNGTTLRWDP REITNPLLGS
VHDPIYQGSK RDLAEIGVPE PKPGLVTEAH GGILFIDEIG ELDEILQNKL LKVLEDKRVE
FSSSYYDPDD ENTPKYIKYL FDKGAPADFV LIGATTREPG EINPALRSRC TEVYFEPLSS
RDIEKIVLNA AKKLNVKLEE GLEKKIASYT IEGRRAVNIL ADAYGHAIYG LEGEVPEDLE
ITSKDLNEVV SIGRFTPYEI LENLEEKEVG HVYGLGVSGF LGSTIEIEAT AFKAKKKGAG
KIRFNDTAGS MAKDSVFNAA SVIKRLTDKD INDYDIHVNV IGGGKIDGPS AGAAITICIM
SALLEKPIRQ DLAITGEISL RGKIKPVGGI FEKIYGARRK GIKLVTVPKD NENEIPKGLE
DIEVKAISSI EELMEIAFN