Gene CPR_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2023 
Symbol 
ID4204906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2228073 
End bp2230358 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content26% 
IMG OID642566573 
ProductATP-dependent protease 
Protein accessionYP_699332 
Protein GI110801955 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAG AGTTAACTCC AAAAGAAGTA ATTTATAATG AAGATTTTAC TAGTATAATA 
GGTGAGAAGG AATCCTATAG TAATGAATAT GAAGGAGTTT TTAGGAAAAT TGATGAAGCA
TTAAGTATAA ATAAAGAAGG ATTTAATGTT TATCTAATTG ATGAATTTTC AAAACAAAAG
CTTAAAGATT TAATATCACA TTTAGAAGAT AAAATGAAAA GTAGAGGCAA GCCTAAGGAT
ATATGTTATG TTACCTTAGA AGATATTAGA GTTCCTAAGG TTATATTTTT AGAAAATGGA
ATGGGAGAAA AGTTAGAGGA AACTTTAGAA TATCTAAAAT CTTTTTATTA TGATGAAATA
TATGATTTTT ATAATTCTTC AATAAATAAG GAAAAGGAAG AGATTATAAA TGATATACAA
AAAAAGAGAA ATTATTATAT AGGGGATTTA ATAAAGAGTG CAAAGGAAGA AGGGTTTGAT
TTAAAAGCAA CTTCTTCAGG CTTTGCATTT ATTCCTTTAG TAGATGGTGA AGCTATGACA
GAAGAGGAAT TTGATGAGCT AGAAGAGAAT AGTAAAGGGG ATATTTCTAT AAAAGTTGAT
AAGTTAAAAG AAGGAGCGGA AAGTGTTCTT GAAGAATTAA AAAATATAGA GTTAGATTCT
ATTGAAAAAT TAAAAGAATT GCTTAGAACC TATTTAGAAA ATGAATCAGC AAGTGTTAAA
GGAAAAATAA AGGATAATTT CAAAAATGAG AATGAAGCTT ATAATTATCT TATAGATGTT
TGTGAAAGTT TAGAAAAGCT ATTAGTTGAT AATTACACAA TAAATTTTGA TGATGATGAG
GAAAAGATAA ATGAAATTTT TTCAAAGTAT GTTTGTAATA TCATAAAAAA TAGTAAGGAT
CAAAAGGCGC CTAAGGTAAT TTTTGAAGAA GACCCAAGCT TAAATAATCT TTTAGGAACT
ATAGAGTATG AAAATCATAA TGGAGTATAT TCAACTGATG TAAAACTTAT AAAATCAGGT
TCATTACTTG AAGCAAATGA AGGATGCATA ATACTTAGGT TAAGTTCTTT GGTGAATAAT
ACAAATAGCT ATTATTATTT GAGAAGAACT TTACTTCATG GAAAGATAAA TTATGATTTT
AACAGAGGAT ATTTAGAAGT ACTTTCATTA AATGGGTTAA ATCCAGATCC AATACCTATT
AAGGTAAATG TAATTTTAAT AGGAGATTTT GAAAGTTATG ATATTTTATA TAATAATGAT
GAAGATTTTA AGAAAATATT TAGGGTTAGA GCTGAATTTT CAAGTTTAAT AGGAATAGAT
GAAAATAAAA GATCTTTATT AGATACTATT GATAAAATAA TAATAGATAA TGAGTTAATA
AAGATTTCTA CTTCTGGAAT TAATGCTATT GGAAAACAAT TAGCCAGAAA AGCTGGAACA
AGGAAGAAGA TTATTTGGGA TATTGATGAA ATAGAGAGAA TATTACTTCT TGGAAATGAA
GAGGCAAAAA ATAATAATAA ATCATTGATA GATAAGGATT CAATAGAAGA AGTTATAAAT
CAATGCAGTG AAATTGAAAA AGATTATTTA GAAATGTATG AAGAAAAGAA GATAATTTTA
GATATAGAAG ATAGAATTAT TGGAAGTGTA AATGGATTGT CTGTTATAGA TTTTGGTTAT
ATGAGTTTTG GAAAGCCTAT TAGAATAACT TGTACTTGTT ATAAAGGTAG CGGAAAAATT
ATGGATGCAC AAAGAGAAAG CAATTTAAGT GGAAACATTC ATAATAAATC TTTAAATATT
CTAAGAGGCT TTTTAAGTAG CTTTTTTAAT TCTTATGAAG CCTTACCTGT AGATTTTCAA
CTAAGTTTTG AGCAGCTCTA TGGAAAGATA GAAGGGGATA GTGCTTCTGT GGCAGAAGTA
ATTGCTATGA TTTCTTCTTT AAGTAAAATA CCTGTGGATC AAAGTATTGC AGTAACAGGT
TCATTAAATC AATTTGGACA GGTACAACCA ATAGGTGGAG TAAATGAAAA AATAGAAGGA
TTTTTCAATG TATGCAAGAA AATAGACACT TATATTGGAA AAGCTGTTTT AATACCAGAA
AGCAATAAAG ATGAGCTTAT ATTAAATAGT GAAATAGAAG AGGCTGTGAG AAAGGGTGAA
TTTAAGATAT ATCTTATGAA AGATATAAAT GAAGCTCTTA GTACATTGCT TCTAAATAAC
ACTATGTCTC TTGAAGATAT AGAAAATAAA ATTAGAGAAG AAATTAAAAA ATTTAATGAT
GATTAA
 
Protein sequence
MRRELTPKEV IYNEDFTSII GEKESYSNEY EGVFRKIDEA LSINKEGFNV YLIDEFSKQK 
LKDLISHLED KMKSRGKPKD ICYVTLEDIR VPKVIFLENG MGEKLEETLE YLKSFYYDEI
YDFYNSSINK EKEEIINDIQ KKRNYYIGDL IKSAKEEGFD LKATSSGFAF IPLVDGEAMT
EEEFDELEEN SKGDISIKVD KLKEGAESVL EELKNIELDS IEKLKELLRT YLENESASVK
GKIKDNFKNE NEAYNYLIDV CESLEKLLVD NYTINFDDDE EKINEIFSKY VCNIIKNSKD
QKAPKVIFEE DPSLNNLLGT IEYENHNGVY STDVKLIKSG SLLEANEGCI ILRLSSLVNN
TNSYYYLRRT LLHGKINYDF NRGYLEVLSL NGLNPDPIPI KVNVILIGDF ESYDILYNND
EDFKKIFRVR AEFSSLIGID ENKRSLLDTI DKIIIDNELI KISTSGINAI GKQLARKAGT
RKKIIWDIDE IERILLLGNE EAKNNNKSLI DKDSIEEVIN QCSEIEKDYL EMYEEKKIIL
DIEDRIIGSV NGLSVIDFGY MSFGKPIRIT CTCYKGSGKI MDAQRESNLS GNIHNKSLNI
LRGFLSSFFN SYEALPVDFQ LSFEQLYGKI EGDSASVAEV IAMISSLSKI PVDQSIAVTG
SLNQFGQVQP IGGVNEKIEG FFNVCKKIDT YIGKAVLIPE SNKDELILNS EIEEAVRKGE
FKIYLMKDIN EALSTLLLNN TMSLEDIENK IREEIKKFND D