Gene CPF_2308 details


Gene Information

Locus tag: CPF_2308
Symbol:
ID: 4202347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2554974
End bp: 2557259
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 28%
IMG OID: 638083173
Product: ATP-dependent protease
Protein accession: YP_696731
Protein GI: 110800581
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID:
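
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2554974, 2557259)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length (2286 bp) and Protein Length (761 aa).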


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGAG AGTTAACTCC AAAAGAAGTA ATTTATAATG AAGATTTTAC TAGTGGGATA 
GGTGAGAAGG AATCCTATGG TAATGAATAT CAAGGGGTTT TTAAGAAAAT TGATGAAGCA
TTAAGCATAA ATAAAGAAGG ATTTAATGTT TACCTAATTG ATGAATTTTC AAAACAAAAG
CTTAAAGACA TAATGGTACA TCTAGAAGAT AAAATGAAAA GTAGAGGTAA GCCTAAGGAT
ATATGTTATG TTACCTTAGA AGATATTAGG GTTCCTAAGG TTATATTTCT AGAAAATGGA
ATGGGAGAAA GGTTAGATGA AACTTTAGAA TATCTAAAAT CTTTTTATTA TGATGAAATA
TATGCTTTTT ATAATTCTTC AATAAATAAA GAAAAGGAAG AGATTATAAA TGATATACAA
AAAAAGAGAA ATTATTATAT AGGGGACTTA ATAAAGAGTG CAAAGGAAGA AGGATTTGAT
TTAAAAGCAA CTTCTTCAGG CTTTGCATTT ATTCCTTTAG TAGATGGAGA AGCTATGACA
GAAGAGGAGT TTGATGATCT AGAAGAGAAT AGTAAAGGGG ATATTTCTAT AAAGGCTGAC
AAGTTAAAAG AGGGAGCAGA AGGAGTTCTT GAAGAATTAA AAAATATAGA GTTAGATTCC
ATTGAAAAAT TAAAGGGGAT ACTTAGAACC TATTTAGAAA ATGAATCAGC AAGTGTTAAA
GAAAAAATAA AGGATAATTT TAAAGATGAG AATGAAGCTT ATAACTATCT TATAGATGTT
TGTGAAAGTT TAGAAAACCT ATTGATTGAT AATTACACAA TAAACTTTGA TGATGATGAA
GAAAAGATAA ATGAGATTAT TTCAAAGTAT GTTTGTAATA TCATAAAAAA CAGCAAGGGG
CAAGAGGCTC CTAAGGTTAT TTTTGAAGAA GATCCAAGCT TAAATAATCT TTTAGGAACT
ATAGAGTATG AAAATCATAA TGGTGTGTAT TCAACTGATG TAAAACTTAT AAAATCAGGT
TCATTACTTG AAGCAAATGA AGGGTGCATA ATACTTAGGT TAAGTTCTTT AGTTAATAAT
GCAAATAGCT ATTATTATTT GAGAAGGGCG TTACTTCATG GAAAGATAAA TTATGATTTT
AATAGAGGAT ATTTAGAAGT GCTTTCATTA AATGGGTTAA ATCCAGATCC AATACCTATT
AAGGTAAATG TAATTTTAAT AGGAGATTTT GAAAGTTATG ATATTTTGTA TAACCATGAT
GAAGACTTCA AGAAAATATT TAGGATTAGA GCTGAATTTT CAAACTTAAT AGGAATAGAT
GAAAATAAAA AATCTTTGGT AGATCTTGTT GATAAAGTCA TAATAGATAA TGATTTAATA
AAGATCTCTA CTTCTGGAAT TAATGCTATT GGAAAACAAT TAGCAAGAAA AGCTGGAACA
AGGAAGAAAA TTCTTTGGGA TATTGATGAA ATAGAGAGAG TATTACTTCT TGGAAATGAA
GAGGCAAAAA ATAATAATAA GTCATTGATA GATAAGGATT CAATAGAAGC GGTTGTAAAT
CAATGCAGTG AAATTGAAAA AGATTATTTA GAAATGTATG AGGAAAAGAA GATAATTTTA
GATATAGAAG ATAGAATTAT TGGAAGTGTA AATGGATTGT CTGTTATAGA TTTTGGTTAT
ATGAGCTTTG GAAAGCCTAT GAGAATAACT TGTACTTGTT ATAAAGGTAG CGGAAAAATT
ATGGATGCAC AAAGAGAAAG CAATTTAAGT GGAAACATTC ATAATAAATC TTTAAATATT
CTAAGAGGCT TTTTAAGTAG CTTCTTTAAT TCTTATGAAG CTTTACCTGT TGACTTTCAA
CTAAGTTTTG AGCAGCTTTA TGGAAAAATA GAAGGAGATA GTGCTTCTGT GGCAGAAGTA
ATTGCTATGA TTTCTTCTTT AAGTAAAATA CCTGTAGATC AAAGCATTGC AGTAACAGGT
TCATTAAATC AATTTGGACA GGTGCAACCA ATAGGTGGAG TAAATGAAAA AATAGAAGGA
TTTTTCAATG TATGCAAAAA AATTGGCACT TATATTGGAA AAGCTGTTTT AATACCAGAA
AGCAATAAAG ATGAGCTTAT ATTAAATAGT GAAATAGAAG AGGCTGTGAG AAAGGGTGAA
TTTAAGATAT ATCTTATGAA AGATATAAAT GAAGCTCTTA GTACATTGCT TCTAAATGAC
ACTATGCCTC TTGAAGATAT AGGAAATAAA ATTAGTGAAG AAATTAAAAA ATTTAAGGAC
AATTAA
 
Protein sequence
MRRELTPKEV IYNEDFTSGI GEKESYGNEY QGVFKKIDEA LSINKEGFNV YLIDEFSKQK 
LKDIMVHLED KMKSRGKPKD ICYVTLEDIR VPKVIFLENG MGERLDETLE YLKSFYYDEI
YAFYNSSINK EKEEIINDIQ KKRNYYIGDL IKSAKEEGFD LKATSSGFAF IPLVDGEAMT
EEEFDDLEEN SKGDISIKAD KLKEGAEGVL EELKNIELDS IEKLKGILRT YLENESASVK
EKIKDNFKDE NEAYNYLIDV CESLENLLID NYTINFDDDE EKINEIISKY VCNIIKNSKG
QEAPKVIFEE DPSLNNLLGT IEYENHNGVY STDVKLIKSG SLLEANEGCI ILRLSSLVNN
ANSYYYLRRA LLHGKINYDF NRGYLEVLSL NGLNPDPIPI KVNVILIGDF ESYDILYNHD
EDFKKIFRIR AEFSNLIGID ENKKSLVDLV DKVIIDNDLI KISTSGINAI GKQLARKAGT
RKKILWDIDE IERVLLLGNE EAKNNNKSLI DKDSIEAVVN QCSEIEKDYL EMYEEKKIIL
DIEDRIIGSV NGLSVIDFGY MSFGKPMRIT CTCYKGSGKI MDAQRESNLS GNIHNKSLNI
LRGFLSSFFN SYEALPVDFQ LSFEQLYGKI EGDSASVAEV IAMISSLSKI PVDQSIAVTG
SLNQFGQVQP IGGVNEKIEG FFNVCKKIGT YIGKAVLIPE SNKDELILNS EIEEAVRKGE
FKIYLMKDIN EALSTLLLND TMPLEDIGNK ISEEIKKFKD N
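
Both sequences are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to clean a pasted sequence, compute its GC percentage, and translate it. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names are hypothetical, and the code assumes the sequence contains only A, C, G and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGAGAAGAG AGTTAACTCC AAAAGAAGTA")
print(translate(first_blocks))
```

Translating these first ten codons gives MRRELTPKEV, the first block of the protein sequence. Applying the same functions to the full gene sequence would allow the listed GC content and protein length to be checked in the same way.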