Gene CPF_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2136 
Symbol 
ID4201046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2375709 
End bp2378060 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content29% 
IMG OID638083001 
ProductU32 family peptidase 
Protein accessionYP_696564 
Protein GI110799078 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGAA TAGAGTTACT AGCACCGGCA GGAAGTATGG AAAGTTTAAT TGCTGCCATA 
AACAGAGGTG CTGATGCTAT ATATTTAGGA GGAAGCAAGT TTTCAGCAAG AGCTTATGCT
TCTAACTTTG ATAATGAAAC AATGGAAAAG GCTGTGGATT ATGCCCATAC TTATGGCGTT
AAGGTTTATG TTACAGTAAA TACTTTAATT AAGGAAAATG AGTTTCAAGA AGCAGTAGAT
TACATAGGCG TTTTATATAA AATAGGAGTT GATGCTTTAA TAATACAAGA TGTAGGATTA
GCTAAAAGAA TAAGAGAGGT ATATCCAGAT TTTGAATTAC ACGCTTCAAC TCAAATGAGC
ATACATAATG GAGAAGGAGC ACTTTTCTTT AGAGAAAATG GATTTTTTAG AATTGTTTTA
TCAAGAGAAC TTACTTTAGA TGAAATTAAA TATATTTCAA AGGACTTAGG AATAGAAACT
GAGATATTTG TTCATGGAGC CTTATGTGTT TGCTATTCAG GACAATGTTT AATGAGTAGC
TTAATAGGAG GAAGAAGTGG TAATAGAGGA AGATGTGCTC AAAGTTGTAG ACTTCCATAT
ACACTAATTA GAGAAAAAGA TAAGAAAGAG ACTAAAGGAT TCCTTTTAAG TCCTAAGGAT
ATTTGTACTA TTGAAAATGT AGAGGACTTG ATAAAAACAG GAACTAGCTC TTTAAAGGTT
GAAGGAAGAA TGAAGAGACC TGAGTATGTT GCTGGAGTTA TAGAATCTTA TAGAGAAGCT
ATAGATAATT CTTATAGAAA TATTAAAAAG TCTCAAGAGG GAAATAAGCT TAAACTTAAA
AAGTTATTTA ATAGAGAAGG TTTTTCTACA GCATACATGT ATAAAAATGT TGGGAAAGAT
ATGATGGCTT TTAAAACTCC AAGAAATACA GGAGTTTTAC TTGGAGAAGT TCTAAAAAAT
GGAGAAGTCC TTTTATTAGA TGATCTTTCA TTAAAAGATG GAATAAGAAC TAATGATGAT
GGATTTACTG TAAGTAAAAT ACTTATGGGA AATAGAGAGG TTACTGAAGC TAAAAAGGGG
GATAAGGTAA AAATATTCCC TAAAAAATAT AAAGCTAATG ATAAGCTTTA TAAAACTTCA
GATACTAAGC TTTTAAATGA ATTAAAGACC TCTTATGAAA ATCCATATGA GAGAAAGATA
AACTTAAAGG CTTATATGAA ATTTAAGGTT AATGAACCTA TGGAACTTAC TGTGGAGTAC
AATGGAAATT ACTTTACAAA GACTGGTGAT ATGGTTCAAG TTGCAGTAAA TAGACCTATG
GATAAGGAAA AAGTTGAAAA GAACTTAAAG AAATCTGGAG ATATTCCTTA TGAAATAAGT
GAAATAGAAT ACGTAACTTT TGAAGATGGT TTTGCAGCAG TTTCTTCTAT AAATAATTTA
AGAAGAGAAG TTTTAGAAGA GATAAGAAAT TCTGAGATTA AAAAATACAA AAGAGTTATA
GAAAAACAAA ATAATATAGA ACTTAATGAA AAGAATGGTG AAATGCCAGA GGTTTTAGTT
GTTGTAAATA CTAAGGAACA ATTAAGAGCT TCAATAGATT GTGGAATCAA AAATATTGCT
TTAGATATCT TTGGAAAAAA TCAAGGGCAA CTTAATAAGA ATGCTATAAA AGAATTCAAA
GTTGAAGGAA TTAATCTTTA TATAAAAGCA CCTACAATAA TAAAAGAAGA GTTTGAAACA
ATAAGTAATA TAATAGAAGA GAATCTTAAC TTAATAAAAG GTTTAGTTAC AGCTAATACT
GGAATAATAA ATAGATTTAA GGATAAAACA AATATTATAG GTGATTATAA ACTTAATATA
TTTAATTCTT ATGGTATGAA GTTTTATAAT GAACATATGA TGGGAAGCTG TTTAAGTTTA
GAATTAAATA AAAAAGAGTT AAATAAAATG CTTAAGAAGT ATTCTAAGGG AGCTCAAATC
TTTGTTTATG GAAGACCAGA GGTTATGGTT AGTGAATACT GTCCAATAGG AAGTACCTTT
GGTGGAAAAT GTACTTCAAA AAATTGTGAT AATCAATGTG TAAGTAGTAC ATTTACTTTA
AGAGATAGAA TGAATCAAGA TTTTGTTATA AGAACTGATA TATTCTGTAG ATCACACATA
TACAATACAG TACCTGTTAA CTTAATTCAG GAAATTGATG AAATAAAATC TTTAGGAGTA
AATTCTTTTA GATTAGACTT TGTAGATGAA AATTATGAAG AGGTTAGACA GGTTATTGAA
GCTCTAAATA AGGAAGAAGC TTTAAGATTA AAGGATTATA CTAAAGGTCA CTTTAAAAGA
GGCGTTGAGT AA
 
Protein sequence
MERIELLAPA GSMESLIAAI NRGADAIYLG GSKFSARAYA SNFDNETMEK AVDYAHTYGV 
KVYVTVNTLI KENEFQEAVD YIGVLYKIGV DALIIQDVGL AKRIREVYPD FELHASTQMS
IHNGEGALFF RENGFFRIVL SRELTLDEIK YISKDLGIET EIFVHGALCV CYSGQCLMSS
LIGGRSGNRG RCAQSCRLPY TLIREKDKKE TKGFLLSPKD ICTIENVEDL IKTGTSSLKV
EGRMKRPEYV AGVIESYREA IDNSYRNIKK SQEGNKLKLK KLFNREGFST AYMYKNVGKD
MMAFKTPRNT GVLLGEVLKN GEVLLLDDLS LKDGIRTNDD GFTVSKILMG NREVTEAKKG
DKVKIFPKKY KANDKLYKTS DTKLLNELKT SYENPYERKI NLKAYMKFKV NEPMELTVEY
NGNYFTKTGD MVQVAVNRPM DKEKVEKNLK KSGDIPYEIS EIEYVTFEDG FAAVSSINNL
RREVLEEIRN SEIKKYKRVI EKQNNIELNE KNGEMPEVLV VVNTKEQLRA SIDCGIKNIA
LDIFGKNQGQ LNKNAIKEFK VEGINLYIKA PTIIKEEFET ISNIIEENLN LIKGLVTANT
GIINRFKDKT NIIGDYKLNI FNSYGMKFYN EHMMGSCLSL ELNKKELNKM LKKYSKGAQI
FVYGRPEVMV SEYCPIGSTF GGKCTSKNCD NQCVSSTFTL RDRMNQDFVI RTDIFCRSHI
YNTVPVNLIQ EIDEIKSLGV NSFRLDFVDE NYEEVRQVIE ALNKEEALRL KDYTKGHFKR
GVE