Gene CPF_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2971 
Symbol 
ID4201430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3234588 
End bp3236447 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content32% 
IMG OID638083838 
ProductATP-dependent protease 
Protein accessionYP_697325 
Protein GI110800560 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease
[TIGR02903] ATP-dependent protease, Lon family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTG AACTTGGAGT AGAATCTCAA GTTGAAGCAC TAAAAGATAT AATAAATAAT 
ATATTAGACG AAGGTGCATT TAGAGCGAGA GTTATAAGAT TTAAAGTACA AAATTATATA
AATTCAACTG ATCCTTATGA AAGACTTTAT GGATTAAGTA AAATTGTTTC TGAAGGAAAG
GGTTTAAGTG AAGTTCCAAC TGAAGAAACT ATAAATGAAG CTTTAGAAGA TGTTTGTGCT
ATGATATCAG ATGCTATTGC TAGAAGATAT GTCCAAAATA AAATAGAAAA AGAAGTTGAA
CAATTCTTAA TGGAAAAGCA AGAAAAGTAT GTTGATGAAC TTAGAGTAAA CATAATGAAA
AAGAAAAAAG GTCCAGAAAA TGCTAAGACA GAGAAAAAGC TTGAGGAACT TGAAGAACTA
GATGAAAGAG TTCCAAATAA GAATATAATG TCTTTATTAA GACCTGATTC ATTTGATGAG
GTAGTTGGTC AAGAGAGAGC TGTTAAGTCA CTTCTTTCAA AACTAGCTTC ACCATATCCT
CAACATATAA TACTTTATGG ACCTCCAGGG GTTGGTAAAA CAACAGCTGC TAGAATTGCT
TTAGAAACAG CTAAGAAATT AAAATCAACT CCATTTGATG ATAGATCAAA ATTCATAGAG
GTTAATGGTA CAACTTTAAG ATGGGATCCA AGAGAAATCA CAAACCCACT TTTAGGTTCA
GTACATGATC CAATATATCA AGGTAGCAAA AGAGACTTAG CTGAAATAGG AGTTCCAGAA
CCAAAACCAG GTTTAGTTAC TGAAGCTCAT GGTGGTATAT TATTCATAGA TGAAATTGGA
GAATTAGATG AAATACTTCA AAATAAACTT TTAAAAGTTT TAGAAGATAA GAGAGTTGAA
TTCTCATCAT CTTACTATGA TCCAGATGAT GAAAATACAC CTAAATATAT AAAATATCTT
TTTGATAAGG GAGCTCCAGC AGACTTTGTT CTAATAGGAG CAACTACTAG AGAACCAGGA
GAAATCAATC CTGCTTTACG TTCAAGATGT ACAGAGGTTT ATTTTGAACC ACTATCATCA
AGAGATATTG AAAAGATAGT ATTAAATGCA GCTAAGAAGC TTAATGTTAA GCTTGAAGAA
GGTTTAGAAA AGAAAATAGC TTCTTATACT ATAGAAGGTA GAAGAGCTGT AAATATATTA
GCAGATGCTT ATGGTCATGC CATTTATGGC TTAGAGGGAG AAGTTCCAGA AGACTTAGAA
ATAACTTCAA AGGATTTAAA TGAAGTTGTA AGCATAGGAA GATTTACTCC GTATGAAATA
CTAGAAAATT TAGAGGAAAA AGAAGTAGGT CATGTTTATG GACTTGGAGT TTCAGGATTC
TTAGGCTCAA CAATAGAGAT TGAAGCCACT GCTTTTAAAG CTAAGAAAAA GGGTGCTGGA
AAAATAAGAT TCAATGATAC TGCTGGTTCA ATGGCTAAGG ATTCTGTATT TAATGCTGCA
TCTGTAATAA AAAGGCTAAC TGATAAGGAT ATAAATGACT ATGATATACA TGTTAACGTA
ATTGGTGGAG GAAAGATAGA TGGACCATCT GCTGGAGCTG CCATTACCAT ATGTATAATG
AGTGCTTTAT TAGAAAAGCC AATAAGACAA GACTTAGCTA TAACTGGAGA GATTTCTTTA
AGAGGAAAGA TTAAACCAGT TGGAGGTATA TTTGAAAAAA TATACGGGGC TAGAAGAAAG
GGAATTAAGT TAGTAACTGT TCCTAAAGAT AATGAAAATG AAATCCCTAA AGGATTAGAA
GATATAGAAG TTAAAGCTAT AAGTTCTATA GAAGAGCTTA TGGAAATTGC TTTTAATTAA
 
Protein sequence
MNSELGVESQ VEALKDIINN ILDEGAFRAR VIRFKVQNYI NSTDPYERLY GLSKIVSEGK 
GLSEVPTEET INEALEDVCA MISDAIARRY VQNKIEKEVE QFLMEKQEKY VDELRVNIMK
KKKGPENAKT EKKLEELEEL DERVPNKNIM SLLRPDSFDE VVGQERAVKS LLSKLASPYP
QHIILYGPPG VGKTTAARIA LETAKKLKST PFDDRSKFIE VNGTTLRWDP REITNPLLGS
VHDPIYQGSK RDLAEIGVPE PKPGLVTEAH GGILFIDEIG ELDEILQNKL LKVLEDKRVE
FSSSYYDPDD ENTPKYIKYL FDKGAPADFV LIGATTREPG EINPALRSRC TEVYFEPLSS
RDIEKIVLNA AKKLNVKLEE GLEKKIASYT IEGRRAVNIL ADAYGHAIYG LEGEVPEDLE
ITSKDLNEVV SIGRFTPYEI LENLEEKEVG HVYGLGVSGF LGSTIEIEAT AFKAKKKGAG
KIRFNDTAGS MAKDSVFNAA SVIKRLTDKD INDYDIHVNV IGGGKIDGPS AGAAITICIM
SALLEKPIRQ DLAITGEISL RGKIKPVGGI FEKIYGARRK GIKLVTVPKD NENEIPKGLE
DIEVKAISSI EELMEIAFN