Gene ECH74115_0900 details

Gene Information

Locus tag: ECH74115_0900
Symbol: clpP
ID: 6967957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 911487
End bp: 913445
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 51%
IMG OID: 643384922
Product: Clp protease domain protein
Protein accession: YP_002269422
Protein GI: 209397527
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0740] Protease subunit of ATP-dependent Clp proteases
TIGRFAM ID: [TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP
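
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and assumes the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 911487
end_bp = 913445
gene_length_bp = 1959
protein_length_aa = 652

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions, the start and end positions, the gene length, and the protein length all agree with one another.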


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCTG GTCACCAGAG TGACGCGGAT ATTTATATTT ATGACGAGAT TGGTTTCTGG 
GGTGTTACAG CGAAGCAGTT TATCAGTGAT CTGAATGCAC TGGGCGATAT CACCCACATT
AATCTCAATA TCAATTCACC GGGTGGCGAT GTCTTTGAAG GCATCGCCAT TTTTAATGCA
CTGAAAACAC ATGGTGCGTC CATTACCGTT TATGTCGACG GTGTGGCGGC GTCAATGGCG
TCGGTCATTG CGATGGTGGG AAAACCGGTC ATTATGCCGG AAAACTCCTT CATGATGATT
CATAAACCAT TTGGCTTTAC GGGCGGTGAT GCGGAGGACA TGCGCACCTA TGCCGACCTG
CTCGATAAAG TTGAGGCGGT TCTGTTACCC GCTTATGCAC AGAAAACCGG GAAAACCACC
GATGAAATTG CTGCCATGCT GGCGGATGAG ACCTGGATGT CCGGTGCCGA ATGTCTGGCA
CATGGATTTG CTGATCAGGT GACGCCAGCC GTTAAGGCAA TGGCATGTAT TCAGTCAAAA
CGTACAGAGG AATTTAAAAA GATGCCGGAA TCCATTCGAA ACATGATTAC TCCGCCACGC
AACAGTGCTC CACGCGTACA GGATGATGGA CCTGCAGCCT CCCGGACGCC AGTGCAGGCA
GCAGCACCTG TGGTGGATGA AAACAGTATC CGTGCGCAGG TACTGGCAGA GCAAAAAGCG
CGTGTAAACG GTATTAATGA TCTGTTTGCC ATGTTTGGCG GGCGTTATCA GACGCTGCAG
GCTCAGTGTC TTGCCGATCC TGAATGTTCG CTGGAGCAGG CCCGCGAAAA GCTGTTGAAC
GAGATGGGGC GCGAGTCCAC GCCATCTAAT AAAAATACCC CGGCTCATAT TTATGCCGGA
AACGGTAATT TTGTGGGGGA TGGGATCCGC CAGGCGCTGA TGGCGCGTGC CGGATTTGAA
AAAACCGAAC GTGATAATGT CTACAACGGG ATGACCCTGC GTGAATATGC CCGTATGTCA
CTGACTGAAC GGGGTATTGG GGTTTCCAGT TATAACCCGA TGCAGATGGT CGGTGCGGCG
TTCACACACA GTACGTCTGA CTTCGGTAAT ATTCTGCTGG ATGTTGCGAA CAAAGCCATT
CTGCAGGGCT GGGAAGATGC CCCTGAAACC TATGAACAGT GGACGCGGAA AGGTCAGTTG
TCTGATTTTA AAATTGCCCA TCGTGTGGGT ATGGGGGGCT TCAGTGCTCT GCGTCAGGTG
CGTGAAGGGG CGGAATATAA ATACGTCACC ACCGGAGATA AACAGGCCAC TATTGCACTG
GCGACCTATG GCGAGCTGTT CAGTATCACC CGTCAGGCCA TTATCAATGA TGATCTGAAT
ATGCTGACCG ATGTCCCGAT GAAACTGGGC CGTGCGGCGA AATCCACTAT TGCCGATCTG
GTTTATGCCA TTCTGACGTC TAACCCGAAA ATCTCCACAG ATAATGTAAG TCTGTTCGAT
AAAGCGAAAC ATGCAAACGT ACTGGAGAGC GCTGCAATGG ACGTGGCATC GCTGGATAAA
GCCCGCCAGT TGATGCGCGT TCAGAAAGAG GGGGAGCGTC ATCTGAATAT TCGTCCTGCG
TTCGTACTGG TACCGACGGC GATGGAGTCT GTTGCTAACC AGGTCATTCG CTCCTCAAGT
GTCAAGGGGG CTGACATTAA CGCCGGTATT ATTAACCCGG TGAAAGATTT TGCGACCGTT
ATTGCAGAGC CTCGTCTTGA TGATAACAGC CAGACCACCT TCTACCTGGC TGCGTCAAAA
GGCTCCGATA CGATTGAAGT GGCTTATCTC AACGGTGTGG ATACGCCATA TATTGATCAG
ATGGAGGGCT TCAGTGTGGA TGGCGTGACA ACGAAAGTGC GTATTGACGC CGGTGTCGCG
CCAGTTGATC ACCGCGGTCT GGTGAAATGT ACGGCGTAA
 
Protein sequence
MQAGHQSDAD IYIYDEIGFW GVTAKQFISD LNALGDITHI NLNINSPGGD VFEGIAIFNA 
LKTHGASITV YVDGVAASMA SVIAMVGKPV IMPENSFMMI HKPFGFTGGD AEDMRTYADL
LDKVEAVLLP AYAQKTGKTT DEIAAMLADE TWMSGAECLA HGFADQVTPA VKAMACIQSK
RTEEFKKMPE SIRNMITPPR NSAPRVQDDG PAASRTPVQA AAPVVDENSI RAQVLAEQKA
RVNGINDLFA MFGGRYQTLQ AQCLADPECS LEQAREKLLN EMGRESTPSN KNTPAHIYAG
NGNFVGDGIR QALMARAGFE KTERDNVYNG MTLREYARMS LTERGIGVSS YNPMQMVGAA
FTHSTSDFGN ILLDVANKAI LQGWEDAPET YEQWTRKGQL SDFKIAHRVG MGGFSALRQV
REGAEYKYVT TGDKQATIAL ATYGELFSIT RQAIINDDLN MLTDVPMKLG RAAKSTIADL
VYAILTSNPK ISTDNVSLFD KAKHANVLES AAMDVASLDK ARQLMRVQKE GERHLNIRPA
FVLVPTAMES VANQVIRSSS VKGADINAGI INPVKDFATV IAEPRLDDNS QTTFYLAASK
GSDTIEVAYL NGVDTPYIDQ MEGFSVDGVT TKVRIDAGVA PVDHRGLVKC TA
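
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the Python below builds a codon table and translates a space-separated DNA string, stopping at the first stop codon. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tooling.

```python
# Standard codon assignments, enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGCAGGCTG GTCACCAGAG TGACGCGGAT"))  # MQAGHQSDAD
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.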