Gene ECH74115_3202 details


Gene Information

Locus tag: ECH74115_3202
Symbol: clpP
ID: 6971002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2952162
End bp: 2954117
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 55%
IMG OID: 643387021
Product: Clp protease domain protein
Protein accession: YP_002271488
Protein GI: 209400945
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0740] Protease subunit of ATP-dependent Clp proteases
TIGRFAM ID: [TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP
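
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the record states neither convention explicitly, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 2952162
end_bp = 2954117
gene_length_bp = 1956
protein_length_aa = 651

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# so the protein has one residue fewer than the codon count.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks print True: 2954117 - 2952162 + 1 = 1956, and 1956 / 3 - 1 = 651.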


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.790281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.00227439
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGCTG GGGGGCCGGG TGACGCGGAT ATTTATATTT ATGACGAGAT TGGTTTCTGG 
GGAGTTACCG CGAAGCAGTT TGTCAGCGAA CTGAATGCAC TGGGTGATAT CACCCACATT
AATCTCCATA TCAATTCACC GGGTGGCGAT GTCTTTGAAG GCATCGCCAT TTTTAATGCC
CTGAAAAATC AGGGGGCGAC CATTACCGTG TATGTGGATG GCGTTGCCGC CTCGATGGCA
TCTGTGATTG CGATGGCCGG TGATACGGTC ATTATGCCGG AAAATGCCTT CATGATGATC
CATAAGCCAT GGGGATTCAG TGGCGGGGAT GCTGAGGATA TGCGCAGTTA TGCCGATTTG
CTGGATAAAG TCGAATCGGT ACTGTTGCCA GCCTATGCGC AGAAAACCGG AAAAACCACC
GATGAAATTG CCGCCATGCT GGCGGATGAA ACCTGGATGT CCGGTGCCGA ATGTCTGGCA
CACGGATTTG CTGACCAGGT GACACCCGCT GTTGAGGCAA TGGCATGTAT TCAGTCAAAA
CGTACAGAGG AATTTAAAAA GATGCCGGAA TCCATCCGAA ACATGATTAC TCCGCCACGC
AACAGTGCCC CGCGTGATAC CACAGTGACA ATCCCTGCAC CGGCGGTAAC AGAACCATCA
CCGGTACCGG CAGTGTCTGA TGAGGCGACC ATTCGCGCCC GCGTTATGGC AGAACAGAAA
GCCCGCATGT CAGGCATTAA CGATCTGTTT GCCATGTTCG GTGGTCGCTA TCAGACGCTT
CAGGCACAGT GCGTGGCTGA TCCTGACTGT TCGCTGGAAA TGGCCCGTGA ACGACTGCTG
AATGAAATGG GCAAGGAGTC CTCGCCGACC AACAAAAATA CACCGGCCCA TATTTATGCC
GGAAACGGCA ATTTTGTGGG GGACGGGATC CGCCAGGCGA TGCTGGCCCG TGCCGGATTT
GAAAATGTCG AGAAGGATAA CGCCTATAAC GGGATGACCC TGCGTGAATG GGCTCGCATG
TCACTGACGG AGCGCGGTAT TGGGGTGGCC AGTTATAACC CCATGCAGAT GGTCGGGCTG
GCGCTGACGC ACAGCACCTC TGATTTTGGC AATATTCTGC TGGATGTGTC GAACAAGGGG
CTGATCCAGG GCTGGGAGGA ATCAGAAGAA ACCTTCCAGA AGTGGACCCG TAAGGGACGC
CTGTCAGACT TCAAAACAGC GTATCGCGTG GGGATGGGCG GTTTTGGTTC TCTGCGCCAG
GTTCGTGAGG GGGCGGAGTA TAAATACATC ACCACCTCAG ATCGCAAGGA GACCATTGCA
CTGGCCACTT ACGGGGAGAT TTTCTCCATC ACCCGCCAGG CCATTATCAA TGATGATCTG
AATATGCTGG TGGACGTGCC GATGAAGATG GGGCGTGCGG CGAAGGCAAC GATTGGTGAC
CTGGTCTACA AGGTGCTGAC GGATAACCCG AAACTGTCCG ACGGTAAGGC GCTGTTCCAT
GCCGATCACA AAAATATTGC CACCGGGGGG ATCTCCGTTT CCGGACTGGA TGCGGCCCGT
CAGATGATGC GCCTGCAGAA AGAAGGCGAT CGTGCCCTGA ATATCCGTCC GGCCTTTATG
CTGGTACCGG TGGCACTGGA GACGGTGGCG AACCAGACCA TCAAATCGGC CAGTGTGAAA
GGGGCGGATG CAAACGCCGG TGTCATTAAC CCTATCCAGA ACTTTGCTGA GGTGATTGCA
GAAGCGCGTC TTGATGCGGC AGACCCGAAA ACCTGGTATC TGGCGGCGGC ACAGGGCACT
GACACCATTG AAGTGGCCTG GCTGGATGGT GTGGACACGC CATACATTGA TCAGCAGGAA
GGTTTCACCA CTGACGGCAT TGCCACAAAA ATCCGTATTG ATGCCGGAGT GGCACCACTT
GACTGGCGCG GGCTGGTGCG TTCGTCGGTG GCCTGA
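
The gene sequence is printed in blocks of ten bases, so it has to be stripped of whitespace before any computation such as a GC-content estimate. The following is a minimal Python sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first printed line is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence only; paste the full
# sequence here to estimate the GC content of the whole gene.
first_line = "ATGCAGGCTG GGGGGCCGGG TGACGCGGAT ATTTATATTT ATGACGAGAT TGGTTTCTGG"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_percent(seq), 1))
```

Running the same functions over all lines of the gene sequence gives a value that can be compared with the GC content field in the Gene Information section.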
 
Protein sequence
MQAGGPGDAD IYIYDEIGFW GVTAKQFVSE LNALGDITHI NLHINSPGGD VFEGIAIFNA 
LKNQGATITV YVDGVAASMA SVIAMAGDTV IMPENAFMMI HKPWGFSGGD AEDMRSYADL
LDKVESVLLP AYAQKTGKTT DEIAAMLADE TWMSGAECLA HGFADQVTPA VEAMACIQSK
RTEEFKKMPE SIRNMITPPR NSAPRDTTVT IPAPAVTEPS PVPAVSDEAT IRARVMAEQK
ARMSGINDLF AMFGGRYQTL QAQCVADPDC SLEMARERLL NEMGKESSPT NKNTPAHIYA
GNGNFVGDGI RQAMLARAGF ENVEKDNAYN GMTLREWARM SLTERGIGVA SYNPMQMVGL
ALTHSTSDFG NILLDVSNKG LIQGWEESEE TFQKWTRKGR LSDFKTAYRV GMGGFGSLRQ
VREGAEYKYI TTSDRKETIA LATYGEIFSI TRQAIINDDL NMLVDVPMKM GRAAKATIGD
LVYKVLTDNP KLSDGKALFH ADHKNIATGG ISVSGLDAAR QMMRLQKEGD RALNIRPAFM
LVPVALETVA NQTIKSASVK GADANAGVIN PIQNFAEVIA EARLDAADPK TWYLAAAQGT
DTIEVAWLDG VDTPYIDQQE GFTTDGIATK IRIDAGVAPL DWRGLVRSSV A
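
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation using only the standard library. The function name `translate` is hypothetical, and the amino-acid string is the codon assignment for NCBI translation table 11 written in TCAG base order. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCAGGCTG GGGGGCCGGG TGACGCGGAT"))  # MQAGGPGDAD
# Last two codons of the gene sequence (alanine, then the TGA stop).
print(translate("GCCTGA"))  # A
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.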