Gene ECH74115_4095 details

Gene Information

Locus tag: ECH74115_4095
Symbol: ptsP
ID: 6969849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3792985
End bp: 3795231
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 54%
IMG OID: 643387852
Product: fused phosphoenolpyruvate-protein phosphotransferase PtsP/GAF domain
Protein accession: YP_002272292
Protein GI: 209400141
COG category: [T] Signal transduction mechanisms
COG ID: [COG3605] Signal transduction protein containing GAF and PtsI domains
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
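
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3792985
end_bp = 3795231

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 2247 0 748
```

Under those assumptions the result matches the listed gene length of 2247 bp and protein length of 748 aa.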


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000034293
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0472053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACTC GCCTGCGCGA AATAGTCGAA AAGGTAGCCA GCGCACCACG CCTGAATGAG 
GCGTTAAATA TTCTGGTTAC CGACATCTGT CTTGCGATGG ATACCGAGGT CTGTTCGGTC
TACCTGGCCG ATCATGATCG ACGTTGTTAC TACCTGATGG CGACCCGGGG GCTGAAAAAA
CCACGCGGTC GCACTGTAAC GCTCGCCTTT GATGAAGGGA TCGTCGGCCT GGTTGGCAGG
CTGGCGGAAC CGATAAACCT TGCAGATGCG CAAAAGCACC CCAGCTTCAA ATACATCCCC
TCCGTAAAAG AAGAACGTTT CCGCGCGTTT TTAGGCGTAC CAATTATTCA ACGTCGCCAG
TTGCTTGGTG TACTGGTGGT ACAGCAACGA GAGTTGCGCC AGTATGACGA AAGTGAAGAA
TCCTTCCTGG TGACGCTTGC CACCCAGATG GCAGCGATTC TTTCTCAGTC GCAGTTGACT
GCCTTGTTTG GGCAATATCG CCAGACGCGA ATCCGCGCAT TACCGGCAGC ACCTGGTGTA
GCGATTGCCG AAGGCTGGCA GGATGCCACG TTACCTTTAA TGGAACAGGT GTATCAGGCA
TCAACGCTGG ATCCGGCTCT GGAACGCGAA CGACTGACCG GGGCGCTGGA AGAAGCGGCA
AACGAGTTTC GCCGCTACAG CAAACGCTTT GCCGCCGGTG CACAAAAAGA AACGGCGGCT
ATTTTCGATC TTTACTCGCA CCTGCTTTCG GACACCCGGC TGCGTCGCGA ATTGTTTGCC
GAGGTTGATA AAGGCTCGGT GGCAGAATGG GCGGTAAAAA CGGTCATTGA AAAATTTGCC
GAACAGTTTG CCGCGCTAAG CGATAACTAT CTCAAAGAGC GGGCTGGCGA TTTACGTGCG
CTGGGTCAGC GATTGCTGTT TCATCTTGAT GACGCTAATC AAGGGCCGAA CGCCTGGCCG
GAACGTTTCA TTCTGGTGGC AGATGAACTG TCAGCGACAA CTCTTGCTGA GCTGCCCCAG
GATCGCTTAG TCGGTGTTGT CGTGCGAGAT GGCGCAGCCA ACTCCCATGC TGCGATCATG
GTACGTGCGC TGGGGATCCC TACCGTGATG GGCGCGGATA TTCAGCCTTC GGTGCTGCAT
CGTCGGACGC TGATCGTCGA TGGCTATCGC GGTGAATTGC TGGTCGATCC GGAGCCGGTA
CTGCTGCAAG AATATCAGCG GCTAATTAGT GAAGAGATTG AGCTTAGCCG TCTGGCGGAA
GATGACGTCA ATTTACCCGC GCAGTTAAAA AGCGGTGAGC GTATAAAAGT CATGCTCAAT
GCTGGTTTAA GCCCGGAACA TGAAGAAAAA CTGGGCAGCC GTATTGATGG CATCGGTCTT
TATCGCACTG AAATCCCATT CATGCTGCAA AGTGGTTTTC CGTCGGAAGA AGAACAGGTG
GCGCAGTATC AGGGGATGCT GCAAATGTTC AATGATAAAC CCGTCACCTT GCGTACGCTG
GATGTCGGAG CAGATAAGCA GCTGCCTTAC ATGCCGATTA GCGAAGAGAA TCCATGCCTG
GGTTGGCGTG GGATTCGCAT TACGCTCGAT CAGCCGGAGA TCTTCTTGAT CCAGGTGCGG
GCGATGCTGC GTGCTAATGC CGCTACGGGC AACCTGAATA TTCTGTTGCC GATGGTCACA
AGCCTCGATG AAGTTGACGA AGCACGCCGC CTGATTGAAC GTGCCGGACG TGAAGTCGAG
GAGATGATCG GTTACGAAAT TCCCAAACCA CGTATCGGCA TCATGCTGGA AGTGCCGTCA
ATGGTATTTA TGCTGCCGCA TCTGGCAAAG CGGGTCGATT TCATCTCTGT TGGCACCAAC
GATCTGACTC AATACATTCT GGCCGTTGAT CGCAACAATA CCCGGGTGGC GAACATTTAT
GACAGTCTTC ATCCTGCAAT GTTACGAGCT CTGGCGATGA TCGCCCGGGA AGCGGAAATA
CATGGAATCG ATCTCCGTTT GTGCGGTGAA ATGGCGGGCG ATCCCATGTG CGTGGCAATC
CTCATTGGGC TTGGGTATCG CCATCTGTCT ATGAACGGAC GTTCTGTAGC GCGGGTAAAA
TACCTGCTGC GGCGCATTGA TTATGCCGAA GCAGAAAATC TTGCGCAGCG TAGTCTGGAA
GCGCAACTGG CGACCGAAGT TCGCCATCAG GTTGCAGCCT TTATGGAGCG TCGCGGCATG
GGCGGGCTGA TTCGCGGAGG GTTATAG
 
Protein sequence
MLTRLREIVE KVASAPRLNE ALNILVTDIC LAMDTEVCSV YLADHDRRCY YLMATRGLKK 
PRGRTVTLAF DEGIVGLVGR LAEPINLADA QKHPSFKYIP SVKEERFRAF LGVPIIQRRQ
LLGVLVVQQR ELRQYDESEE SFLVTLATQM AAILSQSQLT ALFGQYRQTR IRALPAAPGV
AIAEGWQDAT LPLMEQVYQA STLDPALERE RLTGALEEAA NEFRRYSKRF AAGAQKETAA
IFDLYSHLLS DTRLRRELFA EVDKGSVAEW AVKTVIEKFA EQFAALSDNY LKERAGDLRA
LGQRLLFHLD DANQGPNAWP ERFILVADEL SATTLAELPQ DRLVGVVVRD GAANSHAAIM
VRALGIPTVM GADIQPSVLH RRTLIVDGYR GELLVDPEPV LLQEYQRLIS EEIELSRLAE
DDVNLPAQLK SGERIKVMLN AGLSPEHEEK LGSRIDGIGL YRTEIPFMLQ SGFPSEEEQV
AQYQGMLQMF NDKPVTLRTL DVGADKQLPY MPISEENPCL GWRGIRITLD QPEIFLIQVR
AMLRANAATG NLNILLPMVT SLDEVDEARR LIERAGREVE EMIGYEIPKP RIGIMLEVPS
MVFMLPHLAK RVDFISVGTN DLTQYILAVD RNNTRVANIY DSLHPAMLRA LAMIAREAEI
HGIDLRLCGE MAGDPMCVAI LIGLGYRHLS MNGRSVARVK YLLRRIDYAE AENLAQRSLE
AQLATEVRHQ VAAFMERRGM GGLIRGGL
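
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The following is a minimal sketch, not the database's own pipeline. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code, so that only the set of permitted start codons differs. It ignores alternative start codons, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as formatted on the page.
first_blocks = "ATGCTCACTC GCCTGCGCGA AATAGTCGAA"
print(translate(first_blocks))   # MLTRLREIVE
print(gc_content("ATGCTCACTC"))  # 0.5
```

Passing the full gene sequence to `translate` and `gc_content` in the same way should let the result be compared against the listed protein sequence and the reported GC content of 54%.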