Gene EcSMS35_2976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2976 
SymbolptsP 
ID6143111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3054820 
End bp3057066 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content54% 
IMG OID641617845 
Productfused phosphoenolpyruvate-protein phosphotransferase PtsP/GAF domain 
Protein accessionYP_001744997 
Protein GI170682936 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000163219 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00191611 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCTCACTC GCCTGCGCGA AATAGTCGAA AAGGTAGCCA GCGCACCACG CCTGAATGAG 
GCGTTAAATA TTCTGGTTAC CGACATCTGT CTTGCGATGG ATACCGAGGT CTGTTCGGTT
TACCTTGCCG ATCATGATCG ACGTTGTTAC TACCTGATGG CGACCCGGGG GCTGAAAAAA
CCACGCGGTC GCACTGTAAC GCTCGCGTTT GATGAAGGGA TCGTCGGCCT GGTTGGCAGG
CTGGCGGAAC CGATAAACCT TGCAGATGCG CAAAAGCACC CCAGCTTCAA ATACGTCCCC
TCCGTAAAAG AAGAACGTTT CCGCGCATTT TTAGGCGTAC CAATTATTCA ACGTCGCCAG
TTGCTTGGTG TACTGGTGGT ACAGCAACGA GAGTTGCGCC AGTATGACGA AAGTGAAGAA
TCCTTCCTGG TGACGCTTGC CACCCAGATG GCAGCTATTC TTTCTCAGTC GCAGTTGACT
GCCTTGTTTG GACAATATCG CCAGACGCGA ATCCGCGCAT TACCGGCAGC ACCTGGTGTG
GCGATTGCCG AAGGCTGGCA GGATGCCACG TTACCTTTAA TGGAACAGGT GTATCAGGCA
TCAACGCTGG ATCCAGCTCT GGAACGCGAA CGACTGACCG GGGCGCTGGA AGAAGCGGCA
AACGAGTTTC GCCGCTACAG CAAACGCTTT GCCGCCGGTG CACAAAAAGA AACGGCGGCT
ATTTTCGATC TTTACTCGCA CCTGCTTTCG GACACCCGGC TGCGTCGCGA ATTGTTTGCC
GAGGTTGATA AAGGCTCGGT GGCAGAGTGG GCGGTAAAAA CGGTCATTGA AAAATTTGCC
GAACAGTTTG CCGCGCTAAG CGATAACTAT CTCAAAGAGC GGGCTGGCGA TTTACGTGCG
CTGGGCCAGC GATTGCTGTT TCATCTTGAT GACGCTAATC AAGGGCCGAA CGCCTGGCCG
GAACGTTTCA TTCTGGTGGC AGATGAACTG TCAGCGACAA CGCTTGCTGA GCTGCCCCAG
GATCGCTTAG TCGGTGTTGT CGTGCGCGAT GGCGCTGCCA ACTCCCATGC TGCGATCATG
GTACGTGCGC TGGGGATACC TACCGTGATG GGCGCGGATA TTCAGCCTTC GGTACTGCAT
CGTCGGACGC TGATCGTCGA TGGTTATCGC GGTGAATTGC TGGTCGATCC GGAGCCGGTA
CTGCTGCAAG AATATCAGCG GCTAATTAGT GAAGAGATCG AGCTTAGCCG TCTGGCGGAA
GATGACGTCA ATTTACCCGC CCAGTTAAAA AGCGGCGAAC GCATTAAAGT CATGCTCAAT
GCCGGTTTAA GCCCGGAACA TGAAGAAAAA CTGGGCAGCC GTATTGATGG CATAGGACTT
TATCGCACTG AAATCCCATT CATGCTGCAA AGTGGTTTTC CGTCGGAAGA AGAACAGGTG
GCGCAGTATC AGGGGATGCT GCAAATGTTC AATGATAAAC CCGTCACCTT GCGTACGCTG
GATGTCGGAG CAGATAAGCA GCTGCCTTAC ATGCCGATCA GCGAAGAGAA TCCATGCCTG
GGTTGGCGTG GGATTCGCAT TACGCTCGAT CAGCCGGAGA TCTTCTTGAT CCAGGTGCGG
GCGATGCTGC GTGCTAATGC CGCTACGGGC AACCTGAATA TTCTGTTGCC GATGGTCACA
AGCCTCGATG AAGTTGACGA AGCACGCCGC CTGATTGAAC GTGCCGGACG TGAAGTCGAG
GAGATGATCG GTTACGAAAT TCCCAAACCA CGTATCGGCA TCATGCTGGA AGTGCCGTCA
ATGGTATTTA TGCTGCCGCA TCTGGCAAAG CGGGTCGATT TCATCTCTGT TGGCACCAAC
GATCTGACGC AATACATCCT GGCTGTTGAT CGCAACAATA CCCGGGTGGC GAACATTTAT
GACAGTCTTC ATCCTGCAAT GTTACGAGCT CTGGCGATGA TCGCCCGGGA AGCGGAAATA
CATGGAATCG ATCTCCGTTT GTGCGGTGAA ATGGCGGGCG ATCCCATGTG CGTGGCAATC
CTCATTGGGC TTGGGTATCG CCATCTGTCT ATGAACGGAC GTTCTGTAGC GCGCGTAAAA
TACCTGCTGC GGCGCATTGA TTTTGCTGAA GCAGAAAATC TTGCGCAGCG TAGTCTGGAA
GCGCAACTGG CGACCGAAGT TCGCCATCAG GTTGCAGCCT TTATGGAGCG TCGCGGCATG
GGCGGGCTGA TTCGCGGAGG GTTATAG
 
Protein sequence
MLTRLREIVE KVASAPRLNE ALNILVTDIC LAMDTEVCSV YLADHDRRCY YLMATRGLKK 
PRGRTVTLAF DEGIVGLVGR LAEPINLADA QKHPSFKYVP SVKEERFRAF LGVPIIQRRQ
LLGVLVVQQR ELRQYDESEE SFLVTLATQM AAILSQSQLT ALFGQYRQTR IRALPAAPGV
AIAEGWQDAT LPLMEQVYQA STLDPALERE RLTGALEEAA NEFRRYSKRF AAGAQKETAA
IFDLYSHLLS DTRLRRELFA EVDKGSVAEW AVKTVIEKFA EQFAALSDNY LKERAGDLRA
LGQRLLFHLD DANQGPNAWP ERFILVADEL SATTLAELPQ DRLVGVVVRD GAANSHAAIM
VRALGIPTVM GADIQPSVLH RRTLIVDGYR GELLVDPEPV LLQEYQRLIS EEIELSRLAE
DDVNLPAQLK SGERIKVMLN AGLSPEHEEK LGSRIDGIGL YRTEIPFMLQ SGFPSEEEQV
AQYQGMLQMF NDKPVTLRTL DVGADKQLPY MPISEENPCL GWRGIRITLD QPEIFLIQVR
AMLRANAATG NLNILLPMVT SLDEVDEARR LIERAGREVE EMIGYEIPKP RIGIMLEVPS
MVFMLPHLAK RVDFISVGTN DLTQYILAVD RNNTRVANIY DSLHPAMLRA LAMIAREAEI
HGIDLRLCGE MAGDPMCVAI LIGLGYRHLS MNGRSVARVK YLLRRIDFAE AENLAQRSLE
AQLATEVRHQ VAAFMERRGM GGLIRGGL