Gene ECH74115_3647 details


Gene Information

Locus tag: ECH74115_3647
Symbol: ptsI
ID: 6969889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3362574
End bp: 3364301
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 50%
IMG OID: 643387442
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_002271895
Protein GI: 209400537
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
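
The length fields in this record are consistent with its coordinates, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 3362574
END_BP = 3364301


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate interval."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1728
print(protein_length(length_bp))  # 575
```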


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000415722
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCAG GCATTTTAGC ATCCCCGGGT ATCGCTTTCG GTAAAGCTCT GCTTCTGAAA 
GAAGACGAAA TTGTCATTGA CCGGAAAAAA ATTTCTGCCG ACCAGGTTGA TCAGGAAGTT
GAACGTTTTC TGAGCGGTCG TGCCAAGGCA TCAGCCCAGC TGGAAACGAT CAAAACGAAA
GCTGGTGAAA CGTTCGGTGA AGAAAAAGAA GCCATCTTTG AAGGGCATAT TATGCTGCTC
GAAGATGAGG AGCTGGAGCA GGAAATCATA GCCCTGATTA AAGATAAACA CATGACAGCT
GACGCAGCTG CTCATGAAGT TATCGAAGGT CAGGCTTCTG CCCTGGAAGA GCTGGATGAT
GAATACCTGA AAGAACGTGC GGCTGACGTA CGTGATATCG GTAAGCGCCT GCTGCGCAAC
ATCCTGGGCC TGAAGATTAT CGACCTGAGC GCCATTCAGG ATGAAGTCAT TCTGGTTGCC
GCTGACCTGA CACCGTCCGA AACCGCACAG CTGAACCTGA AGAAGGTGCT GGGTTTCATC
ACCGACGCGG GTGGCCGTAC TTCCCACACC TCTATCATGG CGCGTTCTCT GGAACTGCCT
GCTATCGTGG GTACCGGTAG CGTCACCTCT CAGGTGAAAA ATGACGACTA TCTGATTCTG
GATGCCGTAA ATAATCAGGT TTACGTCAAT CCAACCAACG AAGTTATTGA TAAAATGCGC
GCTGTTCAGG AGCAAGTGGC TTCTGAAAAA GCAGAGCTTG CTAAACTGAA AGATCTGCCA
GCTATTACGC TGGACGGTCA CCAAGTAGAA GTATGCGCTA ACATTGGTAC GGTTCGTGAC
GTTGAAGGTG CAGAGCGTAA CGGCGCTGAA GGCGTTGGTC TGTATCGTAC TGAGTTCCTG
TTCATGGACC GCGACGCGCT GCCCACTGAA GAAGAACAGT TTGCTGCTTA CAAAGCAGTG
GCTGAAGCGT GTGGCTCGCA GGCGGTTATC GTTCGTACCA TGGACATCGG CGGCGACAAA
GAGCTGCCAT ACATGAACTT CCCGAAAGAA GAGAACCCGT TCCTCGGCTG GCGCGCTATC
CGTATCGCAA TGGATCGTAA AGAGATCCTG CGCGATCAGC TCCGCGCTAT CCTGCGTGCC
TCGGCTTTCG GTAAATTGCG CATTATGTTC CCGATGATCA TCTCTGTTGA AGAAGTGCGT
GCACTGCGCA AAGAGATCGA AATCTACAAA CAGGAACTGC GCGACGAAGG TAAAGCGTTT
GACGAGTCAA TTGAAATCGG CGTAATGGTG GAAACACCGG CTGCCGCAAC AATTGCACGT
CATTTAGCCA AAGAAGTTGA TTTCTTTAGT ATCGGCACCA ATGATTTAAC GCAGTACACT
CTGGCAGTTG ACCGTGGTAA TGATATGATT TCACACCTTT ACCAGCCAAT GTCACCGTCC
GTGCTGAACT TGATCAAGCA AGTTATTGAT GCTTCTCATG CTGAAGGCAA ATGGACTGGC
ATGTGTGGTG AGCTTGCTGG CGATGAACGT GCTACACTTC TGTTGCTGGG GATGGGTCTG
GACGAATTCT CTATGAGCGC CATTTCTATC CCGCGCATTA AGAAGATTAT CCGTAACACG
AACTTCGAAG ATGCGAAGGT GTTAGCAGAG CAGGCTCTTG CTCAACCGAC AACGGACGAG
TTAATGACGC TGGTTAACAA GTTCATTGAA GAAAAAACAA TCTGCTAA
 
Protein sequence
MISGILASPG IAFGKALLLK EDEIVIDRKK ISADQVDQEV ERFLSGRAKA SAQLETIKTK 
AGETFGEEKE AIFEGHIMLL EDEELEQEII ALIKDKHMTA DAAAHEVIEG QASALEELDD
EYLKERAADV RDIGKRLLRN ILGLKIIDLS AIQDEVILVA ADLTPSETAQ LNLKKVLGFI
TDAGGRTSHT SIMARSLELP AIVGTGSVTS QVKNDDYLIL DAVNNQVYVN PTNEVIDKMR
AVQEQVASEK AELAKLKDLP AITLDGHQVE VCANIGTVRD VEGAERNGAE GVGLYRTEFL
FMDRDALPTE EEQFAAYKAV AEACGSQAVI VRTMDIGGDK ELPYMNFPKE ENPFLGWRAI
RIAMDRKEIL RDQLRAILRA SAFGKLRIMF PMIISVEEVR ALRKEIEIYK QELRDEGKAF
DESIEIGVMV ETPAAATIAR HLAKEVDFFS IGTNDLTQYT LAVDRGNDMI SHLYQPMSPS
VLNLIKQVID ASHAEGKWTG MCGELAGDER ATLLLLGMGL DEFSMSAISI PRIKKIIRNT
NFEDAKVLAE QALAQPTTDE LMTLVNKFIE EKTIC
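
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ only in which start codons are permitted), and it ignores alternative start codons because this gene begins with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGATTTCAG GCATTTTAGC ATCCCCGGGT"))  # MISGILASPG
# Last four codons of the gene sequence, ending in the TAA stop codon.
print(translate("ACAATCTGCTAA"))                      # TIC*
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give a 575-residue string that can be compared with the listed protein sequence.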