Gene EcSMS35_2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2571 
SymbolptsI 
ID6144360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2625056 
End bp2626783 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content50% 
IMG OID641617442 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001744607 
Protein GI170681013 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00718349 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCAG GCATTTTAGC ATCCCCGGGT ATCGCTTTCG GTAAAGCTCT GCTTCTGAAA 
GAAGACGAAA TTGTCATTGA CCGGAAAAAA ATTTCTGCCG ACCAGGTTGA TCAGGAAGTT
GAACGTTTTC TGAGCGGTCG TGCCAAGGCA TCAGCCCAGC TGGAAACGAT CAAAACGAAA
GCTGGTGAAA CGTTCGGTGA AGAAAAAGAA GCCATCTTTG AAGGGCATAT TATGCTGCTC
GAAGATGAGG AGCTGGAGCA GGAAATCATA GCCCTGATTA AAGATAAGCA CATGACAGCT
GACGCAGCTG CTCATGAAGT TATCGAAGGT CAGGCTTCTG CCCTGGAAGA GCTGGATGAT
GAATACCTGA AAGAACGTGC GGCTGACGTA CGTGATATCG GTAAGCGCCT GCTGCGCAAC
ATCCTGGGCC TGAAGATTAT CGACCTGAGC GCCATTCAGG ATGAAGTCAT TCTGGTTGCC
GCTGACCTGA CGCCGTCCGA AACCGCACAG CTGAACCTGA AGAAGGTGCT GGGTTTCATC
ACCGACGCGG GTGGCCGTAC TTCCCACACC TCTATCATGG CGCGTTCTCT GGAACTGCCT
GCTATCGTGG GTACCGGTAG CGTCACCTCT CAGGTGAAAA ATGACGACTA TCTGATTCTG
GATGCCGTAA ATAATCAGGT TTACGTCAAT CCAACCAACG AAGTTATTGA TAAAATGCGC
GCTGTTCAGG AGCAAGTGGC TTCTGAAAAA GCAGAGCTTG CTAAACTGAA AGATCTGCCA
GCTATTACGC TGGACGGTCA CCAGGTAGAA GTGTGCGCTA ACATTGGTAC GGTTCGTGAC
GTTGAAGGTG CAGAGCGTAA CGGCGCTGAA GGCGTTGGTC TGTATCGTAC TGAGTTCCTG
TTCATGGACC GCGACGCGCT GCCCACTGAA GAAGAACAGT TTGCTGCTTA CAAAGCAGTG
GCTGAAGCGT GTGGCTCGCA GGCGGTTATC GTTCGTACCA TGGACATCGG CGGCGACAAA
GAGCTGCCAT ACATGAACTT CCCGAAAGAA GAGAACCCGT TCCTCGGCTG GCGCGCTATC
CGTATCGCGA TGGATCGTAA AGAGATCCTG CGCGATCAGC TCCGCGCTAT CCTGCGTGCC
TCGGCTTTCG GTAAATTGCG CATTATGTTC CCGATGATCA TCTCTGTTGA AGAAGTGCGT
GCACTGCGCA AAGAGATCGA AATCTACAAA CAGGAACTGC GCGACGAAGG TAAAGCGTTT
GACGAGTCAA TTGAAATCGG CGTAATGGTG GAAACACCGG CTGCCGCAAC AATTGCACGT
CATTTAGCCA AAGAAGTTGA TTTCTTTAGT ATCGGCACCA ATGATTTAAC GCAGTACACT
CTGGCAGTTG ACCGTGGTAA TGATATGATT TCACACCTTT ACCAGCCAAT GTCACCGTCC
GTGCTGAACT TGATCAAGCA AGTTATTGAT GCTTCTCATG CTGAAGGCAA ATGGACTGGC
ATGTGTGGTG AGCTTGCTGG CGATGAACGT GCTACACTTC TGTTGCTGGG GATGGGTCTG
GACGAATTCT CTATGAGCGC CATTTCTATC CCGCGCATTA AGAAGATTAT CCGTAACACG
AACTTCGAAG ATGCGAAGGT GTTAGCAGAG CAGGCTCTTG CTCAACCGAC AACGGACGAG
TTAATGACGC TGGTTAACAA GTTCATTGAA GAAAAAACAA TCTGCTAA
 
Protein sequence
MISGILASPG IAFGKALLLK EDEIVIDRKK ISADQVDQEV ERFLSGRAKA SAQLETIKTK 
AGETFGEEKE AIFEGHIMLL EDEELEQEII ALIKDKHMTA DAAAHEVIEG QASALEELDD
EYLKERAADV RDIGKRLLRN ILGLKIIDLS AIQDEVILVA ADLTPSETAQ LNLKKVLGFI
TDAGGRTSHT SIMARSLELP AIVGTGSVTS QVKNDDYLIL DAVNNQVYVN PTNEVIDKMR
AVQEQVASEK AELAKLKDLP AITLDGHQVE VCANIGTVRD VEGAERNGAE GVGLYRTEFL
FMDRDALPTE EEQFAAYKAV AEACGSQAVI VRTMDIGGDK ELPYMNFPKE ENPFLGWRAI
RIAMDRKEIL RDQLRAILRA SAFGKLRIMF PMIISVEEVR ALRKEIEIYK QELRDEGKAF
DESIEIGVMV ETPAAATIAR HLAKEVDFFS IGTNDLTQYT LAVDRGNDMI SHLYQPMSPS
VLNLIKQVID ASHAEGKWTG MCGELAGDER ATLLLLGMGL DEFSMSAISI PRIKKIIRNT
NFEDAKVLAE QALAQPTTDE LMTLVNKFIE EKTIC