Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4395 |
Symbol | ptsA |
ID | 6144211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4485904 |
End bp | 4488405 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619216 |
Product | multi-phosphoryl transfer protein 2 |
Protein accession | YP_001746340 |
Protein GI | 170679892 |
COG category | [G] Carbohydrate transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type) |
TIGRFAM ID | [TIGR00848] PTS system, fructose subfamily, IIA component [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.963786 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGA TTGTGGAATT TATTTGTGAG CTACCTAACG GCGTACATGC GCGTCCGGCA AGCCACGTTG AAACGCTGTG TAATACTTTT TCATCACAAA TTGAGTGGCA TAACCTGCGC ACTGACCGCA AGGGCAACGC CAAAAGCGCC CTTGCGCTGA TTGGCACCGA TACGCTGGTG GGCGATAACT GCCAGTTACT GATTTCCGGG GCCGACGAAC AGGAAGCGCA CCAGCGTTTA AGCCAATGGC TGCGCGATGA ATTCCCCCAC TGCGACGCAC CGCTGGCGGA AGTAAAATCT GACGAACTGG AACCACTGCC GGTTTCACTG ACCAATCTGA ATCCGCAAAT TATCCGTGCC CGCACCGTGT GCAGCGGTAG CGCAGGCGGC ATTCTGACGC CGATCTCTTC TTTAGATCTC AATGCGCTGG GTAATCTTCC CGCAGCCAAA GGCGTTGACG CCGAACAATC CGCACTGGAA AACGGTCTGA CGCTGGTACT GAAAAACATA GAGTTTCGTC TGCTGGATAG CGACGGTGCT ACCAGCGCGA TTCTGGAAGC GCACCGATCC CTGGCTGGCG ATACTTCCCT GCGCGAACAT TTACTGGCAG GTGTCAGCGC CGGATTAAGC TGCGCCGAAG CGATTGTTGC CAGTGCGAAT CACTTTTGCG AAGAGTTCGC CCGTTCCAGC AGCAGCTACC TGCAAGAACG TGCCCTGGAC GTACGCGACG TCTGCTTCCA GTTACTCCAG CAAATCTACG GTGAGCAACG CTTCCCGGCA CCGGGCAAAC TGACGCAGCC CGCCATTTGT ATGGCTGATG AACTGACCCC CAGCCAGTTC CTCGAACTGG ATAAAAATCA CCTCAAAGGA TTGTTGCTCA AAAGCGGCGG CACCACCTCA CATACGGTGA TCCTTGCCCG TTCGTTCAAC ATTCCGACGC TGGTTGGTGT GGATATTGAT GCCCTCACTC CGTGGCAGCA TCAAACGATT TATATCGACG GCAACGCTGG GGCGATTGTG GTTGAGCCAG GGGAATCCGT GGCCCGTTAT TATCAGCAAG AAGCCCGCGT ACAGGACGCT CTGCGTGAGC AACAGCGTGT CTGGCTGACC CAACAAGCCC GTACCGCTGA CGGTATCCGC ATTGAAATTG CTGCTAACAT CGCTCACTCC GTGGAAGCGC AGGCCGCATT CGGCAATGGT GCGGAAGGCG TTGGTTTATT CCGCACTGAA ATGCTCTATA TGGATCGCAC CAGCGCACCG GGCGAAAGCG AGCTGTACAA CATTTTTTGT CAGGCGCTGG AGTCCGCCAA CGGACGCAGC ATTATTGTGC GCACTATGGA TATTGGCGGT GACAAACCCG TTGATTATCT GAACATTCCC GCAGAGGCAA ACCCGTTCCT CGGTTATCGC GCCGTGCGTA TTTATGAAGA GTACGCGTCG TTATTTACCA CACAACTACG GTCGATTCTC CGTGCCTCCG CTCACGGCAG CCTGAAAATC ATGATCCCGA TGATCTCCTC AATGGAAGAG ATCTTATGGG TAAAAGAAAA ACTGGCAGAA GCCAAACAGC AACTACGTAA CGAACACATT CCGTTTGATG AGAAGATCCA GCTCGGCATT ATGCTGGAAG TGCCGTCGGT GATGTTCATC ATCGATCAAT GCTGCGAAGA GATTGATTTC TTTAGTATCG GTAGTAATGA CCTGACGCAA TATCTGCTGG CAGTGGATCG CGATAACGCT AAGGTTACTC GTCACTATAA CAGCCTGAAT CCGGCCTTCT TGCGGGCGCT CGATTACGCC GTGCAGGCGG TGCATCGCCA GGGCAAATGG ATTGGTCTGT GCGGTGAACT GGGAGCGAAA GGTTCCGTGC TGCCTTTGCT GGTCGGTTTA GGGCTGGATG AACTCAGCAT GAGCGCACCA TCAATTCCGG CGGCGAAAGC GCGGATGGCG CAACTTGATA GCCGTGAGTG CCGCCAGTTG CTCAACCAGG CAATGGCCTG CCGTACGTCG CTGGAAGTGG AACACCTGCT GGCGCAATTC CGCATGACCC AACAAGACGC ACCGCTGGTC ACCGCCGAGT GCATCACACT GGAAAGCGAC TGGCGCAGCA AAGAAGAAGT GCTCAAAGGC ATGACCGATA ACCTGCTGCT GGCGGGCCGC TGCCGCTATC CGCGTAAACT GGAAGCCGAC TTGTGGGCGC GCGAGGCCGT TTTCTCTACC GGTCTGGGCT TTAGTTTCGC CATTCCCCAC AGCAAATCAG AACACATTGA GCAATCCACC ATCAGCGTGG CGCGTCTGGC AGCGCCGGTG CGCTGGGGCG ATGATGAAGC GCAATTCATC ATTATGTTAA CCCTGAACAA ACATGCTGCG GGCGATCAGC ACATGCGCAT TTTCTCGCGC CTCGCTCGTC GCATCATGCA CGAAGAATTC CGTAACGCGC TGGTTAACGC CGCCTCTGCC GACGCTATCG CCAGCCTGCT GCAACATGAA CTGGAACTGT AA
|
Protein sequence | MALIVEFICE LPNGVHARPA SHVETLCNTF SSQIEWHNLR TDRKGNAKSA LALIGTDTLV GDNCQLLISG ADEQEAHQRL SQWLRDEFPH CDAPLAEVKS DELEPLPVSL TNLNPQIIRA RTVCSGSAGG ILTPISSLDL NALGNLPAAK GVDAEQSALE NGLTLVLKNI EFRLLDSDGA TSAILEAHRS LAGDTSLREH LLAGVSAGLS CAEAIVASAN HFCEEFARSS SSYLQERALD VRDVCFQLLQ QIYGEQRFPA PGKLTQPAIC MADELTPSQF LELDKNHLKG LLLKSGGTTS HTVILARSFN IPTLVGVDID ALTPWQHQTI YIDGNAGAIV VEPGESVARY YQQEARVQDA LREQQRVWLT QQARTADGIR IEIAANIAHS VEAQAAFGNG AEGVGLFRTE MLYMDRTSAP GESELYNIFC QALESANGRS IIVRTMDIGG DKPVDYLNIP AEANPFLGYR AVRIYEEYAS LFTTQLRSIL RASAHGSLKI MIPMISSMEE ILWVKEKLAE AKQQLRNEHI PFDEKIQLGI MLEVPSVMFI IDQCCEEIDF FSIGSNDLTQ YLLAVDRDNA KVTRHYNSLN PAFLRALDYA VQAVHRQGKW IGLCGELGAK GSVLPLLVGL GLDELSMSAP SIPAAKARMA QLDSRECRQL LNQAMACRTS LEVEHLLAQF RMTQQDAPLV TAECITLESD WRSKEEVLKG MTDNLLLAGR CRYPRKLEAD LWAREAVFST GLGFSFAIPH SKSEHIEQST ISVARLAAPV RWGDDEAQFI IMLTLNKHAA GDQHMRIFSR LARRIMHEEF RNALVNAASA DAIASLLQHE LEL
|
| |