Gene Information |
Locus tag | EcolC_4068 |
Symbol | |
ID | 6065356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4489607 |
End bp | 4492108 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603491 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001726994 |
Protein GI | 170022040 |
COG category | [G] Carbohydrate transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type) |
TIGRFAM ID | [TIGR00848] PTS system, fructose subfamily, IIA component [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
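The length fields in this record are consistent with each other, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions match the listed values (2502 bp, 833 aa) but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for EcolC_4068.
length = gene_length_bp(4489607, 4492108)
print(length, protein_length_aa(length))
```

With the coordinates above, the two functions reproduce the Gene Length and Protein Length fields.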
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGA TTGTGGAATT TATTTGTGAG CTACCTAACG GCGTACATGC ACGTCCGGCA AGCCACGTTG AAACGCTGTG TAATACTTTT TCATCACAAA TTGAGTGGCA TAACTTGCGC ACTGACCGCA AGGGCAACGC CAAAAGCGCC CTTGCGCTGA TTGGCACCGA TACGCTGGTG GGCGATAACT GCCAGTTACT GATTTCCGGG GCCGACGAAC AGGAAGCGCA CCAGCGTTTA AGCCAATGGC TGCGCGATGA ATTCCCCCAC TGCGACGCAC CGCTGGCGGA AGTTAAATCT GACGAACTGG AACCACTGCC GGTTTCACTG ACCAATCTGA ATCCGCAAAT TATCCGCGCC CGCACCGTGT GCAGCGGTAG CGCAGGCGGC ATTCTGACGC CGATCTCTTC TTTAGATCTC AATGCGCTGG GTAATCTTCC TGTTGCCAAA GGCGTTGACG CCGAGCAATC CGCGCTGGAA AACGGCCTGA CACTGGTATT GAAAAACATT GAATTTCGAC TGCTGGATAG CGACGGTGCT ACCAGCGCGA TACTGGAAGC TCACCGATCC CTGGCTGGCG ATACTTCTTT ACGTCAGCAC TTACTGGCAG GCGTCAGCGA GGGATTAAGC TGTGCCGAAG CGATTGTCGC CAGTGCACAC CACTTCTGCG AAGAGTTCGC CCGTTCCAGC AGTAGCTACC TGCAAGAACG CGCCCTGGAC GTACGCGACG TCTGCTTCCA GTTACTCCAG CAAATCTACG GTGAGCAACG CTTCCCGGCA CCGGGAAAAC TGACGCAGCC CGCCATTTGT ATGGCTGATG AACTGACCCC CAGCCAGTTC CTCGAACTGG ATAAAAATCA CCTCAAAGGA TTGTTGCTCA AAAGCGGCGG CACCACCTCA CATACGGTGA TCCTTGCCCG TTCGTTCAAC ATTCCAACGC TGGTTGGTGT GGATATTGAT GCCCTCACTC CGTGGCAGCA TCAGACGATT TATATCGACG GCAACGCTGG GGCGATTGTG GTTGAGCCAG GGGAAGCCGT AGCCCGTTAT TATCAGCAAG AAGCCCGCGT ACAGGACGCC CTGCGTGAGC AACAGCGTGT CTGGCTGACC CAACAAGCCC GTACCGCTGA CGGTATCCGC ATTGAAATTG CCGCTAACAT CGCTCACTCC GTGGAAGCAC AGGCCGCGTT CGGCAATGGT GCGGAAGGCG TTGGTTTGTT CCGCACTGAA ATGCTCTATA TGGATCGCAC CAGTGCACCG GGCGAAAGCG AGCTGTACAA CATTTTTTGT CAGGCGCTGG AGTCTGCCAA CGGACGCAGC ATTATTGTGC GCACTATGGA CATTGGCGGC GACAAACCCG TTGATTATCT AAACATTCCC GCAGAGGCAA ACCCGTTCCT CGGTTATCGC GCGGTGCGTA TTTATGAAGA ATACGCCTCG CTGTTCACCA CACAACTACG GTCGATCCTG CGCGCCTCCG CTCACGGCAG CCTGAAAATC ATGATCCCGA TGATCTCCTC AATGGAAGAG ATCTTATGGG TGAAAGAAAA ACTGGCGGAA GCCAAACAGC AACTACGTAA CGAACACATT CCGTTTGATG AGAAGATCCA GCTCGGCATT ATGCTGGAAG TGCCGTCGGT GATGTTCATC ATCGATCAAT GCTGCGAAGA GATTGATTTC TTTAGTATTG GTAGTAATGA CCTGACGCAG TATCTGCTGG CGGTGGATCG CGATAACGCT AAGGTTACTC GTCACTACAA CAGCCTGAAT CCGGCATTCT TGCGGGCGCT CGATTACGCC 
GTGCAAGCGG TGCATCGCCA GGGAAAATGG ATTGGTCTGT GCGGTGAGCT GGGAGCGAAA GGTTCCGTGC TGCCGTTGCT GGTCGGTTTA GGGCTGGATG AGCTGAGCAT GAGCGCACCA TCAATTCCGG CGGCGAAAGC GCGGATGGCG CAACTTGATA GCCGTGAGTG CCGCAAGTTG CTCAACCAGG CAATGGCCTG CCGTACTTCG CTGGAAGTGG AACACCTGCT GGCGCAATTC CGCATGACCC AACAAGACGC ACCGCTGGTC ACCGCCGAGT GCATCACACT GGAAAGCGAC TGGCGCAGCA AAGAAGAAGT ACTCAAAGGC ATAACCGATA ACCTGCTGCT GGCGGGCCGC TGCCGCTATC CGCGTAAACT GGAAGCCGAC TTGTGGGCGC GCGAGGCCGT TTTCTCTACC GGTCTGGGCT TTAGTTTTGC CATTCCACAC AGCAAATCAG AACACATTGA GCAATCCACC ATCAGCGTAG CGCGTCTGCA AGCGCCGGTG CGCTGGGGCG ATGATGAAGC GCAATTCATC ATTATGTTAA CCCTGAACAA ACACGCTGCG GGCGATCAGC ATATGCGCAT TTTCTCGCGC CTCGCTCGTC GCATCATGCA CGAAGAATTC CGTAACGCGC TGGTTAACGC CGCCTCTGCC GACGCTATCG CCAGCCTGCT GCAACATGAA CTGGAACTGT AA
|
Protein sequence | MALIVEFICE LPNGVHARPA SHVETLCNTF SSQIEWHNLR TDRKGNAKSA LALIGTDTLV GDNCQLLISG ADEQEAHQRL SQWLRDEFPH CDAPLAEVKS DELEPLPVSL TNLNPQIIRA RTVCSGSAGG ILTPISSLDL NALGNLPVAK GVDAEQSALE NGLTLVLKNI EFRLLDSDGA TSAILEAHRS LAGDTSLRQH LLAGVSEGLS CAEAIVASAH HFCEEFARSS SSYLQERALD VRDVCFQLLQ QIYGEQRFPA PGKLTQPAIC MADELTPSQF LELDKNHLKG LLLKSGGTTS HTVILARSFN IPTLVGVDID ALTPWQHQTI YIDGNAGAIV VEPGEAVARY YQQEARVQDA LREQQRVWLT QQARTADGIR IEIAANIAHS VEAQAAFGNG AEGVGLFRTE MLYMDRTSAP GESELYNIFC QALESANGRS IIVRTMDIGG DKPVDYLNIP AEANPFLGYR AVRIYEEYAS LFTTQLRSIL RASAHGSLKI MIPMISSMEE ILWVKEKLAE AKQQLRNEHI PFDEKIQLGI MLEVPSVMFI IDQCCEEIDF FSIGSNDLTQ YLLAVDRDNA KVTRHYNSLN PAFLRALDYA VQAVHRQGKW IGLCGELGAK GSVLPLLVGL GLDELSMSAP SIPAAKARMA QLDSRECRKL LNQAMACRTS LEVEHLLAQF RMTQQDAPLV TAECITLESD WRSKEEVLKG ITDNLLLAGR CRYPRKLEAD LWAREAVFST GLGFSFAIPH SKSEHIEQST ISVARLQAPV RWGDDEAQFI IMLTLNKHAA GDQHMRIFSR LARRIMHEEF RNALVNAASA DAIASLLQHE LEL
|
| |
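The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied as text in the 10-base blocks shown in this record, so whitespace is stripped before counting. The helper names are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases; uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
example = "ATGGCCCTGA TTGTGGAATT"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the full gene sequence to `gc_percent` would give the value to compare against the rounded GC content listed in the record.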