Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2535 |
Symbol | fryA |
ID | 6144861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2592797 |
End bp | 2595292 |
Gene Length | 2496 bp |
Protein Length | 831 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617407 |
Product | multiphosphoryl transfer protein 1 |
Protein accession | YP_001744578 |
Protein GI | 170679733 |
COG category | [G] Carbohydrate transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type) |
TIGRFAM ID | [TIGR00848] PTS system, fructose subfamily, IIA component [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAACGA TTCAATTTCT CTGTCCTCTG CCTAACGGTC TGCATGCTCG TCCGGCGTGG GAACTTAAAG AACAGTGCAG CCAGTGGCAA AGCGAAATCA CCTTCATCAA CCATCGCCAG AACGCAAAGG CAGATGCGAA AAGCTCGCTG GCGCTGATTG GCACCGGCAC CCTATTTAAT GACAGTTGCA GCCTGAACAT TAGCGGCAGC GATGAAGAGC AGGCGCGGCG CGTGCTGGAA GAGTACATCC AGGTGCGCTT TATCGACAGC GACAGCGTCC AGCCAACGCA GGCAGAACTG ACAGCGCATC CGCTGCCGCG TTCATTAAGC CGTCTGAACC CGGATTTACT GTACGGCAAT GTGCTGGCAA GCGGCGTCGG CGTGGGTACG CTGACCCTGT TACAGAGCGA CAGCCTCGAC AGTTATCGGG CCATCCCGGC CAGTGCGCAA GACTCCACCC GGCTGGAGCA CAGCCTGGCA ACGCTTGCCG AGCAACTGAA CCAGCAACTG CGCGAGCGTG ACGGCGAAAG CAAAACTATC CTCAGCGCCC ATTTGTCGCT GATCCAGGAT GATGAATTTG CAGGCAATAT CCGTCGCCTG ATGGCAGAAC AGCATCAGGG GCTGGGGGCG GCGATCATCA GCAATATGGA GCAGGTTTGC GCCAAACTAT CTGCTTCTGC CAGCGATTAT CTGCGCGAGC GGGTCAGCGA CATTCGCGAT ATCAGCGAAC AGTTACTGCA TATCACCTGG CCGGAACTGA AACCGCGCAA CAATCTGGTG CTGGAAAAAC CGACTATTCT GGTGGCTGAA GATTTAACCC CAAGCCAGTT TTTGAGCCTC GATTTAAAAA ATCTTGCGGG CATGATTCTG GAGAAAACCG GGCGCACCTC GCATACCCTG ATCCTGGCCC GCGCTTCGGC GATCCCGGTA CTGAGCGGCC TGCCGCTGGA TGCGATTGCC CGTTATGCCG GGCAACCTGC AGTGCTTGAC GCCCAGTGCG GCGTGCTGGC GATTAACCCG AATGACGCGG TGAGCGGTTA TTATCAGGTC GCGCAGACGC TGGCGGATAA ACGCCAAAAA CAACAGGCGC AGGCGGCCGC GCAGCTGGCC TATTCCCGTG ATAACAAGCG TATTGATATT GCGGCGAATA TCGGCACCGC TCTGGAAGCG CCAGGCGCGT TTGCCAACGG CGCGGAAGGT GTCGGGCTGT TTCGTACCGA AATGCTCTAT ATGGATCGCG ACAGCGCGCC GGACGAGCAG GAGCAATTTG AAGCCTACCA GCAGGTGCTA CTGGCGGCGG GCGACAAGCC GATTATCTTC CGCACGATGG ACATCGGCGG CGATAAAAGC ATTCCTTACC TCAACATTCC CCAGGAAGAG AACCCGTTCC TCGGCTATCG CGCGGTACGT ATTTACCCGG AATTTGCTGG CCTGTTCCGC ACTCAACTGC GGGCGATTTT GCGTGCTGCC TGTTTCGGCA ACGCCCAGTT GATGATCCCG ATGGTTCACA GCCTCGATCA GATCTTATGG GTGAAGGGCG AGATCCAAAA AGCGATCGTT GAGCTTAAGC GCGATTGCCT GCGTCATGCA GAGACGATTA CGCTGGGTAT CATGGTGGAA GTGCCGTCGG TGTGCTACAT CATCGATCAC TTCTGCGATG AGGTCGATTT CTTCAGTATC GGCTCCAACG ATATGACCCA GTATCTGTAT GCGGTCGATC GTAATAACCC GCGCGTATCG CCGCTGTATA ACCCGATTAC GCCATCGTTC CTGCGCATGT TGCAGCAGAT TGTTACCACT GCGCATCAGC GGGGTAAATG GGTAGGTATT TGCGGTGAAC TGGGCGGTGA AAGCCGTTAT CTTCCGCTAC TGCTTGGGCT GGGCCTGGAC GAGCTGAGTA TGAGTAGCCC GCGCATTCCG GCGGTGAAAA GCCAGCTTCG TCAACTGGAT AGCGAGGCGT GTCGGGAACT GGCGCGTCAG GCATGTGAAT GCCGCAGTGC GCAGGAAATT GAAGCGTTAC TCACCGCCTT TACGCCGGAA GAAGACGTTC GCCCACTGCT GGCGCTGGAG AATATCTTTG TTGATCAGGC TTTTAGCAAT AAAGAGCAGG CGATCCAGTT CCTGTGCGGT AACCTCGGCG TTAACGGGCG CACTGAACAT CCGTTCGAGC TGGAAGAGGA TGTCTGGCAG CGGGAAGAGA TTGTTACCAC CGGCGTTGGT TTTGGCGTAG CGATCCCGCA CACCAAATCT CAGTGGATCC GTCATTCCAG TATCAGCATT GCCCGGCTGG TGAAACCGGT TGACTGGCAG TCAGAAATGG GCGAAGTCGA ACTGGTGATC ATGCTGACAC TGGGCGCTAA CGAAGGGATG AATCATGTGA AAGTCTTCTC GCAGCTGGCG CGTAAACTGG TGAATAAAAA CTTCCGCCAG TCGCTGTTCG CCGCGCAAGA TGCGCAAAGC ATCCTGACGC TGCTGGAAAC GGAATTAACC TTCTGA
|
Protein sequence | MLTIQFLCPL PNGLHARPAW ELKEQCSQWQ SEITFINHRQ NAKADAKSSL ALIGTGTLFN DSCSLNISGS DEEQARRVLE EYIQVRFIDS DSVQPTQAEL TAHPLPRSLS RLNPDLLYGN VLASGVGVGT LTLLQSDSLD SYRAIPASAQ DSTRLEHSLA TLAEQLNQQL RERDGESKTI LSAHLSLIQD DEFAGNIRRL MAEQHQGLGA AIISNMEQVC AKLSASASDY LRERVSDIRD ISEQLLHITW PELKPRNNLV LEKPTILVAE DLTPSQFLSL DLKNLAGMIL EKTGRTSHTL ILARASAIPV LSGLPLDAIA RYAGQPAVLD AQCGVLAINP NDAVSGYYQV AQTLADKRQK QQAQAAAQLA YSRDNKRIDI AANIGTALEA PGAFANGAEG VGLFRTEMLY MDRDSAPDEQ EQFEAYQQVL LAAGDKPIIF RTMDIGGDKS IPYLNIPQEE NPFLGYRAVR IYPEFAGLFR TQLRAILRAA CFGNAQLMIP MVHSLDQILW VKGEIQKAIV ELKRDCLRHA ETITLGIMVE VPSVCYIIDH FCDEVDFFSI GSNDMTQYLY AVDRNNPRVS PLYNPITPSF LRMLQQIVTT AHQRGKWVGI CGELGGESRY LPLLLGLGLD ELSMSSPRIP AVKSQLRQLD SEACRELARQ ACECRSAQEI EALLTAFTPE EDVRPLLALE NIFVDQAFSN KEQAIQFLCG NLGVNGRTEH PFELEEDVWQ REEIVTTGVG FGVAIPHTKS QWIRHSSISI ARLVKPVDWQ SEMGEVELVI MLTLGANEGM NHVKVFSQLA RKLVNKNFRQ SLFAAQDAQS ILTLLETELT F
|
| |