Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1520 |
Symbol | pykF |
ID | 6146852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1505754 |
End bp | 1507166 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616398 |
Product | pyruvate kinase |
Protein accession | YP_001743578 |
Protein GI | 170683460 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0469] Pyruvate kinase |
TIGRFAM ID | [TIGR01064] pyruvate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000224591 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000000336273 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAAAGA CCAAAATTGT TTGCACCATC GGACCGAAAA CCGAATCTGA AGAGATGTTA GCTAAAATGC TGGACGCTGG CATGAACGTT ATGCGTCTGA ACTTCTCTCA TGGTGACTAT GCAGAACACG GTCAGCGCAT TCAGAATCTG CGCAACGTGA TGAGCAAAAC TGGTAAAACC GCCGCTATCC TGCTTGATAC CAAAGGTCCG GAAATCCGCA CCATGAAACT GGAAGGCGGT AACGACGTTT CTCTGAAAGC TGGTCAGACC TTTACTTTCA CTACTGATAA ATCTGTTATC GGCAACAGCG AAATGGTTGC GGTAACGTAT GAAGGTTTCA CAACTGACCT GTCTGTTGGC AACACCGTAC TGGTTGACGA TGGTCTGATC GGTATGGAAG TTACTGCCAT TGAAGGTAAC AAAGTTATCT GTAAAGTGCT GAACAACGGC GACCTGGGTG AAAACAAAGG TGTGAACCTG CCTGGCGTTT CCATTGCCCT GCCAGCACTG GCTGAAAAAG ACAAACAGGA CCTGATCTTT GGTTGCGAAC AAGGCGTAGA CTTTGTCGCT GCTTCCTTTA TTCGTAAGCG TTCTGACGTT ATCGAAATCC GTGAGCACCT GAAAGCGCAC GGCGGCGAAA ACATCCACAT CATCTCCAAA ATCGAAAACC AGGAAGGCCT CAACAACTTC GACGAAATCC TCGAAGCCTC TGACGGCATC ATGGTTGCGC GTGGCGACCT GGGTGTAGAA ATCCCGGTAG AAGAAGTTAT CTTCGCCCAG AAGATGATGA TCGAAAAATG TATCCGTGCA CGTAAAGTCG TTATCACTGC GACCCAGATG CTGGATTCCA TGATCAAAAA CCCACGCCCG ACTCGCGCAG AAGCCGGTGA CGTTGCAAAC GCCATCCTCG ACGGTACTGA CGCAGTGATG CTGTCTGGTG AATCCGCAAA AGGTAAATAC CCGCTGGAAG CGGTTTCTAT CATGGCGACC ATCTGCGAAC GTACCGACCG CGTGATGAAC AGCCGTCTCG AGTTCAACAA TGACAACCGT AAACTGCGCA TTACCGAAGC GGTATGCCGT GGTGCCGTTG AAACTGCTGA AAAACTGGAT GCTCCGCTGA TCGTGGTTGC TACCCAGGGC GGTAAATCTG CTCGCGCAGT ACGTAAATAC TTCCCGGATG CCACCATCCT GGCACTGACC ACCAACGAAA AAACGGCTCA TCAGTTGGTA CTGAGCAAAG GCGTTGTGCC GCAGCTGGTT AAAGAGATCA CTTCTACTGA TGATTTCTAC CGTCTGGGTA AAGAACTGGC TCTGCAGAGC GGTCTGGCAC ACAAAGGTGA CGTTGTAGTT ATGGTTTCTG GTGCACTGGT ACCGAGCGGC ACTACTAACA CCGCATCTGT TCACGTCCTG TAA
|
Protein sequence | MKKTKIVCTI GPKTESEEML AKMLDAGMNV MRLNFSHGDY AEHGQRIQNL RNVMSKTGKT AAILLDTKGP EIRTMKLEGG NDVSLKAGQT FTFTTDKSVI GNSEMVAVTY EGFTTDLSVG NTVLVDDGLI GMEVTAIEGN KVICKVLNNG DLGENKGVNL PGVSIALPAL AEKDKQDLIF GCEQGVDFVA ASFIRKRSDV IEIREHLKAH GGENIHIISK IENQEGLNNF DEILEASDGI MVARGDLGVE IPVEEVIFAQ KMMIEKCIRA RKVVITATQM LDSMIKNPRP TRAEAGDVAN AILDGTDAVM LSGESAKGKY PLEAVSIMAT ICERTDRVMN SRLEFNNDNR KLRITEAVCR GAVETAEKLD APLIVVATQG GKSARAVRKY FPDATILALT TNEKTAHQLV LSKGVVPQLV KEITSTDDFY RLGKELALQS GLAHKGDVVV MVSGALVPSG TTNTASVHVL
|
| |