Gene EcSMS35_1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1520 
SymbolpykF 
ID6146852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1505754 
End bp1507166 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content50% 
IMG OID641616398 
Productpyruvate kinase 
Protein accessionYP_001743578 
Protein GI170683460 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0469] Pyruvate kinase 
TIGRFAM ID[TIGR01064] pyruvate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000224591 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00000000336273 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAAAAGA CCAAAATTGT TTGCACCATC GGACCGAAAA CCGAATCTGA AGAGATGTTA 
GCTAAAATGC TGGACGCTGG CATGAACGTT ATGCGTCTGA ACTTCTCTCA TGGTGACTAT
GCAGAACACG GTCAGCGCAT TCAGAATCTG CGCAACGTGA TGAGCAAAAC TGGTAAAACC
GCCGCTATCC TGCTTGATAC CAAAGGTCCG GAAATCCGCA CCATGAAACT GGAAGGCGGT
AACGACGTTT CTCTGAAAGC TGGTCAGACC TTTACTTTCA CTACTGATAA ATCTGTTATC
GGCAACAGCG AAATGGTTGC GGTAACGTAT GAAGGTTTCA CAACTGACCT GTCTGTTGGC
AACACCGTAC TGGTTGACGA TGGTCTGATC GGTATGGAAG TTACTGCCAT TGAAGGTAAC
AAAGTTATCT GTAAAGTGCT GAACAACGGC GACCTGGGTG AAAACAAAGG TGTGAACCTG
CCTGGCGTTT CCATTGCCCT GCCAGCACTG GCTGAAAAAG ACAAACAGGA CCTGATCTTT
GGTTGCGAAC AAGGCGTAGA CTTTGTCGCT GCTTCCTTTA TTCGTAAGCG TTCTGACGTT
ATCGAAATCC GTGAGCACCT GAAAGCGCAC GGCGGCGAAA ACATCCACAT CATCTCCAAA
ATCGAAAACC AGGAAGGCCT CAACAACTTC GACGAAATCC TCGAAGCCTC TGACGGCATC
ATGGTTGCGC GTGGCGACCT GGGTGTAGAA ATCCCGGTAG AAGAAGTTAT CTTCGCCCAG
AAGATGATGA TCGAAAAATG TATCCGTGCA CGTAAAGTCG TTATCACTGC GACCCAGATG
CTGGATTCCA TGATCAAAAA CCCACGCCCG ACTCGCGCAG AAGCCGGTGA CGTTGCAAAC
GCCATCCTCG ACGGTACTGA CGCAGTGATG CTGTCTGGTG AATCCGCAAA AGGTAAATAC
CCGCTGGAAG CGGTTTCTAT CATGGCGACC ATCTGCGAAC GTACCGACCG CGTGATGAAC
AGCCGTCTCG AGTTCAACAA TGACAACCGT AAACTGCGCA TTACCGAAGC GGTATGCCGT
GGTGCCGTTG AAACTGCTGA AAAACTGGAT GCTCCGCTGA TCGTGGTTGC TACCCAGGGC
GGTAAATCTG CTCGCGCAGT ACGTAAATAC TTCCCGGATG CCACCATCCT GGCACTGACC
ACCAACGAAA AAACGGCTCA TCAGTTGGTA CTGAGCAAAG GCGTTGTGCC GCAGCTGGTT
AAAGAGATCA CTTCTACTGA TGATTTCTAC CGTCTGGGTA AAGAACTGGC TCTGCAGAGC
GGTCTGGCAC ACAAAGGTGA CGTTGTAGTT ATGGTTTCTG GTGCACTGGT ACCGAGCGGC
ACTACTAACA CCGCATCTGT TCACGTCCTG TAA
 
Protein sequence
MKKTKIVCTI GPKTESEEML AKMLDAGMNV MRLNFSHGDY AEHGQRIQNL RNVMSKTGKT 
AAILLDTKGP EIRTMKLEGG NDVSLKAGQT FTFTTDKSVI GNSEMVAVTY EGFTTDLSVG
NTVLVDDGLI GMEVTAIEGN KVICKVLNNG DLGENKGVNL PGVSIALPAL AEKDKQDLIF
GCEQGVDFVA ASFIRKRSDV IEIREHLKAH GGENIHIISK IENQEGLNNF DEILEASDGI
MVARGDLGVE IPVEEVIFAQ KMMIEKCIRA RKVVITATQM LDSMIKNPRP TRAEAGDVAN
AILDGTDAVM LSGESAKGKY PLEAVSIMAT ICERTDRVMN SRLEFNNDNR KLRITEAVCR
GAVETAEKLD APLIVVATQG GKSARAVRKY FPDATILALT TNEKTAHQLV LSKGVVPQLV
KEITSTDDFY RLGKELALQS GLAHKGDVVV MVSGALVPSG TTNTASVHVL