Gene EcSMS35_4863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4863 
SymbolgcxK 
ID6146453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4972272 
End bp4973414 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content51% 
IMG OID641619667 
Productglycerate kinase GcxK 
Protein accessionYP_001746774 
Protein GI170682944 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1929] Glycerate kinase 
TIGRFAM ID[TIGR00045] glycerate kinase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.432659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.859759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAG TTATCTCCCC CGACTCCTTT AAAGAATGCC TTCCCGCATG GAAAGTCGCC 
GAAGCGCTGG CAACCGGCTG GCGCAAGGTC CTGCCTGGCA GCCAACTGGT GTGTTTGCCC
GTGGCTGACG GCGGCGAAGG CACGCTCGAA ACACTCATTC ATGCGACTGA CGGTACGTTT
TACACTAAAA AAGTCACCGG ACCGCTTGGC GAATCAATAC ATGCGCAATA CGGTATTTTA
GGCAACCAAA CCACCGCAGT GATAGAACTG GCACAAGCTT CAGGGCTGGA ACTGGTTTCT
CCTGTCCAGC GTTCTCCCCT TTATACGACG TCGTTTGGCA CTGGCGAACT TATTCTCGCC
GCCCTGGAAC ACAATATTGA TACCGTTATT CTATGCCTGG GTGGCAGTGC TACAAATGAT
GGCGGTATTG GGTTGATGTC GGCACTTGGC GCATCGTTTA CAGACGCCGA AGGGCTATCA
GTCTCTGTTA ATGGGATGGG GCTGGCGGCA ATTCACCACA TTGACTTACA GCACCTCGAT
CCACGATTGA AAAATGTGAA ATTTATTGCA GCCTGTGATG TCACCAACCC ACTAACCGGC
GATAACGGCG CGACTCGGGT TTTTGCTCAA CAAAAAGGGG CCAGTGCTGA CAACCTTGAG
CAACTGGAAC AGGGAATGAA AAACTATGCC CGTTGCATCT ACCGTTGTTG TGGTAAAGAA
GTCGATACGA TACCCGGTTC TGGGGCGGCT GGCGGCGTTG GCGCGGCCTT GATGGCTTTT
CTCGATGCTC GCTTACAACC GGGTATTTCG CTCGTGCTGG AAGCGATTCA ATATACCCAA
CATTTAAAAT ATGCAGCATT GGCGATTGTC GGTGAAGGTA AATTAGACCG TCAAAGCCTG
AATGGCAAAG CACCTGTGGG GGCGGCCAAA ATCGCCCAGA TGATGGGCGT TCCGGTTATC
GCAATTGCCG GGTATATCGA TGATCAACTT GATTTGAATG AGTTACGCCA GTGTGGAATC
GAAGCCTGTT TTTCCGTCGT CAATGGTCCT TGTGATTTAC CCACCGCGCT GAGTCAGGGG
GAAAATAATT TAATTCGTCT CGGAGAAAAT TTGGCAGGGT ATTTTCATGC AGTCCTGAGT
TAA
 
Protein sequence
MKIVISPDSF KECLPAWKVA EALATGWRKV LPGSQLVCLP VADGGEGTLE TLIHATDGTF 
YTKKVTGPLG ESIHAQYGIL GNQTTAVIEL AQASGLELVS PVQRSPLYTT SFGTGELILA
ALEHNIDTVI LCLGGSATND GGIGLMSALG ASFTDAEGLS VSVNGMGLAA IHHIDLQHLD
PRLKNVKFIA ACDVTNPLTG DNGATRVFAQ QKGASADNLE QLEQGMKNYA RCIYRCCGKE
VDTIPGSGAA GGVGAALMAF LDARLQPGIS LVLEAIQYTQ HLKYAALAIV GEGKLDRQSL
NGKAPVGAAK IAQMMGVPVI AIAGYIDDQL DLNELRQCGI EACFSVVNGP CDLPTALSQG
ENNLIRLGEN LAGYFHAVLS