Gene EcSMS35_3419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3419 
SymbolgarK 
ID6143943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3500461 
End bp3501606 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID641618248 
Productglycerate kinase I 
Protein accessionYP_001745397 
Protein GI170683586 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1929] Glycerate kinase 
TIGRFAM ID[TIGR00045] glycerate kinase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG TAATCGCCCC AGACTCTTAT AAAGAAAGTT TATCTGCCAG CGAGGTTGCG 
CAGGCGATAG AAAAAGGATT TCGGGAAATT TTTCCTGATG CACAGTACGT TTCTGTTCCT
GTTGCCGATG GTGGCGAAGG AACGGTGGAA GCGATGATTG CAGCCACCCA GGGTTCCGAA
CGTCACGCCT GGGTTACAGG GCCGCTGGGC GAGAAGGTGA ATGCCAGTTG GGGGATCTCC
GGCGATGGCA AAACCGCGTT TATTGAAATG GCGGCGGCCA GTGGGCTGGA GCTGGTACCT
GCGGAAAAAC GTGATCCACT CGTGACCACT TCACGCGGCA CAGGCGAGTT AATTCTGCAG
GCGCTGGAGA GCGGCGCGAC AAACATTATT ATCGGCATTG GCGGTAGCGC TACAAATGAT
GGCGGTGCAG GCATGGTACA GGCGCTGGGG GCGAAATTAT GCGACGCCAA CGGCAATGAA
ATTGGTTTTG GCGGCGGTAG TCTTAATACT CTGAATGATA TTGATATTTC CGGCCTCGAT
CCGCGCTTAA AAGATTGCGT CATTCGCGTC GCTTGTGATG TCACCAATCC GCTGGTGGGC
GATAACGGCG CATCGCGCAT CTTTGGCCCA CAAAAGGGAG CCAGTGAAGC GATGATTGTT
GAGCTGGACA ATAACCTTTC TCACTATGCC GATGTCATTA AAAAAGCGCT GCATGTTGAT
GTGAAAGATG TCCCCGGCGC AGGAGCTGCG GGTGGTATGG GCGCGGCGCT AATGGCGTTT
CTTGGTGCGG AACTGAAAAG CGGTATTGAA ATCGTCACGA CGGCGCTGAA TCTGGAGGAA
CATATTCACG ATTGTACGCT GGTGATCACC GGTGAAGGGC GTATTGACAG CCAGAGTATA
CACGGCAAGG TTCCGATTGG CGTCGCAAAC GTGGCGAAAA AGTACCATAA ACCGGTGATT
GGCATTGCGG GTAGCCTGAC CAATGATGTT GGCGTTGTAC ATCAGCATGG CATTGATGCG
GTCTTCAGCG TATTGACCAG TATTGGTACG TTGGACGAAG CATTCCGCGG GGCTTATGAC
AATATTTACC GTGCTTCACG TAATATCGCC GCGACACTGG CGATTGGAAT GCGCAACGCG
GGGTGA
 
Protein sequence
MKIVIAPDSY KESLSASEVA QAIEKGFREI FPDAQYVSVP VADGGEGTVE AMIAATQGSE 
RHAWVTGPLG EKVNASWGIS GDGKTAFIEM AAASGLELVP AEKRDPLVTT SRGTGELILQ
ALESGATNII IGIGGSATND GGAGMVQALG AKLCDANGNE IGFGGGSLNT LNDIDISGLD
PRLKDCVIRV ACDVTNPLVG DNGASRIFGP QKGASEAMIV ELDNNLSHYA DVIKKALHVD
VKDVPGAGAA GGMGAALMAF LGAELKSGIE IVTTALNLEE HIHDCTLVIT GEGRIDSQSI
HGKVPIGVAN VAKKYHKPVI GIAGSLTNDV GVVHQHGIDA VFSVLTSIGT LDEAFRGAYD
NIYRASRNIA ATLAIGMRNA G