Gene EcE24377A_0784 details

Gene Information

Locus tag: EcE24377A_0784
Symbol: galK
ID: 5586335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 808296
End bp: 809444
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 53%
IMG OID: 640924496
Product: galactokinase
Protein accession: YP_001461911
Protein GI: 157158993
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
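
The start, end, gene length and protein length values listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(808296, 809444)
print(length, protein_length(length))
```

Under these assumptions the sketch gives 1149 bp and 382 aa, which agrees with the Gene Length and Protein Length values in this record.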


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT 
CACACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACCGA CTACAACGAC
GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGCTGTGC GCCACGCGAT
GACCGTAAAG TTCGCGTAAT GGCAGCCGAT TATGAAAATC AGCTCGACGA GTTTTCCCTC
GATGCGCCCA TTGTCGCGCA TGAAAACTAT CAATGGGCGA ACTACGTTCG TGGCGTGGTG
AAACATCTGC AACTGCGTAA CAACAGCTTC GGCGGCGTGG ACATGGTGAT CAGCGGCAAT
GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA
TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA CGGTCAGGAA
GCAGAAAACC AGTTTGTTGG CTGTAACTGC GGGATCATGG ATCAGCTAAT TTCCGCACTC
GGCAAGAAAG ATCATGCCTT GCTGATTGAC TGCCGCTCAC TGGGGACTAA AGCAGTTTCC
ATGCCGAAAG GTGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC
AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA
GCCCTGCGCG ATGTCACCAT TGAAGAGTTC AACGCTGTTG CACATGAGCT GGACCCAATC
GTGGCGAAAC GCGTGCGGCA TATCCTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC
GCGCTGGAGC AGGGCGACCT GAAACGTATG GGCGAGTTGA TGGCGGAGTC TCATGCCTCT
ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA
GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGTATC
GTCGCGTTGA TCCCGGAAGA GCTGGTGCCT GCCGTACAGC AAGCTGTCGC TGAACAATAT
GAAGCAAAAA CAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA
CAGTGCTGA
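
The GC content reported in the Gene Information is presumably the percentage of G and C bases in this sequence; the record does not say how it was computed or rounded. A minimal Python sketch of that calculation, with a hypothetical function name, could look like this. It strips the spaces and line breaks used to display the sequence in blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring display whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as a small example.
print(gc_percent("ATGAGTCTGA AAGAAAAAAC"))
```

Passing the full gene sequence instead of the two-block fragment would give the value to compare with the listed GC content.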
 
Protein sequence
MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRKVRVMAAD YENQLDEFSL DAPIVAHENY QWANYVRGVV KHLQLRNNSF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM GELMAESHAS
MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY
EAKTGIKETF YVCKPSQGAG QC
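
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, not the database's own method. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; alternative start codons are not handled here, which is an accepted simplification because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGTCTGA AAGAAAAAAC ACAATCTCTG"))  # -> MSLKEKTQSL
```

The ten residues produced from the first 30 bases match the first block of the protein sequence above, and the last three codons of the gene (CAG TGC TGA) translate to Q, C and a stop, matching the protein's final two residues.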