Gene ECH74115_0860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0860 
SymbolgalK 
ID6969669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp876064 
End bp877212 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content53% 
IMG OID643384885 
Productgalactokinase 
Protein accessionYP_002269385 
Protein GI209398960 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.048492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.889348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT 
CATACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACCGA CTACAACGAC
GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGCTGTGC GCCACGCGAT
GACCGTAAAG TTCGCGTAAT GGCAGCCGAT TATGAAAATC AGCTTGATGA GTTTTCCCTT
GATGCGCCCA TTGTCGCGCA TGAAAACTAT CAATGGGCGA ACTACGTTCG TGGCGTGGTG
AAACATCTGC AACTGCGTAA CAACAGCTTC GGCGGTGTGG ACATGGTGAT CAGCGGCAAT
GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA
TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA CGGTCAGGAA
GCAGAAAACC AGTTTGTTGG CTGTAACTGC GGGATCATGG ATCAGCTAAT TTCCGCACTC
GGCAAGAAAG ATCATGCCTT GCTGATTGAC TGTCGCTCAC TGGGGACCAA AGCAGTTTCC
ATGCCGAAAG GTGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC
AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA
GCTCTGCGCG ATGTCACCAT TGAAGAGTTC AACGCTGTTG CACATGAGCT GGACCCAATC
GTGGCGAAAC GCGTGCGTCA TATCCTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC
GCGCTGGAGC AAGGCGACCT GAAACGTATG GGCGAGTTGA TGGCGGAGTC TCATGCCTCT
ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA
GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGTATC
GTCGCGCTGA TCCCGGAAGA GCTGGTGCCT GCCGTACAGC AAGCTGTCGC AGAACAATAT
GAAGCAAAAA CAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA
CAGTGCTGA
 
Protein sequence
MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRKVRVMAAD YENQLDEFSL DAPIVAHENY QWANYVRGVV KHLQLRNNSF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM GELMAESHAS
MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY
EAKTGIKETF YVCKPSQGAG QC