Gene EcHS_A0811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0811 
SymbolgalK 
ID5593700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp815433 
End bp816581 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content53% 
IMG OID640919983 
Productgalactokinase 
Protein accessionYP_001457550 
Protein GI157160232 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT 
CACACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACCGA CTACAACGAC
GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGTTGTGC ACCACGCGAT
GACCGTAAAG TTCGCGTGAT GGCAGCCGAT TATGAAAATC AGCTCGACGA GTTTTCCCTC
GATGCGCCCA TTGTCGCACA TGAAAACTAT CAATGGGCTA ACTACGTTCG TGGCGTGGTG
AAACATCTGC AACTGCGTAA CAACAGCTTC GGCGGCGTGG ACATGGTGAT CAGCGGCAAT
GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA
TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA CGGTCAGGAA
GCAGAAAACC AGTTTGTAGG CTGTAACTGC GGGATCATGG ATCAGCTAAT TTCCGCGCTC
GGCAAGAAAG ATCATGCCTT GCTGATCGAT TGCCGCTCAC TGGGGACCAA AGCAGTTTCC
ATGCCCAAAG GTGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC
AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA
GCCCTGCGTG ATGTCACCAT TGAAGAGTTC AACGCTGTTG CGCATGAACT GGACCCGATC
GTGGCGAAAC GCGTGCGGCA TATCCTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC
GCGCTGGAGC AGGGCGACCT GAAACGTATG AGCGAGTTGA TGGCGGAGTC TCATGCCTCT
ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA
GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGTATC
GTCGCGTTGA TCCCGGAAGA GCTGGTGCCT GCCGTACAGC AAGCTGTCGC TGAACAATAT
GAAGCAAAAA CAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA
CAGTGCTGA
 
Protein sequence
MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRKVRVMAAD YENQLDEFSL DAPIVAHENY QWANYVRGVV KHLQLRNNSF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM SELMAESHAS
MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY
EAKTGIKETF YVCKPSQGAG QC