Gene EcSMS35_0780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0780 
SymbolgalK 
ID6145907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp780841 
End bp781989 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content53% 
IMG OID641615668 
Productgalactokinase 
Protein accessionYP_001742860 
Protein GI170679731 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT 
CACACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACCGA CTACAACGAC
GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGTTGTGC ACCACGCGAT
GACCGTAAAG TTCGCGTGAT GGCAGCCGAT TATGAAAATC AGCTCGATGA GTTTTCCCTC
GATGCGCCCA TTGTCGCACA TGAAAGCTAT CAATGGGCTA ACTACGTTCG TGGCGTGGTG
AAACATCTGC AACTGCGTAA CAACAACTTC GGCGGTGTGG ACATGGTGAT CAGCGGCAAT
GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA
TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA TGGTCAGGAA
GCAGAAAACC AGTTTGTAGG CTGTAACTGT GGAATCATGG ATCAACTGAT TTCCGCGCTC
GGCAAGAAAG ATCATGCCTT GCTGATCGAT TGCCGCTCAC TGGGGACCAA AGCTGTTTCC
ATGCCGAAAG GCGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC
AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA
GCGCTGCGCG ATGTCACCAT TGAAGAGTTC AACGCTGTTG CGCATGAACT GGACCCAATC
GTGGCGAAAC GCGTGCGTCA TATCCTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC
GCGCTGGAGC AAGGCGACCT GAAACGTATG GGCGAGTTGA TGGCGGAGTC TCATGCCTCT
ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA
GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGCATC
GTCGCGCTGA TCCCGGAAGA GCTGGTGCCT GCAGTACAGC AAGCTGTCGC TGAACAATAT
GAAGCAAAAA CAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA
CAGTGCTGA
 
Protein sequence
MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRKVRVMAAD YENQLDEFSL DAPIVAHESY QWANYVRGVV KHLQLRNNNF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM GELMAESHAS
MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY
EAKTGIKETF YVCKPSQGAG QC