Gene SbBS512_E0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0678 
SymbolgalK 
ID6269571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp643116 
End bp644264 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content53% 
IMG OID641724874 
Productgalactokinase 
Protein accessionYP_001879407 
Protein GI187732980 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT 
CACACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACGGA CTACAACGAC
GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGCTGTGC ACCACGCGAT
GACCGTAAAG TTCGCGTGAT GGCAGCCGAT TATGAAAATC AGCTCGACGA GTTTTCCCTC
AATGCGCCCA TTGTCGCGCA TGAAAACTAT CAATGGGCGA ACTACGTTCG TGGCGTGGTG
AAACATCTGC AACTGCGTAA CAACAGCTTC GGCGGTGTGG ACATGGTGAT CAGCGGCAAT
GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA
TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA CGGTCAGGAA
GCAGAAAACC AGTTTGTTGG CTGTAACTGC GGGATCATGG ATCAGCTAAT TTCCGCACTC
GGCAAGAAAG ATCATTCCTT GCTGATTGAC TGTCGTTCAC TGGGGACCAA AGCAGTTTCC
ATGCCGAAAG GTGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC
AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA
GCCCTGCGCG ATGTCACCAT TGAAGAGTTC AATGCTGTTG CACATGAGCT GGACCCAATC
GTGGCGAAAC GCGTGCGGCA TATCCTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC
GCGCTGGAGC AGGGCGACCT GAAACGTATG GGCGAGTTGA TGGCGGAGTC TCATGCCTCT
ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA
GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGTATC
GTCGCGTTGA TCCCGGAAGA GCTGGTGCCT GCCGTACAGC AAGCTGTCGC TGAACAATAT
GAAGCAAAAA TAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA
CAGTGCTGA
 
Protein sequence
MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRKVRVMAAD YENQLDEFSL NAPIVAHENY QWANYVRGVV KHLQLRNNSF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHSLLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM GELMAESHAS
MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY
EAKIGIKETF YVCKPSQGAG QC