Gene SNSL254_A0838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0838 
SymbolgalK 
ID6486081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp842724 
End bp843872 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content57% 
IMG OID642736250 
Productgalactokinase 
Protein accessionYP_002040010 
Protein GI194446739 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGA AAGAGAAAAC GCGCGCGCTG TTTGCTGAAA TTTTCGGCTA CCCTGCTACC 
CACACGATTC AGGCGCCAGG CCGCGTCAAT CTGATCGGCG AGCACACTGA TTACAATGAT
GGTTTTGTTC TGCCCTGCGC TATCGATTAC CAGACCGTAA TTAGCTGTGC GCCGCGCGAC
GATCGTACCG TACGGGTGAT TGCCGCCGAT TACGACAATC AGGTGGACGA ATTTTCACTG
GATGCGCCGA TCGTGACCCA CGATAGCCAG CAGTGGTCTA ACTATGTGCG CGGCGTAGTG
AAACACCTGC AACAGCGTAA CAACGCGTTT GGCGGCGTGG ATATGGTCAT CAGCGGCAAT
GTGCCGCAGG GCGCCGGGTT AAGCTCCTCC GCCTCGCTGG AAGTGGCGGT GGGCACAGTC
TTCCAGCAGC TTTATCACCT GCCGCTGGAC GGCGCGCAAA TTGCGCTCAA CGGACAAGAG
GCCGAGAACC AGTTTGTCGG CTGTAACTGC GGCATTATGG ATCAGCTCAT CTCTGCGCTC
GGCAAAAAAG ATCATGCGTT GCTGATTGAT TGCCGTACGC TCGGCGCCAA AGCGGTTTCC
ATGCCGAAAG GTGTCGCCGT GGTGATCATC AACAGTAACT TTAAGCGCAC GCTGGTAGGC
AGCGAGTATA ATACCCGTCG TGAACAGTGC GAAACCGGCG CCCGTTTCTT CCAGCAGCCG
GCCCTGCGCG ATGTCAGCCT TGAGGCGTTC AATGCCGTCG CCAGCGAACT GGACCCGGTA
GTCGCAAAAC GCGTTCGCCA TGTATTGAGC GAAAATGCGC GCACCGTTGA AGCGGCAAGC
GCGCTGGAGA AAGGTGATTT GCAACGTATG GGCCAACTGA TGGCGGAGTC CCATGCCTCA
ATGCGCGATG ATTTCGAAAT TACCGTCCCG CAGATAGACA CGCTGGTAGA CATTGTCAAA
GCGACCATCG GCGATCGAGG CGGCGTGCGC ATGACCGGCG GCGGCTTTGG CGGCTGTGTT
GTCGCACTGA TCCCGGAAGA TCTGGTTCCC GCTGTTCGGC AGGCCGTTGC GCAACAGTAC
GAAGCGAAAA CCGGAATCAA AGAAACCTTT TATGTATGCA AACCGTCACA AGGAGCAGGA
CAGTGCTAA
 
Protein sequence
MNLKEKTRAL FAEIFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRTVRVIAAD YDNQVDEFSL DAPIVTHDSQ QWSNYVRGVV KHLQQRNNAF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV FQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRTLGAKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVSLEAF NAVASELDPV VAKRVRHVLS ENARTVEAAS ALEKGDLQRM GQLMAESHAS
MRDDFEITVP QIDTLVDIVK ATIGDRGGVR MTGGGFGGCV VALIPEDLVP AVRQAVAQQY
EAKTGIKETF YVCKPSQGAG QC