Gene SeHA_C0901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0901 
SymbolgalK 
ID6489304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp886641 
End bp887789 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content57% 
IMG OID642741149 
Productgalactokinase 
Protein accessionYP_002044802 
Protein GI194451386 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGA AAGAGAAAAC GCGCGCGCTG TTTGCTGAAA TTTTCGGCTA CCCTGCCACC 
CACACGATTC AGGCGCCAGG CCGCGTCAAT CTGATCGGCG AGCACACTGA TTACAATGAT
GGTTTTGTTC TGCCCTGCGC TATCGATTAC CAGACCGTAA TTAGCTGTGC GCCGCGCGAC
GATCGTACCG TACGGGTGAT TGCCGCCGAT TACGACAATC AGGTGGACGA ATTTTCACTG
GATGCGCCGA TCGTGACCCA CGATAGCCAG CAGTGGTCTA ACTATGTGCG CGGCGTAGTG
AAACACCTGC AACAGCGTAA CAACGCGTTT GGCGGCGTGG ATATGGTCAT CAGCGGCAAT
GTGCCGCAGG GCGCCGGGTT AAGCTCCTCC GCCTCGCTGG AAGTGGCGGT GGGCACCGTC
TTCCAGCAGC TTTATCACCT GCCGCTGGAC GGCGCGCAAA TTGCGCTCAA CGGACAAGAG
GCCGAGAACC AGTTTGTCGG CTGTAACTGC GGCATTATGG ATCAGCTCAT CTCTGCGCTC
GGCAAAAAAG ATCATGCGTT GCTGATTGAT TGCCGTACGC TCGGCGCCAA AGCGGTTTCC
ATGCCGAAAG GTGTCGCCGT GGTGATCATC AACAGTAACT TTAAGCGCAC GCTGGTGGGC
AGCGAGTATA ATACCCGCCG TGAACAGTGC GAAACCGGCG CCCGTTTCTT CCAGCAGCCG
GCCCTGCGCG ATGTCAGCCT TGAGGCGTTC AATGCCGTTG CCAGCGAACT GGACCCGGTA
GTCGCAAAAC GCGTTCGCCA TGTATTGAGC GAAAATGCGC GCACCGTTGA AGCGGCAAGC
GCGCTGGAGA AAGGTGATTT GCAACGTATG GGCCAACTGA TGGCGGAGTC CCATGCCTCA
ATGCGCGATG ATTTCGAAAT TACCGTCCCG CAGATAGACA CGCTGGTAGA CATCGTCAAA
GCGACCATCG GCGATCAAGG CGGCGTGCGC ATGACCGGCG GCGGCTTCGG CGGCTGTGTT
GTCGCACTGA TCCCGGAAGA TCTGGTTCCC GCTGTTCGGC AGGCCGTTGC GCAACAGTAC
GAAGCGAAAA CCGGAATCAA AGAAACCTTT TATGTATGCA AACCGTCACA AGGAGCAGGA
CAGTGCTAA
 
Protein sequence
MNLKEKTRAL FAEIFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRTVRVIAAD YDNQVDEFSL DAPIVTHDSQ QWSNYVRGVV KHLQQRNNAF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV FQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRTLGAKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVSLEAF NAVASELDPV VAKRVRHVLS ENARTVEAAS ALEKGDLQRM GQLMAESHAS
MRDDFEITVP QIDTLVDIVK ATIGDQGGVR MTGGGFGGCV VALIPEDLVP AVRQAVAQQY
EAKTGIKETF YVCKPSQGAG QC