Gene YpsIP31758_2857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2857 
SymbolgalK 
ID5385326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3226210 
End bp3227361 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content51% 
IMG OID640865849 
Productgalactokinase 
Protein accessionYP_001401820 
Protein GI153948037 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAA AACAACATAC CCAGACTATT TTCCGCCAAC AGTTTGACCG CGAGTCTGAC 
ATCACCATTA AAGCGCCGGG CCGCGTCAAT CTGATTGGCG AACATACCGA CTATAACGAT
GGCTTTGTTC TGCCCTGCGC CATTAATTAT GAAACGGTGA TCAGTTGTGG CAAACGCGAC
GATCGCCAGA TTCGTGTTAT TGCCGCCGAC TATGAAAACC AGCAGGATAT ATTCTCTCTT
GATGCACCGA TTGTCCCACA TCCTGAATAT CGCTGGGCTG ACTACGTGCG TGGTGTGGTG
AAACATCTAC AAATGCGCAA CGCTGATTTT GGTGGGGCCG ATCTGGTTAT CTGTGGCAAT
GTCCCGCAGG GTGCTGGCCT CAGTTCCTCT GCATCGTTGG AAGTGGCCGT GGGCCAAGCC
CTGCAATCAC TCTATCAACT CCCTCTTAGC GGTGTAGAAC TGGCGCTGAA TGGGCAAGAG
GCAGAAAACC AATTTGTCGG CTGTAACTGC GGCATTATGG ATCAGTTAAT CTCAGCATTG
GGTAAAAAAG ACCATGCGTT GCTGATTGAT TGTCGGACCT TGGAAACCCG TGCCGTGCCA
ATGCCGGAAA ACATGGCCGT CGTTATTATC AACTCAAACA TTCAACGTGG CCTGGTTGAC
AGCGAATACA ATACTCGCCG CCAACAGTGT GAAGCTGCCG CCCGTTTCTT TGGCGTCAAA
GCATTGCGTG ATGTCGAACC GAGCCTCTTC TTCTCAATAC AAGACGAGCT AGATCCGGTC
GTCGCTAAAC GCGCCCGCCA TGTGATCAGC GAGAATGCAC GCACGCTGGC AGCCGCAGAT
GCCTTGGCCG CCGGGAACTT GAAATTGATG GGGCAATTGA TGCAAGAGTC TCATATTTCT
ATGCGTGATG ACTTTGAGAT CACGGTTCCA CCAATAGATA GACTCGTCGA GATTGTGAAA
TCAGTGATTG GTGATCAAGG TGGGGTGCGC ATGACGGGTG GCGGTTTTGG CGGTTGTATT
ATCGCGTTAA TGCCGCTTGA ATTAGTCGAG CAGGTTCGCA CCACCGTTGC GCAAGAATAC
CCGGCACACA GCGGCGGCAA GAAAGAGACT TTTTATGTCT GTCAGGCTTC ACAAGGAGCG
GGTTTATGCT GA
 
Protein sequence
MSLKQHTQTI FRQQFDRESD ITIKAPGRVN LIGEHTDYND GFVLPCAINY ETVISCGKRD 
DRQIRVIAAD YENQQDIFSL DAPIVPHPEY RWADYVRGVV KHLQMRNADF GGADLVICGN
VPQGAGLSSS ASLEVAVGQA LQSLYQLPLS GVELALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRTLETRAVP MPENMAVVII NSNIQRGLVD SEYNTRRQQC EAAARFFGVK
ALRDVEPSLF FSIQDELDPV VAKRARHVIS ENARTLAAAD ALAAGNLKLM GQLMQESHIS
MRDDFEITVP PIDRLVEIVK SVIGDQGGVR MTGGGFGGCI IALMPLELVE QVRTTVAQEY
PAHSGGKKET FYVCQASQGA GLC