Gene YpAngola_A1411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1411 
SymbolgalK 
ID5799878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1466069 
End bp1467220 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content51% 
IMG OID641339367 
Productgalactokinase 
Protein accessionYP_001605931 
Protein GI162421746 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTAA AACAACATAC CCAGACTATT TTCCGCCAAC AGTTTGACCG CGAGTCTGAC 
ATCACCATTA AAGCGCCGGG CCGCGTCAAT CTGATTGGCG AACATACCGA CTATAACGAT
GGCTTTGTTC TGCCCTGCGC CATTAATTAT GAAACGGTGA TCAGTTGTGG CAAACGCGAC
GATCGCCAGA TTCGTGTTAT TGCCGCCGAC TATGAAAACC AGCAGGATAT ATTCTCTCTT
GATGCACCGA TTGTCCCGCA TCCTGAATAT CGCTGGGCTG ACTACGTGCG TGGTGTGGTG
AAACATCTAC AAATGCGCAA CGCTGATTTT GGTGGGGCCG ATCTGGTTAT CTGTGGCAAT
GTCCCGCAGG GTGCTGGCCT CAGTTCCTCT GCATCGTTGG AAGTGGCCGT GGGCCAAGCC
CTGCAATCAC TCTATCAACT CCCTCTTAGC GGTGTAGAAC TGGCGCTGAA TGGGCAAGAG
GCAGAAAACC AATTTGTCGG CTGTAACTGC GGCATTATGG ATCAGTTAAT CTCAGCATTG
GGTAAAAAAG ACCATGCGTT GCTGATTGAT TGTCGGACCT TGGAAACCCG TGCCGTGCCA
ATGCCGGAAA ACATGGCCGT CGTTATTATC AACTCAAACA TTCAACGTGG CCTGGTTGAC
AGCGAATACA ATACTCGCCG CCAACAGTGT GAAGCTGCCG CCCGTTTCTT TGGCGTCAAA
GCATTGCGTG ATGTCGAACC GAGCCTCTTC TTCTCAATAC AAGACGAGCT AGATCCGGTC
GTCGCTAAAC GCGCCCGCCA TGTGATCAGC GAGAATGCAC GCACGCTGGC AGCCGCAGAT
GCCTTGGCCG CCGGGAACTT GAAATTGATG GGGCAATTGA TGCAAGAGTC TCATATTTCT
ATGCGTGATG ACTTTGAGAT CACGGTTCCA CCAATAGATA GACTCGTCGA GATTGTGAAA
TCAGTGATTG GTGATCAAGG TGGGGTGCGC ATGACGGGTG GCGGTTTTGG CGGTTGTATT
ATCGCGTTAA TGCCGCTTGA ATTAGTCGAG CAGGTTCGCA CCACCGTTGC GCAAGAATAC
CCGGCACACA GCGGCGGCAA GAAAGAGACT TTTTATGTCT GTCAGGCTTC ACAAGGAGCG
GGTTTATGCT GA
 
Protein sequence
MSLKQHTQTI FRQQFDRESD ITIKAPGRVN LIGEHTDYND GFVLPCAINY ETVISCGKRD 
DRQIRVIAAD YENQQDIFSL DAPIVPHPEY RWADYVRGVV KHLQMRNADF GGADLVICGN
VPQGAGLSSS ASLEVAVGQA LQSLYQLPLS GVELALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRTLETRAVP MPENMAVVII NSNIQRGLVD SEYNTRRQQC EAAARFFGVK
ALRDVEPSLF FSIQDELDPV VAKRARHVIS ENARTLAAAD ALAAGNLKLM GQLMQESHIS
MRDDFEITVP PIDRLVEIVK SVIGDQGGVR MTGGGFGGCI IALMPLELVE QVRTTVAQEY
PAHSGGKKET FYVCQASQGA GLC