Gene SAG0471 details


Gene Information

Locus tag: SAG0471
Symbol: glk
ID: 1013274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 484171
End bp: 485139
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 41%
IMG OID: 637315673
Product: glucokinase
Protein accession: NP_687501
Protein GI: 22536650
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
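
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 484171
end_bp = 485139

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed 969 bp and 322 aa.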


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAGA AATTATTGGG AATTGACCTC GGAGGAACGA CCATTAAATT TGGTATCTTG 
ACGCTTGAGG GAGAAGTACA AGAAAAATGG GCAATTGAGA CCAATACTTT AGAAAACGGA
AGACATATCG TTTCTGATAT CGTTGAATCT CTCAAACATC GTTTGAGCCT CTATGGATTA
ACAAAAGATG ACTTTCTCGG TATCGGTATG GGTTCTCCAG GAGCTGTTGA TAGAACTAGT
AAAACAGTAA CAGGTGCTTT TAATCTAAAT TGGGCTGATA CTCAAGAAGT AGGTTCAGTT
ATTGAAAAAG AAGTTGGAAT TCCATTTTTT ATTGATAACG ATGCTAATGT TGCAGCACTT
GGTGAACGCT GGGTAGGTGC TGGTGCCAAT AATCCCGACG TTGTTTTCGT AACCCTCGGA
ACAGGAGTAG GTGGAGGTGT TATCGCAGAT GGTAACCTCA TCCATGGTGT TGCAGGAGCA
GGTGGAGAAA TTGGGCATAT GATTGTTGAT CCAGAAAATG GATTTACGTG CACATGTGGT
AACAAAGGCT GCCTTGAGAC AGTTGCATCA GCGACAGGTG TTGTTAGAGT AGCACGTCAA
CTCGCAGAAC AATATGAGGG TTCGTCTGCC ATTAAAGCAG CGATTGACAA CGGTGATACT
GTTACAAGTA AAGATATTTT TATAGCAGCA GAAGATGGGG ATAAATTTGC TAATTCTGTT
GTTGAACGTG TATCACGTTA CCTTGGACTG GCAGCAGCTA ATATTTCAAA TATTTTAAAC
CCTGATTCTG TGGTTATTGG TGGCGGTGTC TCAGCAGCAG GTGAATTTTT ACGTAGTCGC
GTTGAGAAAT ACTTTGTCAC ATTTGCTTTC CCACAAGTTA AAAAGTCAAC TAAAATTAAG
ATTGCTGAAC TAGGTAATGA TGCTGGTATT ATTGGTGCAG CAAGCTTAGC CAATCAACAA
GCAAGTTAA
 
Protein sequence
MSKKLLGIDL GGTTIKFGIL TLEGEVQEKW AIETNTLENG RHIVSDIVES LKHRLSLYGL 
TKDDFLGIGM GSPGAVDRTS KTVTGAFNLN WADTQEVGSV IEKEVGIPFF IDNDANVAAL
GERWVGAGAN NPDVVFVTLG TGVGGGVIAD GNLIHGVAGA GGEIGHMIVD PENGFTCTCG
NKGCLETVAS ATGVVRVARQ LAEQYEGSSA IKAAIDNGDT VTSKDIFIAA EDGDKFANSV
VERVSRYLGL AAANISNILN PDSVVIGGGV SAAGEFLRSR VEKYFVTFAF PQVKKSTKIK
IAELGNDAGI IGAASLANQQ AS
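
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last blocks of the gene. This is a minimal sketch, not the database's own method: the `translate` helper and the variable names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits).

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[p:p + 3]] for p in range(0, usable, 3))


# First 60 nt and last 9 nt of the gene sequence listed above. The last
# block is in frame because the 960 nt preceding it are a multiple of 3.
first_block = "ATGAGTAAGA AATTATTGGG AATTGACCTC GGAGGAACGA CCATTAAATT TGGTATCTTG"
last_block = "GCAAGTTAA"

n_terminus = translate(first_block)
c_terminus = translate(last_block)
print(n_terminus)
print(c_terminus)
```

The first block should reproduce the first 20 residues of the protein sequence, and the last block its final two residues followed by the stop codon.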