Gene BAS4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4165 
Symbol 
ID2851989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4080937 
End bp4081920 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content40% 
IMG OID637507401 
Productglucokinase 
Protein accessionYP_030414 
Protein GI49187162 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.131693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAGA AATGGTTAGT TGGTGTTGAC CTTGGTGGTA CAACGATTAA ATTAGCATTT 
ATTAATGTGT ACGGTGAAAT TTTACATAAG TGGGAAATCC CTACGAATAC AAATGAGCAA
GGAAAACATA TTACACTTGA TGTAGCGAAA GCTATTGATA AAAAGTTAGA AGAGTTAGGT
GAATTAAAAA GTAAGTTAAT CGGTATTGGT ATGGGAGCTC CTGGTCCTGT ACACGTGGCG
TCTGGAATGA TTTATGAAGC GGTTAACTTA GGATGGAAAA ACTATCCGTT AAAAGATTTA
TTAGAAGTAG AAACAGGATT ACCTGTTGTT ATTGATAATG ATGCAAATTT AGCGGCACTT
GGTGAAATGT GGAAGGGTGC TGGTGAAGGA GCAAAAGATT TAATTTGTAT GACACTTGGC
ACTGGTGTTG GCGGCGGTGT AATTGCCAAT GGTGAGATTG TACATGGCGT AAGCGGTGCT
GCTGGTGAGA TTGGACACAT TACAGTCGTT ACAGAGAATG CTTTCCCATG TAATTGCGGG
AAGTCTGGTT GCCTAGAAAC TGTAGCATCT GCAACAGGTA TTGTACGTGT TGCTATGCAG
AAAATACAAG AGACTGATAA AGAAAGTATT CTACGTTCTA TGTTAGCAGA AGAAGGGCGT
ATTACATCAA AAGACGTATT TGAAGCACAT GGACAAGGTG ATGAACTAGC AGGCGAAGTA
GTAGAAAAGG TAGCTTCTTA TTTAGGATTA GCTGTAGCGA ACCTTTCGAG CACGTTGAAT
CCAGAGAAAA TTGTTATTGG TGGAGGCGTT TCTAAAGCTG GAGATGCACT ATTAGAACCA
ATTCAACGTT ATTTCGAGCA ATACGCTTTC TCACGTGCTG TAAAGAGCAC GAAGTTAGCT
ATTGCAACAC TTGGTAATGA TGCAGGTGTT ATCGGAGGAG CTTGGCTTGT AAAAAAGCAC
AAATACGAAG CGAAGATGAT ATAA
 
Protein sequence
MEEKWLVGVD LGGTTIKLAF INVYGEILHK WEIPTNTNEQ GKHITLDVAK AIDKKLEELG 
ELKSKLIGIG MGAPGPVHVA SGMIYEAVNL GWKNYPLKDL LEVETGLPVV IDNDANLAAL
GEMWKGAGEG AKDLICMTLG TGVGGGVIAN GEIVHGVSGA AGEIGHITVV TENAFPCNCG
KSGCLETVAS ATGIVRVAMQ KIQETDKESI LRSMLAEEGR ITSKDVFEAH GQGDELAGEV
VEKVASYLGL AVANLSSTLN PEKIVIGGGV SKAGDALLEP IQRYFEQYAF SRAVKSTKLA
IATLGNDAGV IGGAWLVKKH KYEAKMI