Gene SbBS512_E0841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0841 
SymbolbglX 
ID6270415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp787566 
End bp789863 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content54% 
IMG OID641725013 
Productbeta-glucosidase, periplasmic 
Protein accessionYP_001879540 
Protein GI187734005 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG 
GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA
CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTCGGC
CCGGATAACC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCGATT
TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC
CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC
CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CAGTGAAAAC GGTCGGACGT
GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT
GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC
ACCTCAACAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC
TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA
GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC
AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC
ACGCCAGCCA CCTCCGATTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT
AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG
GCAGACCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GAATCAACAT GAGTATGAGC
GACGAGTACT ACTCGAAGTA TCTGCCTGGG TTGATTAAAT CCGGTAAAGT GACGATGGCA
GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC
GACCCATACA GCCATCTCGG TCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC
CGCTTGCACC GCAAAGAAGC GCGTGAAGTG GCGCGTGAAA GCCTGGTGTT GCTGAAAAAC
CGTCTCGAAA CGTTACCGCT GAAAAAATCC GCAACCATTG CGGTGGTTGG CCCGCTGGCA
GACAGCAAGC GTGACGTGAT GGGAAGCTGG TCGGCGGCAG GTGTTGCAGA TCAATCCGTG
ACCGTGCTGA CCGGGATTAA AAATGCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA
GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG
GTCAAAGTCG ATCCGCGTTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACGGCGAAA
CAATCTGATG TGGTGGTGGC TGTAGTCGGT GAAGCACAGG GGATGGCGCA CGAGGCCTCC
AGCCGTACCG ATATTACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC
ACCGGTAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA
GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA
ATTGCCGATG TGTTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCGAT GTCCTTCCCG
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ACACCGGTCG TCCGTATAAT
GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT
CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG
CCGACCATGA AGCGTGACGG CAAAGTGACG GCCAGCGTGC AGGTGATGAA CACCGGTAAA
CGCGAAGGGG CCACGGTAGT GCAGATGTAC TTGCAGGATG TGACGGCTTC CATGAGTCGT
CCGGTGAAAC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AACCGGGCGA AACTCAGACC
GTTAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC
GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC
GAGTTTGAGT TGCTGTAA
 
Protein sequence
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 
PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF
PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL
TSTMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY
KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTMA ELDDAARHVL NVKYDMGLFN
DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA
DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA
VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA
TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA
PTMKRDGKVT ASVQVMNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT
VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL