Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0841 |
Symbol | bglX |
ID | 6270415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 787566 |
End bp | 789863 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725013 |
Product | beta-glucosidase, periplasmic |
Protein accession | YP_001879540 |
Protein GI | 187734005 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTCGGC CCGGATAACC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCGATT TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CAGTGAAAAC GGTCGGACGT GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC ACCTCAACAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC ACGCCAGCCA CCTCCGATTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG GCAGACCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GAATCAACAT GAGTATGAGC GACGAGTACT ACTCGAAGTA TCTGCCTGGG TTGATTAAAT CCGGTAAAGT GACGATGGCA GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC GACCCATACA GCCATCTCGG TCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC CGCTTGCACC GCAAAGAAGC GCGTGAAGTG GCGCGTGAAA GCCTGGTGTT GCTGAAAAAC CGTCTCGAAA CGTTACCGCT GAAAAAATCC GCAACCATTG CGGTGGTTGG CCCGCTGGCA GACAGCAAGC GTGACGTGAT GGGAAGCTGG TCGGCGGCAG GTGTTGCAGA TCAATCCGTG ACCGTGCTGA CCGGGATTAA AAATGCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG GTCAAAGTCG ATCCGCGTTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACGGCGAAA CAATCTGATG TGGTGGTGGC TGTAGTCGGT GAAGCACAGG GGATGGCGCA CGAGGCCTCC AGCCGTACCG ATATTACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC ACCGGTAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA ATTGCCGATG TGTTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCGAT GTCCTTCCCG CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ACACCGGTCG TCCGTATAAT GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG CCGACCATGA AGCGTGACGG CAAAGTGACG GCCAGCGTGC AGGTGATGAA CACCGGTAAA CGCGAAGGGG CCACGGTAGT GCAGATGTAC TTGCAGGATG TGACGGCTTC CATGAGTCGT CCGGTGAAAC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AACCGGGCGA AACTCAGACC GTTAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC GAGTTTGAGT TGCTGTAA
|
Protein sequence | MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL TSTMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTMA ELDDAARHVL NVKYDMGLFN DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA PTMKRDGKVT ASVQVMNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL
|
| |