Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0912 |
Symbol | bglX |
ID | 6145851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 918691 |
End bp | 920988 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615800 |
Product | beta-glucosidase, periplasmic |
Protein accession | YP_001742992 |
Protein GI | 170681058 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.611673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGTTTAAT CAGCGTCGGC CCGGATAATC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGGCAGGT TGGGGCGATT TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CGGTGAAAAC AGTCGGGCGT GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC ACCTCAATAA TGGGCAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGT ACGCCAGCCA CCTCCGACTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT AAAGGCATCA CCGTTTCCGA TCACGGCGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG GCAGACCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GCATCAACAT GAGTATGAGC GACGAGTATT ACTCGAAGTA TCTGCCTGGG TTGATCAAAT CCGGCAAAGT GACGATGGAA GAGCTGGATG ACGCTGCCCG TCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC GACCCGTACA GCCATCTCGG TCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAAC CGCCTGCACC GCAAAGAAGC GCGTGAAGTG GCACGCGAAA GCCTGGTGTT GCTGAAAAAC CGTCTCGAAA CGTTACCGCT GAAAAAATCA GCCACCATTG CGGTGGTTGG CCCGCTGGCA GACAGCAAGC GTGACGTGAT GGGAAGCTGG TCGGCAGCAG GTGTCGCCGA TCAATCTGTT ACTGTGCTAA CAGGGATTAA AAACGCCGTC GGTGAAAACG GTAAAGTGCT GTACGCCAAA GGGGCGAACG TCACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAGGCG GTCAAAGTCG ACCCGCGCTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA AACCGCGAAG CAATCTGATG TGGTGGTGGC TGTGGTCGGT GAAGCTCAGG GGATGGCGCA CGAGGCCTCC AGCCGTACCG ATATCACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC ACCGGTAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA GATCAGCAGG CGGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA ATTGCCGATG TATTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCGAT GTCCTTCCCG CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGTCATCTGA ATACCGGTCG CCCGTATAAT GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG CCGACCATGA AGCGTGACGG CAAAGTGACC GCCAGCGTGC AGGTGACGAA CACCGGTAAG CGCGAAGGGG CGACGGTAGT TCAGATGTAC CTGCAGGATG TGACGGCTTC CATGAGTCGC CCGGTAAAAC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AACCGGGCGA AACCCAGACC GTCAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC GAGTTTGAGT TGCTGTAA
|
Protein sequence | MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL TSIMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTME ELDDAARHVL NVKYDMGLFN DPYSHLGPKE SDPVDTNAEN RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL
|
| |