Gene Information |
Locus tag | ECH74115_3257 |
Symbol | bglX |
ID | 6970310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2989968 |
End bp | 2992265 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387070 |
Product | beta-glucosidase, periplasmic |
Protein accession | YP_002271534 |
Protein GI | 209399209 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
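The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based, inclusive positions and that the Gene Length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2989968
END_BP = 2992265
GENE_LENGTH_BP = 2298
PROTEIN_LENGTH_AA = 765


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the fields agree: the span 2989968..2992265 covers 2298 bp, which is 766 codons, or 765 residues once the stop codon is excluded.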
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.00621974 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG GATGATTTAT TCGGTAACCA TCCATTAACG CCTGAAGCGC GGGATGCGTT CGTCACCGAA CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTCGGC CCGGATAACC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCTATT TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CAGTAAAAAC GGTCGGACGT GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC ACCTCAATAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGT TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC ACGCCAGCCA CCTCCGACTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG GCAGATCCGG AAGATGCAGT GCGCGTGGCG CTGAAATCCG GAATCAACAT GAGCATGAGC GACGAGTACT ACTCGAAGTA TCTGCCTGGG TTGATCAAAT CCGGTAAAGT GACGATGGCA GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC GACCCATACA GCCATCTCGG TCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC CGCCTGCACC GCAAAGAAGC GCGTGAAGTG GCGCGTGAAA GCCTGGTGTT GCTGAAAAAC CGTCTCGAAA CGTTACCGCT GAAAAAATCA GCCACCATTG CGGTGGTTGG GCCACTGGCG GACAGTAAAC GTGACGTGAT GGGCAGCTGG TCGGCGGCAG GTGTTGCCGA TCAATCCGTG ACCGTGCTGA CCGGGATTAA AAATGCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG GTCAAAGTCG ATCCGCGTTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACGGCGAAA CAATCTGATG TGGTGGTGGC TGTAGTCGGT GAAGCACAGG GGATGGCGCA CGAGGCCTCC AGCCGTACCG ATATTACTAT TCTGCAAAGC CAGCGTGATT TGATTGCCGC CCTGAAAGCC ACCGGCAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCCG GGACTGAAGG CGGTAATGCA ATTGCCGATG TGTTGTTTGG CGATTACAAC CCGTCCGGTA AGCTGCCGAT GTCCTTCCCG 
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ACACCGGTCG TCCGTATAAT GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG CCGACCATGA AGCGTGACGG CAAAGTGACG GCCAGCGTGC AGGTGACGAA CACCGGTAAA CGCGAAGGGG CCACGGTAGT GCAGATGTAC TTGCAGGATG TGACGGCTTC CATGAGCCGT CCAGTGAAGC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AGCCGGGCGA AACCCAGACC GTCAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC GAGTTTGAGT TGCTGTAA
Protein sequence | MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL TSIMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTMA ELDDAARHVL NVKYDMGLFN DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITILQS QRDLIAALKA TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL