Gene ECH74115_3257 details

Gene Information

Locus tag: ECH74115_3257
Symbol: bglX
ID: 6970310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2989968
End bp: 2992265
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 54%
IMG OID: 643387070
Product: beta-glucosidase, periplasmic
Protein accession: YP_002271534
Protein GI: 209399209
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
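
The coordinate and length fields of this record agree with each other, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2989968, 2992265)
print(length_bp)                  # 2298
print(protein_length(length_bp))  # 765
```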


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.00621974
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG 
GATGATTTAT TCGGTAACCA TCCATTAACG CCTGAAGCGC GGGATGCGTT CGTCACCGAA
CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTCGGC
CCGGATAACC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCTATT
TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC
CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC
CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CAGTAAAAAC GGTCGGACGT
GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT
GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC
ACCTCAATAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGT
TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA
GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC
AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC
ACGCCAGCCA CCTCCGACTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT
AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG
GCAGATCCGG AAGATGCAGT GCGCGTGGCG CTGAAATCCG GAATCAACAT GAGCATGAGC
GACGAGTACT ACTCGAAGTA TCTGCCTGGG TTGATCAAAT CCGGTAAAGT GACGATGGCA
GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC
GACCCATACA GCCATCTCGG TCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC
CGCCTGCACC GCAAAGAAGC GCGTGAAGTG GCGCGTGAAA GCCTGGTGTT GCTGAAAAAC
CGTCTCGAAA CGTTACCGCT GAAAAAATCA GCCACCATTG CGGTGGTTGG GCCACTGGCG
GACAGTAAAC GTGACGTGAT GGGCAGCTGG TCGGCGGCAG GTGTTGCCGA TCAATCCGTG
ACCGTGCTGA CCGGGATTAA AAATGCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA
GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG
GTCAAAGTCG ATCCGCGTTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACGGCGAAA
CAATCTGATG TGGTGGTGGC TGTAGTCGGT GAAGCACAGG GGATGGCGCA CGAGGCCTCC
AGCCGTACCG ATATTACTAT TCTGCAAAGC CAGCGTGATT TGATTGCCGC CCTGAAAGCC
ACCGGCAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA
GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCCG GGACTGAAGG CGGTAATGCA
ATTGCCGATG TGTTGTTTGG CGATTACAAC CCGTCCGGTA AGCTGCCGAT GTCCTTCCCG
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ACACCGGTCG TCCGTATAAT
GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT
CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG
CCGACCATGA AGCGTGACGG CAAAGTGACG GCCAGCGTGC AGGTGACGAA CACCGGTAAA
CGCGAAGGGG CCACGGTAGT GCAGATGTAC TTGCAGGATG TGACGGCTTC CATGAGCCGT
CCAGTGAAGC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AGCCGGGCGA AACCCAGACC
GTCAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC
GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC
GAGTTTGAGT TGCTGTAA
 
Protein sequence
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 
PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF
PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL
TSIMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY
KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTMA ELDDAARHVL NVKYDMGLFN
DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA
DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA
VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITILQS QRDLIAALKA
TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA
PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT
VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL
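
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard sense-codon assignments, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example. Only the first and last blocks of the gene sequence are translated here.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, as listed in this record.
first_block = "ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG"
print(translate(first_block))  # MKWLCSVGIAVSLALQPALA

# Final 18 bases of the gene sequence, ending in the TAA stop codon.
last_block = "GAGTTTGAGT TGCTGTAA"
print(translate(last_block))   # EFELL
```

Both outputs match the corresponding ends of the protein sequence listed above.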