Gene EcHS_A2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2267 
SymbolbglX 
ID5595000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2262090 
End bp2264387 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content54% 
IMG OID640921396 
Productbeta-glucosidase, periplasmic 
Protein accessionYP_001458932 
Protein GI157161614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG 
GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA
CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTCGGC
CCGGATAACC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCGATT
TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC
CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC
CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CAGTGAAAAC GGTCGGACGT
GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT
GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC
ACCTCAACAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC
TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA
GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC
AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC
ACGCCAGCCA CCTCCGATTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT
AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG
GCAGACCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GAATCAACAT GAGCATGAGC
GACGAGTACT ACTCGAAGTA TCTGCCTGGG TTGATTAAAT CCGGCAAAGT GACGATGGCA
GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC
GACCCATACA GCCATTTGGG GCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC
CGCCTGCACC GTAAAGAAGC GCGTGAAGTG GCGCGCGAAA GCTTGGTGTT GCTGAAAAAC
CGTCTCGAAA CGTTACCGCT GAAAAAATCG GCCACCATTG CGGTGGTTGG GCCACTGGCG
GACAGTAAAC GTGACGTGAT GGGCAGCTGG TCCGCAGCCG GTGTTGCCGA TCAATCCGTG
ACCGTACTGA CCGGGATTAA AAATGCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA
GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG
GTCAAAGTCG ATCCGCGTTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACGGCGAAA
CAATCTGATG TGGTGGTGGC TGTAGTCGGT GAAGCACAGG GGATGGCGCA CGAAGCCTCC
AGCCGGACCG ATATCACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC
ACCGGTAAAC CGCTGGTGCT GGTGCTTATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA
GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA
ATTGCCGATG TGTTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCGAT GTCCTTCCCG
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ACACCGGTCG TCCGTATAAT
GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT
CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG
CCGACCATGA AGCGTGACGG CAAAGTGACG GCCAGCGTGC AGGTGACGAA CACCGGTAAG
CGCGAGGGTG CCACGGTAGT GCAGATGTAC TTGCAGGATG TGACCGCTTC CATGAGTCGC
CCTGTGAAAC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AGCCGGGCGA AACTCAGACC
GTCAGCTTCC CGATTGATAT TGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC
GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC
GAGTTTGAGT TGCTGTAA
 
Protein sequence
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 
PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF
PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL
TSTMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY
KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTMA ELDDAARHVL NVKYDMGLFN
DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA
DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA
VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA
TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA
PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT
VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL