Gene ECD_02062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02062 
SymbolbglX 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2114596 
End bp2116893 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content54% 
IMG OID 
Productbeta-D-glucoside glucohydrolase, periplasmic 
Protein accessionACT43886 
Protein GI253978216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG 
GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA
CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTAGGC
CCGGATAATC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCGATT
TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC
CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC
CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CGGTGAAAAC GGTCGGGCGT
GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT
GTCTCGCGCG ATCCGCGCTG GGGACGTGCC TCCGAAGGTT TTGGCGAAGA TACGTATCTC
ACCTCAACAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC
TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA
GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC
AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC
ACGCCAGCCA CCTCCGATTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGTTTT
AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG
GCAGATCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GCATCAACAT GAGCATGAGC
GACGAGTACT ACTCGAAGTA TCTGCCTGGG TTGATCAAAT CCGGCAAAGT GACGATGGAA
GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG ATTGTTTAAC
GACCCATACA GCCATTTGGG GCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC
CGCCTGCACC GTAAAGAAGC GCGTGAAGTG GCGCGCGAAA GCCTGGTGTT GCTGAAAAAC
CGTCTCGAAA CGTTACCGCT GAAAAAATCG GCCACCATTG CGGTGGTTGG GCCACTGGCG
GACAGTAAAC GTGACGTGAT GGGCAGCTGG TCCGCAGCCG GTGTTGCCGA TCAATCCGTG
ACCGTACTGA CCGGGATTAA AAATGCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA
GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG
GTCAAAGTCG ATCCGCGCTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACTGCGAAA
CAATCTGATG TGGTGGTGGC TGTGGTCGGT GAAGCACAGG GGATGGCGCA CGAGGCCTCC
AGCCGTACCG ATATCACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC
ACCGGTAAAC CGCTGGTGCT GGTGCTTATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA
GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA
ATTGCCGATG TGTTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCGAT GTCCTTCCCG
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ACACCGGTCG TCCGTATAAT
GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT
CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG
CCGACCATGA AGCGTGACGG CAAAGTGACG GCCAGCGTGC AGGTGACGAA CACCGGTAAG
CGCGAGGGTG CCACGGTAGT GCAGATGTAC TTGCAGGATG TGACGGCTTC CATGAGTCGC
CCTGTGAAAC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AGCCGGGCGA AACTCAGACC
GTCAGCTTCC CGATTGATAT TGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC
GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC
GAGTTTGAGT TGCTGTAA
 
Protein sequence
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 
PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF
PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL
TSTMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY
KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTME ELDDAARHVL NVKYDMGLFN
DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA
DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA
VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA
TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA
PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT
VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL