Gene EcSMS35_0912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0912 
SymbolbglX 
ID6145851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp918691 
End bp920988 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content54% 
IMG OID641615800 
Productbeta-glucosidase, periplasmic 
Protein accessionYP_001742992 
Protein GI170681058 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.611673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG 
GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA
CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGTTTAAT CAGCGTCGGC
CCGGATAATC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGGCAGGT TGGGGCGATT
TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC
CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC
CCGATTAGCC TCGGTCTGGC CTCGTCTTTT AACCTCGATG CGGTGAAAAC AGTCGGGCGT
GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT
GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC
ACCTCAATAA TGGGCAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC
TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA
GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC
AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGT
ACGCCAGCCA CCTCCGACTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT
AAAGGCATCA CCGTTTCCGA TCACGGCGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG
GCAGACCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GCATCAACAT GAGTATGAGC
GACGAGTATT ACTCGAAGTA TCTGCCTGGG TTGATCAAAT CCGGCAAAGT GACGATGGAA
GAGCTGGATG ACGCTGCCCG TCATGTACTG AACGTTAAAT ATGATATGGG GTTGTTTAAC
GACCCGTACA GCCATCTCGG TCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAAC
CGCCTGCACC GCAAAGAAGC GCGTGAAGTG GCACGCGAAA GCCTGGTGTT GCTGAAAAAC
CGTCTCGAAA CGTTACCGCT GAAAAAATCA GCCACCATTG CGGTGGTTGG CCCGCTGGCA
GACAGCAAGC GTGACGTGAT GGGAAGCTGG TCGGCAGCAG GTGTCGCCGA TCAATCTGTT
ACTGTGCTAA CAGGGATTAA AAACGCCGTC GGTGAAAACG GTAAAGTGCT GTACGCCAAA
GGGGCGAACG TCACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAGGCG
GTCAAAGTCG ACCCGCGCTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA AACCGCGAAG
CAATCTGATG TGGTGGTGGC TGTGGTCGGT GAAGCTCAGG GGATGGCGCA CGAGGCCTCC
AGCCGTACCG ATATCACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC
ACCGGTAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA
GATCAGCAGG CGGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA
ATTGCCGATG TATTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCGAT GTCCTTCCCG
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGTCATCTGA ATACCGGTCG CCCGTATAAT
GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGCTTTAT
CCGTTCGGCT ATGGTCTGAG CTATACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG
CCGACCATGA AGCGTGACGG CAAAGTGACC GCCAGCGTGC AGGTGACGAA CACCGGTAAG
CGCGAAGGGG CGACGGTAGT TCAGATGTAC CTGCAGGATG TGACGGCTTC CATGAGTCGC
CCGGTAAAAC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AACCGGGCGA AACCCAGACC
GTCAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC
GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC
GAGTTTGAGT TGCTGTAA
 
Protein sequence
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 
PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF
PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL
TSIMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY
KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTME ELDDAARHVL NVKYDMGLFN
DPYSHLGPKE SDPVDTNAEN RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA
DSKRDVMGSW SAAGVADQSV TVLTGIKNAV GENGKVLYAK GANVTSDKGI IDFLNQYEEA
VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA
TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA
PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT
VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL