Gene EcE24377A_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2421 
SymbolbglX 
ID5590165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2402113 
End bp2404410 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content54% 
IMG OID640926083 
Productbeta-glucosidase, periplasmic 
Protein accessionYP_001463478 
Protein GI157154880 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGGC TATGTTCAGT AGGAATCGCG GTGAGTCTGG CCCTGCAGCC AGCACTGGCG 
GATGATTTAT TCGGCAACCA TCCATTAACG CCCGAAGCGC GGGATGCGTT CGTCACCGAA
CTGCTTAAGA AAATGACAGT TGATGAGAAA ATTGGTCAGC TGCGCTTAAT CAGCGTCGGC
CCGGATAACC CGAAAGAGGC GATCCGCGAG ATGATCAAAG ACGGTCAGGT TGGGGCGATT
TTCAACACCG TAACCCGTCA GGATATCCGC GCCATGCAGG ATCAGGTGAT GGAATTAAGC
CGCCTGAAAA TTCCTCTTTT CTTTGCTTAC GACGTGCTGC ACGGTCAGCG CACGGTGTTC
CCGATTAGCC TTGGTCTGGC CTCGTCTTTT AACCTCGATG CAGTGAAAAC GGTCGGACGT
GTCTCTGCTT ATGAAGCGGC AGATGATGGC CTGAATATGA CCTGGGCACC GATGGTCGAT
GTCTCGCGCG ATCCGCGCTG GGGACGTGCT TCCGAAGGTT TTGGCGAAGA TACGTATCTC
ACCTCAACAA TGGGTAAAAC CATGGTGGAA GCGATGCAGG GTAAAAGCCC GGCAGATCGC
TACTCGGTGA TGACCAGCGT CAAACACTTT GCCGCATACG GCGCGGTAGA AGGCGGTAAA
GAGTACAACA CCGTCGATAT GAGTCCGCAG CGCCTGTTTA ATGATTATAT GCCGCCGTAC
AAAGCGGGGC TGGACGCAGG CAGCGGCGCG GTGATGGTGG CGCTGAACTC GCTGAACGGC
ACGCCAGCCA CCTCCGATTC CTGGCTGCTG AAAGATGTTC TGCGCGACCA GTGGGGCTTT
AAAGGCATCA CCGTTTCCGA TCACGGTGCA ATCAAAGAGC TGATTAAACA TGGCACGGCG
GCAGACCCGG AAGATGCGGT GCGCGTGGCG CTGAAATCCG GAATCAACAT GAGTATGAGC
GACGAGTATT ACTCGAAGTA TCTGCCTGGG TTGATCAAAT CCGGCAAAGT GACGATGGAA
GAGCTGGACG ATGCTGCCCG CCATGTACTG AACGTTAAAT ATGATATGGG ATTGTTTAAC
GACCCATACA GCCATTTGGG GCCGAAAGAG TCTGACCCGG TGGATACCAA TGCCGAAAGC
CGCCTGCACC GTAAAGAAGC GCGTGAAGTG GCGCGCGAAA GCCTGGTGTT GCTGAAAAAC
CGTCTCGAAA CGTTACCGCT GAAAAAATCG GCCACCATTG CGGTGGTTGG GCCACTGGCG
GACAGTAAAC GTGACGTGAT GGGCAGCTGG TCCGCAGCCG GTGTTGCCGA TCAATCCGTG
ACCGTACTGA CCGGGATTAA AAATTCCGTC GGTGAAAACG GTAAAGTGCT GTATGCCAAA
GGGGCGAACG TTACCAGTGA CAAAGGCATT ATCGATTTCC TGAATCAGTA TGAAGAAGCG
GTCAAAGTCG ATCCGCGTTC GCCGCAAGAG ATGATTGATG AAGCGGTGCA GACGGCGAAA
CAATCTGATG TGGTGGTGGC TGTAGTCGGT GAAGCACAGG GGATGGCGCA CGAAGCCTCC
AGCCGGACCG ATATCACTAT TCCGCAAAGC CAACGTGACT TGATTGCGGC GCTGAAAGCC
ACCGGTAAAC CGCTGGTGCT GGTGCTGATG AACGGGCGTC CGCTGGCGCT GGTGAAAGAA
GATCAGCAGG CTGATGCGAT TCTGGAAACC TGGTTTGCGG GGACTGAAGG CGGTAATGCA
ATTGCCGATG TATTGTTTGG CGATTACAAC CCGTCCGGCA AGCTGCCAAT GTCCTTCCCG
CGTTCTGTCG GGCAGATCCC GGTGTACTAC AGCCATCTGA ATACCGGTCG CCCGTATAAT
GCCGACAAGC CGAACAAATA CACTTCGCGT TATTTTGATG AAGCTAACGG GGCGTTGTAT
CCGTTCGGCT ATGGGCTGAG CTACACCACT TTCACCGTCT CTGATGTGAA ACTTTCTGCG
CCGACCATGA AGCGTGACGG CAAAGTGACT GCCAGCGTGC AGGTGACGAA CACCGGTAAA
CGCGAAGGGG CCACGGTAGT ACAGATGTAC TTGCAGGATG TGACGGCTTC CATGAGCCGT
CCAGTGAAGC AGCTGAAAGG CTTTGAGAAA ATCACCCTGA AACCGGGCGA AACTCAGACT
GTCAGCTTCC CGATTGATAT CGAGGCGCTG AAGTTCTGGA ATCAACAGAT GAAATATGAC
GCCGAGCCTG GCAAGTTCAA TGTCTTTATC GGCACTGATT CCGCACGCGT TAAGAAAGGC
GAGTTTGAGT TGCTGTAA
 
Protein sequence
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 
PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF
PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL
TSTMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY
KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTME ELDDAARHVL NVKYDMGLFN
DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA
DSKRDVMGSW SAAGVADQSV TVLTGIKNSV GENGKVLYAK GANVTSDKGI IDFLNQYEEA
VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA
TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA
PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT
VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL