Gene ECH74115_4193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4193 
SymbolbglA 
ID6969906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3886089 
End bp3887528 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content51% 
IMG OID643387937 
Product6-phospho-beta-glucosidase BglA 
Protein accessionYP_002272376 
Protein GI209399585 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGTGA AAAAACTCAC CTTACCGAAA GATTTCTTAT GGGGCGGCGC AGTTGCCGCT 
CATCAGGTCG AAGGCGGCTG GAACAAAGGC GGAAAAGGGC CGAGCATTTG TGACGTTCTG
ACCGGTGGCG CACACGGCGT GCCGCGCGAA ATAACCAAAG AAGTCTTGCC AGGCAAATAC
TATCCAAACC ATGAAGCCGT TGATTTTTAT GGTCACTATA AGGAAGACAT CAAGCTATTT
GCCGAAATGG GCTTCAAATG TTTTCGTACA TCCATTGCCT GGACGCGCAT TTTTCCAAAA
GGCGATGAAG CTCAGCCAAA CGAAGAAGGG CTGAAGTTCT ACGACTCTCT GTTCGATGAA
CTGCTGAAAT ACAACATCGA ACCGGTGATC ACCCTCTCCC ACTTTGAAAT GCCGCTGCAT
CTGGTGCAGC AATACGGTAG CTGGACCAAC CGTAAAGTGG TTGATTTCTT TGTACGTTTC
GCGGAAGTGG TATTTGAACG CTATAAGCAC AAAGTCAAAT ACTGGATGAC CTTCAACGAA
ATTAACAACC AGCGTAACTG GCGTGCACCG CTGTTCGGTT ACTGCTGCTC CGGCGTGGTG
TATACCGAGC ATGAAAACCC GGAAGAGACG ATGTATCAGG TGCTGCATCA CCAGTTTGTC
GCCAGCGCCC TGGCAGTGAA AGCCGCGCGT CGCATTAACC TGGAGATGAA AGTCGGCTGT
ATGCTGGCGA TGGTGCCGCT CTATCCCTAC TCATGTAACC CGGACGATGT GATGTTCGCT
CAGGAGTCGA TGCGCGAACG CTACGTCTTT ACCGATGTGC AGCTACGCGG CTATTACCCG
TCGTATGTGT TGAACGAGTG GGAGCGTCGC GGATTTAACA TCAAAATGGA AGATAGCGAT
CTCGATGTAC TGCGCGAAGG CACCTGCGAT TATCTTGGTT TCAGCTATTA CATGACCAAC
GCAGTGAAGG CCGAAGGCGG CACCGGCGAT GCGATCTCTG GTTTTGAAGG CAGCGTACCA
AACCCGTATG TTAAAGCATC TGACTGGGGC TGGCAGATTG ATCCGGTGGG TCTGCGTTAC
GCACTTTGCG AACTGTATGA GCGTTACCAG AAGCCGCTGT TTATTGTCGA AAACGGTTTT
GGCGCTTACG ACAAAGTGGA AGAAGATGGC AGCATCAACG ACGACTACCG CATTGACTAC
CTGCGCGCCC ATATTGAAGA GATGAAAAAA GCAGTAACTT ACGATGGCGT GGATCTGATG
GGCTACACAC CGTGGGGCTG CATCGACTGC GTGTCGTTCA CTACCGGGCA GTACAGCAAA
CGCTACGGCT TTATCTATGT GAATAAACAT GACGACGGTA CTGGCGATAT GTCGCGTTCA
CGTAAGAAGA GCTTTAACTG GTACAAAGAA GTGATTGCCA GCAACGGCGA GAATCTGTAA
 
Protein sequence
MIVKKLTLPK DFLWGGAVAA HQVEGGWNKG GKGPSICDVL TGGAHGVPRE ITKEVLPGKY 
YPNHEAVDFY GHYKEDIKLF AEMGFKCFRT SIAWTRIFPK GDEAQPNEEG LKFYDSLFDE
LLKYNIEPVI TLSHFEMPLH LVQQYGSWTN RKVVDFFVRF AEVVFERYKH KVKYWMTFNE
INNQRNWRAP LFGYCCSGVV YTEHENPEET MYQVLHHQFV ASALAVKAAR RINLEMKVGC
MLAMVPLYPY SCNPDDVMFA QESMRERYVF TDVQLRGYYP SYVLNEWERR GFNIKMEDSD
LDVLREGTCD YLGFSYYMTN AVKAEGGTGD AISGFEGSVP NPYVKASDWG WQIDPVGLRY
ALCELYERYQ KPLFIVENGF GAYDKVEEDG SINDDYRIDY LRAHIEEMKK AVTYDGVDLM
GYTPWGCIDC VSFTTGQYSK RYGFIYVNKH DDGTGDMSRS RKKSFNWYKE VIASNGENL