Gene EcSMS35_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3034 
SymbolbglA 
ID6147280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3122343 
End bp3123782 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content51% 
IMG OID641617903 
Product6-phospho-beta-glucosidase BglA 
Protein accessionYP_001745054 
Protein GI170681438 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGTGA AAAAACTCAC CTTACCGAAA GATTTCTTAT GGGGCGGCGC AGTTGCCGCT 
CATCAGGTCG AAGGCGGCTG GAACAAAGGC GGCAAAGGGC CGAGCATTTG TGACGTTCTG
ACCGGTGGCG CACACGGCGT GCCGCGCGAA ATCACCAAAG AAGTCTTGCC AGGCAAATAC
TATCCAAACC ATGAAGCCGT TGATTTTTAT GGTCACTACA AAGAAGACAT CAAGCTATTT
GCCGAAATGG GCTTCAAATG TTTTCGGACT TCCATCGCCT GGACGCGCAT TTTTCCAAAA
GGCGATGAAG CTCAGCCAAA CGAAGAAGGG CTGAAGTTTT ACGACTCTCT GTTCGATGAA
TTGCTGAAAT ACAACATCGA ACCGGTGATC ACCCTATCCC ACTTTGAAAT GCCGCTGCAT
CTGGTGCAGC AATACGGTAG CTGGACCAAC CGTAAAGTGG TTGATTTCTT TGTCCGTTTC
GCAGAAGTGG TATTTGAACG CTATAAGCAT AAAGTCAAAT ACTGGATGAC CTTCAACGAA
ATTAACAACC AGCGTAACTG GCGTGCACCG CTGTTCGGTT ACTGCTGCTC CGGCGTGGTG
TATACCGAGC ATGAAAACCC GGAAGAGACG ATGTACCAGG TGCTGCATCA CCAGTTTGTC
GCCAGCGCCC TGGCGGTGAA AGCCGCGCGT CGCATTAACC CGGAAATGAA AGTCGGCTGT
ATGCTGGCGA TGGTGCCGCT TTACCCTTAC TCCTGTAATC CGGACGATGT GATGTTCGCC
CAGGAGTCGA TGCGCGAACG CTACGTCTTT ACCGATGTGC AGTTGCGTGG CTATTACCCA
TCCTATGTGT TGAACGAGTG GGAGCGTCGC GGATTTAACA TCAAAATGGA AGACGGCGAT
CTGGATGTGC TGCGCGAAGG CACCTGCGAT TATCTTGGTT TCAGCTATTA CATGACCAAC
GCAGTGAAGG CCGAAGGCGG CACCGGTGAT GCCATCTCTG GTTTTGAAGG CAGCGTACCA
AACCCGTATG TTAAAGCGTC TGACTGGGGC TGGCAGATTG ATCCTGTGGG TCTGCGTTAC
GCACTTTGCG AACTGTATGA GCGTTATCAG AAGCCGCTGT TTATTGTCGA AAACGGTTTT
GGCGCTTATG ACAAAGTGGA AGAAGATGGC AGCATCAACG ACGACTACCG CATTGACTAC
CTGCGCGCCC ATATTGAAGA GATGAAAAAA GCGGTGACTT ACGATGGCGT GGATCTGATG
GGCTACACAC CGTGGGGCTG CATCGACTGC GTGTCGTTTA CCACCGGGCA GTACAGCAAA
CGCTACGGCT TTATCTATGT GAATAAACAT GACGACGGTA CTGGCGATAT GTCGCGTTCA
CGTAAGAAGA GCTTTGACTG GTACAAAGAG GTGATTGCCA GCAACGGCGA GAATCTTTAA
 
Protein sequence
MIVKKLTLPK DFLWGGAVAA HQVEGGWNKG GKGPSICDVL TGGAHGVPRE ITKEVLPGKY 
YPNHEAVDFY GHYKEDIKLF AEMGFKCFRT SIAWTRIFPK GDEAQPNEEG LKFYDSLFDE
LLKYNIEPVI TLSHFEMPLH LVQQYGSWTN RKVVDFFVRF AEVVFERYKH KVKYWMTFNE
INNQRNWRAP LFGYCCSGVV YTEHENPEET MYQVLHHQFV ASALAVKAAR RINPEMKVGC
MLAMVPLYPY SCNPDDVMFA QESMRERYVF TDVQLRGYYP SYVLNEWERR GFNIKMEDGD
LDVLREGTCD YLGFSYYMTN AVKAEGGTGD AISGFEGSVP NPYVKASDWG WQIDPVGLRY
ALCELYERYQ KPLFIVENGF GAYDKVEEDG SINDDYRIDY LRAHIEEMKK AVTYDGVDLM
GYTPWGCIDC VSFTTGQYSK RYGFIYVNKH DDGTGDMSRS RKKSFDWYKE VIASNGENL