Gene ECH74115_5324 details


Gene Information

Locus tag: ECH74115_5324
Symbol:
ID: 6970397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4966150
End bp: 4968186
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 54%
IMG OID: 643388985
Product: alpha-glucosidase
Protein accession: YP_002273394
Protein GI: 209398541
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
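
The coordinate and length fields above are mutually consistent, which can be checked with a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon of the CDS is a stop codon. The function names `span_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 4966150, 4968186

gene_len = span_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # prints "2037 678"
```

Both results match the Gene Length and Protein Length fields of the record.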


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.539055
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACGC CACGCCCACA ATTACCAGAT TTTGAATTTC ATCAGAATAA CGACAGTTTT 
ACCCTACATT TTCGACAACG TCTTATTTTA ACCCATAGCA AAGATAATCC TTGTTTATGG
ATTGGCTCAG GTATAGCGGA CATCGATATG TTCCGCGGCA ATTTCAGCAT TAAAGATAAA
CTACAGGAGA AAATTGCGCT TACCGACGCC ATCGTCAGCC AGTCACCGGA TGGTTGGTTA
ATCCATTTCA GTCGTGGTTC TGACATTAGC GCCACGCTGA ATATCTCTGC CGACGATCAG
GGCCGTTTAT TGCTGGAACT ACAAAACGAC AACCTTAACC ACAACCGCAT CTGGCTGCGC
CTTGCCGCTC AACCAGAGGA CCATATCTAC GGCTGCGGCG AACAGTTTTC CTACTTCGAT
CTGCGCGGCA AACCGTTCCC GCTATGGACC AGTGAACAAG GCGTTGGTCG CAACAAACAA
ACCTATGTCA CCTGGCAGGC CGACTGCAAA GAAAATGCGG GCGGCGACTA TTACTGGACT
TTCTTCCCAC AGCCTACGTT TGTCAGCACG CAGAAGTATT ATTGCCATGT TGATAACAGT
TGCTATATGA ACTTCGACTT TAGTGCCCCG GAATACCATG AACTGGCGCT GTGGGAAGAC
AAAGCAACGC TGCGTTTTGA ATGTGCTGAC ACATACATCT CCCTGCTGGA AAAATTAACC
GCCCTGCTGG GACGCCAGCC AGAACTGCCC GACTGGATTT ATGACGGAGT AACGCTCGGC
ATTCAGGGCG GGACGGAAGT GTGCCAGAAG AAACTGGACA CCATGCGTAA CGCGGGCGTG
AAGGTCAACG GCATCTGGGC GCAGGACTGG TCCGGTATTC GTATGACCTC TTTTGGCAAA
CGCGTGATGT GGAACTGGAA GTGGAACAGC GAAAACTACC CGCAACTGGA TTCGCGTATC
AAGCAGTGGA ACAAAGAGGG CGTGCAGTTC CTTGCCTATA TCAACCCGTA TGTTGCCAGT
GATAAAGATC TCTGCGAAGA GGCGGCAAAG CGCGGTTATC TGGCAAAAGA TGTCGCCGGT
GGCGACTATC TGGTGGAGTT TGGCGAGTTC TACGGCGGCG TTGTCGATCT CACCAATCCG
GAAGCCTACG CCTGGTTCAA GGAAGTTATC AAAAAGAACA TGATTGAACT CGGCTGCGGC
GGCTGGATGG CTGACTTCGG CGAGTATCTG CCCACCGACA CGTACCTGCA TAACGGCATC
AGCGCGGAAA TTATGCATAA CGCCTGGCCC GCGCTGTGGG CGAAATGTAA CTACGAAGCC
CTTGAAGAAA CGGGCAAGCT CGGCGAGATC CTCTTCTTTA TGCGCGCCGG TTCTACCGGT
AGCCAGAAAT ACTCCACCAT GATGTGGGCG GGTGACCAGA ACGTGGACTG GAGTCTGGAC
GATGGCCTGG CGTCGGTTGT CCCGGCGGCG CTATCTCTGG CAATGACCGG ACATGGCCTG
CATCACAGCG ACATTGGCGG TTATACCACC CTGTTTGAGA TGAAGCGCAG CAAAGAGCTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTC ACGCCGATGA TGCGCACCCA CGAAGGTAAC
CGTCCTGGCG ACAACTGGCA GTTTGACGGC GACGCGGAAA CCATCGCCCA TTTCGCCCGT
ATGACCACCG TCTTCACCAC CCTGAAACCT TACCTGAAAG AGGCCGTCGC GCTGAATGCG
AAGTCCGGCC TGCCGGTTAT GCGCCCGCTG TTCCTGCATT ACGAAGACGA CGCGCAGACC
TATTCCCTGA AATATCAGTA CCTGTTAGGT CGCGACATTC TGGTTGCTCC GGTACACGAA
GAAGGTCGCA GCGACTGGAC GCTCTATCTG CCAGAGGATA ACTGGGTCCA CGCCTGGACC
GGAGAAACCT TCCACGGTGG AGAAATAACT GTTGAAGCGC CCATCGGCAA GCCGCCGGTC
TTTTATCGCG CCGACAGCGA ATGGGCTGCA CTGTTCGCCT CGTTAAAAAA CATCTAA
 
Protein sequence
MDTPRPQLPD FEFHQNNDSF TLHFRQRLIL THSKDNPCLW IGSGIADIDM FRGNFSIKDK 
LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDQ GRLLLELQND NLNHNRIWLR
LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT
ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS ENYPQLDSRI KQWNKEGVQF LAYINPYVAS DKDLCEEAAK RGYLAKDVAG
GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGI
SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAQT
YSLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GETFHGGEIT VEAPIGKPPV
FYRADSEWAA LFASLKNI
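
The sequences above are laid out in blocks of 10 characters, so they need to be cleaned before use. The following Python sketch shows one way to strip the spacing, compute the GC fraction, and translate the gene sequence. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, and this sketch does not model that). The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the block layout; uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record above.
FIRST_ROW = "ATGGATACGC CACGCCCACA ATTACCAGAT TTTGAATTTC ATCAGAATAA CGACAGTTTT"
print(translate(clean_sequence(FIRST_ROW)))
```

Run on the first row of the gene sequence, the sketch yields `MDTPRPQLPDFEFHQNNDSF`, which matches the first 20 residues of the protein sequence listed above.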