Gene ECH74115_5593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5593 
Symbol 
ID6968710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5232498 
End bp5234483 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content53% 
IMG OID643389229 
Productmetallo-beta-lactamase family protein 
Protein accessionYP_002273626 
Protein GI209400388 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAACT CTCGGTTATT CCGTTTGAGC AGGATTGTTA TTGCGTTAAC TGCCGCCAGC 
GGCATGATGG TAAATACCGC TAACGCGAAA GAGGAAGCGA AAGCCGCCAC TCAATATACC
CAACAGGTTA ATCAGAATTA CGCCAAATCA TTACCGTTTA GCGATCGTCA GGATTTTGAC
GATGCCCAGC GTGGATTTAT CGCCCCGCTG CTGGATGAAG GTATTCTGCG TGATGCGAAC
GGTAAAGTTT ACTACCGCGC AGACGATTAC AAATTTGATA TTAATGCCGC TGCGCCGGAA
ACCGTAAACC CCAGCCTGTG GCGTCAGTCG CAGATCAACG GCATTTCTGG CCTGTTCAAA
GTCACCGATA AAATGTATCA GGTGCGCGGC CAGGATATCT CTAACATTAC GTTCGTTGAG
GGGGAGAAAG GCATTATTGT CATCGACCCG CTGGTGACGC CGCCTGCCGC AAAAGCCGCA
CTTGACCTTT ACTTCCAGCA TCGTCCGCAA AAACCGATTG TTGCCGTTAT CTACACTCAC
AGCCACACCG ACCACTATGG TGGCGTGAAA GGCATTATCT CTGAAGCCGA TGTTAAATCC
GGCAAAGTTC AGGTGATTGC CCCTGCTGGC TTTATGGACG AAGCCATCAG CGAAAACGTG
CTGGCGGGTA ACATCATGAG CCGCCGTGCG CTCTACTCTT ACGGTCTGTT ACTGCCGCAC
AACGCGCAAG GCAACGTGGG TAATGGCCTT GGCGTGACGC TGGCAACGGG CGACCCGAGC
ATTATTGCAC CGACTAAAAC TATCGTCAGA ACTGGCGAGA AGATGATTAT CGACGGCCTG
GAGTTTGACT TCCTGATGAC CCCAGGTAGC GAAGCGCCAG CCGAAATGCA CTTCTATATT
CCGGCCCTGA AAGCCCTGTG TACCGCCGAG AACGCCACGC ATACCCTGCA CAACTTCTAC
ACTCTGCGCG GCGCGAAAAC CCGCGATACC AGCAAGTGGA CCGAGTACCT GAACGAAACG
CTGGATATGT GGGGTAACGA CGCGGAAGTA CTGTTTATGC CGCACACATG GCCGGTCTGG
GGCAATAAGC ATATCAATGA TTATATTGGT AAATATCGCG ATACTATCAC GTTCATTCAC
GACCAGACCC TGCACCTGGC GAACCAGGGC TACACCATGA ATGAAATCGG CGACATGATT
AAGCTGCCGC CTGCACTTGC CAATAACTGG GCCAGCCGTG GCTATTACGG TTCTGTCAGC
CACAACGCCC GCGCGGTGTA TAACTTCTAT CTTGGCTATT ACGACGGTAA CCCGGCTAAC
CTGCATCCGT ATGGTCAGGT AGAGATGGGC AAACGTTACG TGAAAGCGCT GGGCGGCTCT
GCCCGTGTCA TCAACCTGGC GCAAGAAGCG AATAAGCAAG GTGATTACCG CTGGTCGGCA
GAACTACTGA AACAGGTGAT TGCCGCCAAC CCGGGTGACC AGGTGGCGAA GAATCTGCAA
GCGAATAACT TTGAACAGCT GGGCTATCAG GCCGAGTCCG CCACATGGCG CGGTTTCTAC
CTGACCGGCG CGAAAGAGCT GCGCGAAGGG GTGCATAAGT TCAGCCACGG CACCACCGGT
TCCCCGGACA CCATTCGCGG AATGTCGGTC GAAATGCTGT TCGACTTTAT GGCCGTTCGC
CTCGATAGCG CGAAAGCTGC GGGTAAAAAT ATCAGCCTGA ACTTCAATAT GAGCAACGGC
GATAACCTCA ACCTGACGCT GAACGATAGC GTGCTTAACT ACCGTAAAAC ACTGCAACCG
CAAGCCGACG CCTCTTTCTA CATCAGCCGT GAAGATCTGC ACGCCGTGCT GACCGGACAG
GCAAAAATGG CGGATCTGGT GAAAGCGAAG AAAGCCAAAA TTATTGGCAA TGGCGCGAAA
CTGGAAGAAA TTATCGCCTG TCTGGATAAT TTCGATTTGT GGGTGAATAT CGTAACCCCA
AATTAA
 
Protein sequence
MNNSRLFRLS RIVIALTAAS GMMVNTANAK EEAKAATQYT QQVNQNYAKS LPFSDRQDFD 
DAQRGFIAPL LDEGILRDAN GKVYYRADDY KFDINAAAPE TVNPSLWRQS QINGISGLFK
VTDKMYQVRG QDISNITFVE GEKGIIVIDP LVTPPAAKAA LDLYFQHRPQ KPIVAVIYTH
SHTDHYGGVK GIISEADVKS GKVQVIAPAG FMDEAISENV LAGNIMSRRA LYSYGLLLPH
NAQGNVGNGL GVTLATGDPS IIAPTKTIVR TGEKMIIDGL EFDFLMTPGS EAPAEMHFYI
PALKALCTAE NATHTLHNFY TLRGAKTRDT SKWTEYLNET LDMWGNDAEV LFMPHTWPVW
GNKHINDYIG KYRDTITFIH DQTLHLANQG YTMNEIGDMI KLPPALANNW ASRGYYGSVS
HNARAVYNFY LGYYDGNPAN LHPYGQVEMG KRYVKALGGS ARVINLAQEA NKQGDYRWSA
ELLKQVIAAN PGDQVAKNLQ ANNFEQLGYQ AESATWRGFY LTGAKELREG VHKFSHGTTG
SPDTIRGMSV EMLFDFMAVR LDSAKAAGKN ISLNFNMSNG DNLNLTLNDS VLNYRKTLQP
QADASFYISR EDLHAVLTGQ AKMADLVKAK KAKIIGNGAK LEEIIACLDN FDLWVNIVTP
N