Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4549 |
Symbol | |
ID | 6144367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4650220 |
End bp | 4652205 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619365 |
Product | metallo-beta-lactamase family protein |
Protein accession | YP_001746477 |
Protein GI | 170680012 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.30993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACT CTCGGTTATT CCGTTTGAGC AGGATTGTTA TTGCGTTAAC TGCCGCCAGC GGCATGATGG TAAATACCGC TAACGCAACA GATGAAGCGA AAGCCGCCAC TCAATATACC CAACAGGTTA ATCAGAATTA CGCCAAATCA TTACCGTTTA GCGATCGTCA GGATTTTGAC GATGCTCAGC GTGGATTTAT CGCTCCGCTG CTGGATGAAG GTATTCTGCG CGATGCTAAT GGCAAACCAT ATTATCGCGG AGAAGATTAT AAATTTGATA TCAATGCGCC CGCGCCCGAA ACCGTTAACC CAAGCCTGTG GCGTCAGTCG CAGATCAACG GCATTTCTGG CCTGTTTAAA GTCACCGACC GCATGTATCA GGTGCGCAGC CAGGATATCT CGAACATCAC CTTCATTGAA GGCGATACGG GCATCATCGT CATTGACCCG CTGGTGACGC CAAATGCAGC AAAAGCCAGC CTTGATCTCT ATTTCAAACA CCGCCCACAA AAACCGATTG TGGCCGTTAT CTACACCCAC AGCCACACTG ACCACTATGG CGGCGTAAAA GGTATTGTCT CAGAAGCCGA TGTAAAAGCA GGCAAAGTGC AGATCATCGC CCCGGCAGGC TTTATGGACG AAGCCATCAG TGAAAACGTG CTGGCGGGTA ATATCATGAG TCGCCGTGCA TTTTACTCTT ACGGCCTGCT ACTCCCGCAT AATGCACAGG GAGATATCGG CAACGGGCTG GGTGTTACGC TTACTACCGG CGGCCCGACA ATTATCGCGC CGACGCGATC TATCACCAAG ACAGGAGAGA AACTCAATAT CGACGGGCTG GATTTCGAAT TCCTGATGGC TCCAGGCAGC GAAGCGCCGT CTGAAATGCA CCTCTATATT CCGGCGTTGA AAGCCCTGTG CACAGCGGAA AACAGCACCC ATACCCTGCA TAACTTCTAC ACCCTGCGTG GCGCGAAAAC CCGTGATACC GCGAAGTGGA CCGATTACCT GAATGAAACG CTGGATAAGT GGGGATCACA AGCAGAAGTG CTGTTTATGC CACATACCTG GCCAGTATGG GGTAATCAAC ATATCAATGA TTATATTGGA AAATATCGCG ACACCATTAA GTATATTCAC GACCAGACCC TGCACCTGGC GAACCAGGGT TACACCATGA ATGAAATCGG CAACATGATT CATCTGCCGG AAACGCTGGA TAAAAACTGG GCCAGCCGTG GCTATTATGG CTCCGTCAGT CATAACGCTC GCGCGGTATA TAACTTTTAC CTCGGCTACT ACGACGGTAA CCCAGCAAAC CTGAATCCGT ATGGCCAGGT CGATATGGGT AAACGTTATG TCAAAGCGCT GGGAGGTTCC GCACATGCTA TCAATCTGGC GCGTGAAGCC TATAACCAGG GCGACTACCG CTGGGCCTCT GAACTGCTGA AACAGGTGAT TGCTGCCAAT CCGGGAGACC AGGTGGCGAA AAACCTACAG GCCGATACCT TCGAACAATT GGGTTATCAG GCCGAATCAG CCACCTGGCG CGGCTTCTAC CTGACGGGAG CGAAAGAACT GCGTGAGGGC GCGAAGAAAA TCGAACACGC CAGCACCGCC TCTCCTGACA CCATCAAGGG TATGACCGTC GAGATGCTGC TTGATTACAT GGCTGTTCGT CTGAACAGTG AGAAAGCCGC GGGCAAATCC ATCAGCCTGA ACTTCAATCT CTCTGACAAC GATAACCTGA ACCTCTCACT CAACAATAGC GTATTGAACT ACCGTAAAGT ACTGCAACCG AAGGTAGACG CATCGTTTTA CATGAGCCGC AGCGATCTGC ACGACGTGCT GGTCGGACAA GCCAAAATGG CGGATCTGGT AAAGGCGAAG AAAGCCAAAA TTATTGGCAA TGGCGCAAAA CTGGAAGAAA TTATTGCCTG TCTGGATAAT TTCGATTTGT GGGTGAATAT CGTAACCCCA AATTAA
|
Protein sequence | MNNSRLFRLS RIVIALTAAS GMMVNTANAT DEAKAATQYT QQVNQNYAKS LPFSDRQDFD DAQRGFIAPL LDEGILRDAN GKPYYRGEDY KFDINAPAPE TVNPSLWRQS QINGISGLFK VTDRMYQVRS QDISNITFIE GDTGIIVIDP LVTPNAAKAS LDLYFKHRPQ KPIVAVIYTH SHTDHYGGVK GIVSEADVKA GKVQIIAPAG FMDEAISENV LAGNIMSRRA FYSYGLLLPH NAQGDIGNGL GVTLTTGGPT IIAPTRSITK TGEKLNIDGL DFEFLMAPGS EAPSEMHLYI PALKALCTAE NSTHTLHNFY TLRGAKTRDT AKWTDYLNET LDKWGSQAEV LFMPHTWPVW GNQHINDYIG KYRDTIKYIH DQTLHLANQG YTMNEIGNMI HLPETLDKNW ASRGYYGSVS HNARAVYNFY LGYYDGNPAN LNPYGQVDMG KRYVKALGGS AHAINLAREA YNQGDYRWAS ELLKQVIAAN PGDQVAKNLQ ADTFEQLGYQ AESATWRGFY LTGAKELREG AKKIEHASTA SPDTIKGMTV EMLLDYMAVR LNSEKAAGKS ISLNFNLSDN DNLNLSLNNS VLNYRKVLQP KVDASFYMSR SDLHDVLVGQ AKMADLVKAK KAKIIGNGAK LEEIIACLDN FDLWVNIVTP N
|
| |