Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4329 |
Symbol | |
ID | 5591767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4335814 |
End bp | 4337799 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640923427 |
Product | metallo-beta-lactamase superfamily protein |
Protein accession | YP_001460872 |
Protein GI | 157163554 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 62 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAACT CTCGGTTATT CCGTTTGAGC AGAATTGTTA TTGCGTTAAC TGCCGCCAGC GGCATGATGG TAAATACCGC TAACGCGAAA GAGGAAGCGA AAGCCGCCAC TCAATATACC CAACAGGTTA ATCAGAATTA CGCCAAATCA TTACCGTTTA GCGATCGTCA GGATTTTGAC GACGCTCAGC GTGGATTTAT CGCTCCGCTG CTGGATGAAG GTATTCTGCG CGATGCTAAT GGCAAACCAT ATTATCGCGG GGAAGATTAT AAATTTGATA TCAATGCGCC CGCGCCCGAA ACCGTTAACC CTAGCCTGTG GCGTCAGTCG CAGTTAAACG GTATTTCCGG CCTGTTTAAA GTCACCGACA GAATGTACCA GGTTCGCGGT CAGGATATCT CCAACATCAC CTTTATCGAA GGTGATACCG GGATTATTGT CATCGACCCG CTGGTGACGC CACCGAGCGC AAAAGCCGCC CTTGACCTTT ACTTCCAAAA TCGCCCGCAA AAACCGATTG TTGCGGTTAT TTATACCCAC AGCCATACCG ACCACTATGG CGGCGTGAAA GGCATTATCT CTGAAGCCGA TGTGAAATCC GGTAAAGTGC AGGTTATCGC CCCTGCTGGC TTTATGGACG AAGCCATCAG CGAAAACGTA CTGGCGGGTA ATATTATGAG CCGCCGTGCA CTTTACTCCT ACGGCCTGCT GCTGGCGCAT AACCCTCAGG GTAACATCGG CAATGGTCTT GGCGTAACGC TGGCATCGGG CTACCCGAGC ATCATCGCAC CGAACAAAAC CATCACCAAA ACCGGTGAGA AGATGATTAT CGACGGCCTG GAGTTTGACT TCCTGATGAC CCCAGGTAGC GAAGCACCAG CCGAAATGCA CTTCTATATT CCGGCCCTGA AAGCGCTGTG TACCGCCGAG AACGCCACGC ATACCCTGCA CAACTTCTAC ACTCTGCGCG GCGCGAAAAC CCGCGACACC AGCAAGTGGA CCGAGTATCT GAACGAAACG CTGGATATGT GGGGTAACGA CGCGGAAGTC CTGTTTATGC CGCACACCTG GCCGGTCTGG GGCAATAAGC ATATCAATGA TTATATTGGT AAATATCGCG ATACTATCAA GTACATTCAC GACCAGACCC TGCACCTGGC GAACCAGGGC TACACCATGA ATGAAATCGG CGACATGATT AAACTGCCGC CTGCACTTGC CAATAACTGG GCCAGCCGTG GCTATTACGG TTCTGTCAGC CACAACGCCC GCGCGGTGTA TAACTTCTAT CTTGGCTATT ACGACGGTAA CCCGGCTAAC CTGCATCCGT ATGGTCAGGT GGAGATGGGT AAACGTTACG TGCAGGCGCT GGGCGGTTCT GCCCGTGTCA TCAACCTGGC GCAAGAAGCG AACAAGCAAG GTGATTACCG CTGGTCGGCA GAACTGCTGA AACAGGTGAT TGCCGCCAAC CCGGGTGACC AGGTCGCGAA GAATCTGCAA GCGAATAACT TTGAACAGCT GGGCTATCAG GCCGAGTCCG CCACCTGGCG CGGTTTCTAC CTGACCGGCG CGAAAGAGCT GCGCGAAGGG GTGCATAAGT TCAGCCACGG CACCACCGGT TCCCCGGACA CCATTCGCGG GATGTCGGTC GAAATGCTGT TCGACTTTAT GTCCGTTCGC CTCGATAGCG CGAAAGCCGC GGGTAAAAAT ATCAGCCTGA ACTTCAATAT GAGCAATGGC GATAACCTCA ACCTGACGCT GAACGATAGC GTGCTTAACT ACCGTAAAAC ACTGCAATCC CAAGCTGACG CCTCTTTCTA CATCAGCCGT GAAGATCTGC ACGCCGTGCT GACCGGGCAA GCCAAAATGG CGGATCTGGT AAAAGCGAAG AAAGCCAAAA TTATTGGCAA TGGCGCGAAA CTGGAAGAAA TTATCGCCTG TCTGGATAAT TTCGATTTGT GGGTGAATAT CGTAACCCCA AATTAA
|
Protein sequence | MNNSRLFRLS RIVIALTAAS GMMVNTANAK EEAKAATQYT QQVNQNYAKS LPFSDRQDFD DAQRGFIAPL LDEGILRDAN GKPYYRGEDY KFDINAPAPE TVNPSLWRQS QLNGISGLFK VTDRMYQVRG QDISNITFIE GDTGIIVIDP LVTPPSAKAA LDLYFQNRPQ KPIVAVIYTH SHTDHYGGVK GIISEADVKS GKVQVIAPAG FMDEAISENV LAGNIMSRRA LYSYGLLLAH NPQGNIGNGL GVTLASGYPS IIAPNKTITK TGEKMIIDGL EFDFLMTPGS EAPAEMHFYI PALKALCTAE NATHTLHNFY TLRGAKTRDT SKWTEYLNET LDMWGNDAEV LFMPHTWPVW GNKHINDYIG KYRDTIKYIH DQTLHLANQG YTMNEIGDMI KLPPALANNW ASRGYYGSVS HNARAVYNFY LGYYDGNPAN LHPYGQVEMG KRYVQALGGS ARVINLAQEA NKQGDYRWSA ELLKQVIAAN PGDQVAKNLQ ANNFEQLGYQ AESATWRGFY LTGAKELREG VHKFSHGTTG SPDTIRGMSV EMLFDFMSVR LDSAKAAGKN ISLNFNMSNG DNLNLTLNDS VLNYRKTLQS QADASFYISR EDLHAVLTGQ AKMADLVKAK KAKIIGNGAK LEEIIACLDN FDLWVNIVTP N
|
| |