Gene EcSMS35_4549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4549 
Symbol 
ID6144367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4650220 
End bp4652205 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content50% 
IMG OID641619365 
Productmetallo-beta-lactamase family protein 
Protein accessionYP_001746477 
Protein GI170680012 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.30993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAACT CTCGGTTATT CCGTTTGAGC AGGATTGTTA TTGCGTTAAC TGCCGCCAGC 
GGCATGATGG TAAATACCGC TAACGCAACA GATGAAGCGA AAGCCGCCAC TCAATATACC
CAACAGGTTA ATCAGAATTA CGCCAAATCA TTACCGTTTA GCGATCGTCA GGATTTTGAC
GATGCTCAGC GTGGATTTAT CGCTCCGCTG CTGGATGAAG GTATTCTGCG CGATGCTAAT
GGCAAACCAT ATTATCGCGG AGAAGATTAT AAATTTGATA TCAATGCGCC CGCGCCCGAA
ACCGTTAACC CAAGCCTGTG GCGTCAGTCG CAGATCAACG GCATTTCTGG CCTGTTTAAA
GTCACCGACC GCATGTATCA GGTGCGCAGC CAGGATATCT CGAACATCAC CTTCATTGAA
GGCGATACGG GCATCATCGT CATTGACCCG CTGGTGACGC CAAATGCAGC AAAAGCCAGC
CTTGATCTCT ATTTCAAACA CCGCCCACAA AAACCGATTG TGGCCGTTAT CTACACCCAC
AGCCACACTG ACCACTATGG CGGCGTAAAA GGTATTGTCT CAGAAGCCGA TGTAAAAGCA
GGCAAAGTGC AGATCATCGC CCCGGCAGGC TTTATGGACG AAGCCATCAG TGAAAACGTG
CTGGCGGGTA ATATCATGAG TCGCCGTGCA TTTTACTCTT ACGGCCTGCT ACTCCCGCAT
AATGCACAGG GAGATATCGG CAACGGGCTG GGTGTTACGC TTACTACCGG CGGCCCGACA
ATTATCGCGC CGACGCGATC TATCACCAAG ACAGGAGAGA AACTCAATAT CGACGGGCTG
GATTTCGAAT TCCTGATGGC TCCAGGCAGC GAAGCGCCGT CTGAAATGCA CCTCTATATT
CCGGCGTTGA AAGCCCTGTG CACAGCGGAA AACAGCACCC ATACCCTGCA TAACTTCTAC
ACCCTGCGTG GCGCGAAAAC CCGTGATACC GCGAAGTGGA CCGATTACCT GAATGAAACG
CTGGATAAGT GGGGATCACA AGCAGAAGTG CTGTTTATGC CACATACCTG GCCAGTATGG
GGTAATCAAC ATATCAATGA TTATATTGGA AAATATCGCG ACACCATTAA GTATATTCAC
GACCAGACCC TGCACCTGGC GAACCAGGGT TACACCATGA ATGAAATCGG CAACATGATT
CATCTGCCGG AAACGCTGGA TAAAAACTGG GCCAGCCGTG GCTATTATGG CTCCGTCAGT
CATAACGCTC GCGCGGTATA TAACTTTTAC CTCGGCTACT ACGACGGTAA CCCAGCAAAC
CTGAATCCGT ATGGCCAGGT CGATATGGGT AAACGTTATG TCAAAGCGCT GGGAGGTTCC
GCACATGCTA TCAATCTGGC GCGTGAAGCC TATAACCAGG GCGACTACCG CTGGGCCTCT
GAACTGCTGA AACAGGTGAT TGCTGCCAAT CCGGGAGACC AGGTGGCGAA AAACCTACAG
GCCGATACCT TCGAACAATT GGGTTATCAG GCCGAATCAG CCACCTGGCG CGGCTTCTAC
CTGACGGGAG CGAAAGAACT GCGTGAGGGC GCGAAGAAAA TCGAACACGC CAGCACCGCC
TCTCCTGACA CCATCAAGGG TATGACCGTC GAGATGCTGC TTGATTACAT GGCTGTTCGT
CTGAACAGTG AGAAAGCCGC GGGCAAATCC ATCAGCCTGA ACTTCAATCT CTCTGACAAC
GATAACCTGA ACCTCTCACT CAACAATAGC GTATTGAACT ACCGTAAAGT ACTGCAACCG
AAGGTAGACG CATCGTTTTA CATGAGCCGC AGCGATCTGC ACGACGTGCT GGTCGGACAA
GCCAAAATGG CGGATCTGGT AAAGGCGAAG AAAGCCAAAA TTATTGGCAA TGGCGCAAAA
CTGGAAGAAA TTATTGCCTG TCTGGATAAT TTCGATTTGT GGGTGAATAT CGTAACCCCA
AATTAA
 
Protein sequence
MNNSRLFRLS RIVIALTAAS GMMVNTANAT DEAKAATQYT QQVNQNYAKS LPFSDRQDFD 
DAQRGFIAPL LDEGILRDAN GKPYYRGEDY KFDINAPAPE TVNPSLWRQS QINGISGLFK
VTDRMYQVRS QDISNITFIE GDTGIIVIDP LVTPNAAKAS LDLYFKHRPQ KPIVAVIYTH
SHTDHYGGVK GIVSEADVKA GKVQIIAPAG FMDEAISENV LAGNIMSRRA FYSYGLLLPH
NAQGDIGNGL GVTLTTGGPT IIAPTRSITK TGEKLNIDGL DFEFLMAPGS EAPSEMHLYI
PALKALCTAE NSTHTLHNFY TLRGAKTRDT AKWTDYLNET LDKWGSQAEV LFMPHTWPVW
GNQHINDYIG KYRDTIKYIH DQTLHLANQG YTMNEIGNMI HLPETLDKNW ASRGYYGSVS
HNARAVYNFY LGYYDGNPAN LNPYGQVDMG KRYVKALGGS AHAINLAREA YNQGDYRWAS
ELLKQVIAAN PGDQVAKNLQ ADTFEQLGYQ AESATWRGFY LTGAKELREG AKKIEHASTA
SPDTIKGMTV EMLLDYMAVR LNSEKAAGKS ISLNFNLSDN DNLNLSLNNS VLNYRKVLQP
KVDASFYMSR SDLHDVLVGQ AKMADLVKAK KAKIIGNGAK LEEIIACLDN FDLWVNIVTP
N