Gene EcHS_A4329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4329 
Symbol 
ID5591767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4335814 
End bp4337799 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content52% 
IMG OID640923427 
Productmetallo-beta-lactamase superfamily protein 
Protein accessionYP_001460872 
Protein GI157163554 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACT CTCGGTTATT CCGTTTGAGC AGAATTGTTA TTGCGTTAAC TGCCGCCAGC 
GGCATGATGG TAAATACCGC TAACGCGAAA GAGGAAGCGA AAGCCGCCAC TCAATATACC
CAACAGGTTA ATCAGAATTA CGCCAAATCA TTACCGTTTA GCGATCGTCA GGATTTTGAC
GACGCTCAGC GTGGATTTAT CGCTCCGCTG CTGGATGAAG GTATTCTGCG CGATGCTAAT
GGCAAACCAT ATTATCGCGG GGAAGATTAT AAATTTGATA TCAATGCGCC CGCGCCCGAA
ACCGTTAACC CTAGCCTGTG GCGTCAGTCG CAGTTAAACG GTATTTCCGG CCTGTTTAAA
GTCACCGACA GAATGTACCA GGTTCGCGGT CAGGATATCT CCAACATCAC CTTTATCGAA
GGTGATACCG GGATTATTGT CATCGACCCG CTGGTGACGC CACCGAGCGC AAAAGCCGCC
CTTGACCTTT ACTTCCAAAA TCGCCCGCAA AAACCGATTG TTGCGGTTAT TTATACCCAC
AGCCATACCG ACCACTATGG CGGCGTGAAA GGCATTATCT CTGAAGCCGA TGTGAAATCC
GGTAAAGTGC AGGTTATCGC CCCTGCTGGC TTTATGGACG AAGCCATCAG CGAAAACGTA
CTGGCGGGTA ATATTATGAG CCGCCGTGCA CTTTACTCCT ACGGCCTGCT GCTGGCGCAT
AACCCTCAGG GTAACATCGG CAATGGTCTT GGCGTAACGC TGGCATCGGG CTACCCGAGC
ATCATCGCAC CGAACAAAAC CATCACCAAA ACCGGTGAGA AGATGATTAT CGACGGCCTG
GAGTTTGACT TCCTGATGAC CCCAGGTAGC GAAGCACCAG CCGAAATGCA CTTCTATATT
CCGGCCCTGA AAGCGCTGTG TACCGCCGAG AACGCCACGC ATACCCTGCA CAACTTCTAC
ACTCTGCGCG GCGCGAAAAC CCGCGACACC AGCAAGTGGA CCGAGTATCT GAACGAAACG
CTGGATATGT GGGGTAACGA CGCGGAAGTC CTGTTTATGC CGCACACCTG GCCGGTCTGG
GGCAATAAGC ATATCAATGA TTATATTGGT AAATATCGCG ATACTATCAA GTACATTCAC
GACCAGACCC TGCACCTGGC GAACCAGGGC TACACCATGA ATGAAATCGG CGACATGATT
AAACTGCCGC CTGCACTTGC CAATAACTGG GCCAGCCGTG GCTATTACGG TTCTGTCAGC
CACAACGCCC GCGCGGTGTA TAACTTCTAT CTTGGCTATT ACGACGGTAA CCCGGCTAAC
CTGCATCCGT ATGGTCAGGT GGAGATGGGT AAACGTTACG TGCAGGCGCT GGGCGGTTCT
GCCCGTGTCA TCAACCTGGC GCAAGAAGCG AACAAGCAAG GTGATTACCG CTGGTCGGCA
GAACTGCTGA AACAGGTGAT TGCCGCCAAC CCGGGTGACC AGGTCGCGAA GAATCTGCAA
GCGAATAACT TTGAACAGCT GGGCTATCAG GCCGAGTCCG CCACCTGGCG CGGTTTCTAC
CTGACCGGCG CGAAAGAGCT GCGCGAAGGG GTGCATAAGT TCAGCCACGG CACCACCGGT
TCCCCGGACA CCATTCGCGG GATGTCGGTC GAAATGCTGT TCGACTTTAT GTCCGTTCGC
CTCGATAGCG CGAAAGCCGC GGGTAAAAAT ATCAGCCTGA ACTTCAATAT GAGCAATGGC
GATAACCTCA ACCTGACGCT GAACGATAGC GTGCTTAACT ACCGTAAAAC ACTGCAATCC
CAAGCTGACG CCTCTTTCTA CATCAGCCGT GAAGATCTGC ACGCCGTGCT GACCGGGCAA
GCCAAAATGG CGGATCTGGT AAAAGCGAAG AAAGCCAAAA TTATTGGCAA TGGCGCGAAA
CTGGAAGAAA TTATCGCCTG TCTGGATAAT TTCGATTTGT GGGTGAATAT CGTAACCCCA
AATTAA
 
Protein sequence
MNNSRLFRLS RIVIALTAAS GMMVNTANAK EEAKAATQYT QQVNQNYAKS LPFSDRQDFD 
DAQRGFIAPL LDEGILRDAN GKPYYRGEDY KFDINAPAPE TVNPSLWRQS QLNGISGLFK
VTDRMYQVRG QDISNITFIE GDTGIIVIDP LVTPPSAKAA LDLYFQNRPQ KPIVAVIYTH
SHTDHYGGVK GIISEADVKS GKVQVIAPAG FMDEAISENV LAGNIMSRRA LYSYGLLLAH
NPQGNIGNGL GVTLASGYPS IIAPNKTITK TGEKMIIDGL EFDFLMTPGS EAPAEMHFYI
PALKALCTAE NATHTLHNFY TLRGAKTRDT SKWTEYLNET LDMWGNDAEV LFMPHTWPVW
GNKHINDYIG KYRDTIKYIH DQTLHLANQG YTMNEIGDMI KLPPALANNW ASRGYYGSVS
HNARAVYNFY LGYYDGNPAN LHPYGQVEMG KRYVQALGGS ARVINLAQEA NKQGDYRWSA
ELLKQVIAAN PGDQVAKNLQ ANNFEQLGYQ AESATWRGFY LTGAKELREG VHKFSHGTTG
SPDTIRGMSV EMLFDFMSVR LDSAKAAGKN ISLNFNMSNG DNLNLTLNDS VLNYRKTLQS
QADASFYISR EDLHAVLTGQ AKMADLVKAK KAKIIGNGAK LEEIIACLDN FDLWVNIVTP
N