Gene EcE24377A_4641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4641 
Symbol 
ID5589993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4643130 
End bp4645115 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content52% 
IMG OID640928256 
Productmetallo-beta-lactamase superfamily protein 
Protein accessionYP_001465588 
Protein GI157155027 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACT CTCGGTTATT CCGTTTGAGC AGAATTGTTA TTGCGTTAAC TGCCGCCAGC 
GGCATGATGG TAAATACCGC TTACGCAACA GATGAAGCGA AAGCCGCCAC TCAATATACC
CAACAGGTTA ATCAGAATTA CGCCAAATCA TTACCGTTTA GCGATCGTCA GGATTTTGAC
GACGCTCAGC GTGGATTTAT CGCTCCGCTG CTGGATGAAG GTATTCTGCG CGATGCTAAT
GGCAAACCAT ATTATCGCGG GGAAGATTAT AAATTTGATA TCAATGCGCC CGCGCCCGAA
ACCGTTAACC CTAGCCTGTG GCGTCAGTCG CAGTTAAACG GTATTTCCGG CTTGTTTAAA
GTCACCGACA GAATGTACCA GGTTCGCGGT CAGGATATCT CCAACATCAC CTTTATCGAA
GGTGATACCG GGATTATTGT CATCGACCCG CTGGTGACGC CACCGAGCGC AAAAGCCGCC
CTTGACCTTT ACTTCCAAAA TCGCCCGCAA AAACCGATTG TTGCGGTTAT TTATACCCAC
AGCCATACCG ACCACTATGG CGGCGTGAAA GGCATTATCT CTGAAGCCGA TGTGAAATCC
GGTAAAGTGC AGGTTATCGC CCCTGCTGGC TTTATGGACG AAGCCATCAG CGAAAACGTA
CTGGCGGGTA ATATTATGAG CCGCCGTGCA CTTTACTCCT ACGGCCTGCT GCTGGCGCAT
AACCCTCAGG GTAACATCGG CAATGGTCTT GGCGTAACGC TGGCATCGGG CTACCCGAGC
ATCATCGCAC CGAACAAAAC CATCACCAAA ACCGGTGAGA AGATGATTAT CGACGGCCTG
GAGTTTGACT TCCTGATGAC CCCAGGTAGC GAAGCACCAG CCGAAATGCA CTTCTATATT
CCGGCCCTGA AAGCGCTGTG TACCGCCGAG AACGCCACGC ATACCCTGCA CAACTTCTAC
ACTCTGCGCG GCGCGAAAAC CCGCGACACC AGCAAGTGGA CCGAGTATCT GAACGAAACG
CTGGATATGT GGGGTAACGA CGCGGAAGTC CTGTTTATGC CGCACACCTG GCCGGTCTGG
GGCAATAAGC ATATCAATGA TTATATTGGT AAATATCGCG ATACTATCAA GTACATTCAC
GACCAGACCC TGCACCTGGC GAACCAGGGC TACACCATGA ATGAAATCGG CGACATGATT
AAACTGCCGC CTGCACTTGC CAATAACTGG GCCAGCCGTG GCTATTACGG TTCTGTCAGC
CACAACGCCC GCGCGGTGTA TAACTTCTAT CTTGGCTATT ACGACGGTAA CCCGGCTAAC
CTGCATCCGT ATGGTCAGGT GGAGATGGGT AAACGTTACG TGCAGGCGCT GGGCGGTTCT
GCCCGTGTCA TCAACCTGGC GCAAGAAGCG AACAAGCAAG GTGATTACCG CTGGTCGGCA
GAACTGCTGA AACAGGTGAT TACCGCCAAC CCGGGTGACC AGGTCGCGAA GAATCTGCAA
GCGAATAACT TTGAACAGCT GGGCTATCAG GCCGAGTCCG CCACCTGGCG CGGTTTCTAC
CTGACCGGCG CGAAAGAGCT GCGCGAAGGG GTGCATAAGT TCAGCCACGG CACCACCGGT
TCCCCGGACA CCATTCGCGG GATGTCGGTC GAAATGCTGT TCGACTTTAT GTCCGTTCGC
CTCGATAGCG CGAAAGCCGC GGGTAAAAAT ATCAGCCTGA ACTTCAATAT GAGCAATGGC
GATAACCTCA ACCTGACGCT GAACGATAGC GTGCTTAACT ACCGTAAAAC ACTGCAATCC
CAAGCTGACG CCTCTTTCTA CATCAGCCGT GAAGATCTGC ACGCCGTGCT GACCGGGCAA
GCCAAAATGG CGGATCTGGT AAAAGCGAAG AAAGCCAAAA TTATTGGCAA TGGCGCGAAA
CTGGAAGAAA TTATCGCCTG TCTGGATAAT TTCGATTTGT GGGTGAATAT CGTAACCCCA
AATTAA
 
Protein sequence
MNNSRLFRLS RIVIALTAAS GMMVNTAYAT DEAKAATQYT QQVNQNYAKS LPFSDRQDFD 
DAQRGFIAPL LDEGILRDAN GKPYYRGEDY KFDINAPAPE TVNPSLWRQS QLNGISGLFK
VTDRMYQVRG QDISNITFIE GDTGIIVIDP LVTPPSAKAA LDLYFQNRPQ KPIVAVIYTH
SHTDHYGGVK GIISEADVKS GKVQVIAPAG FMDEAISENV LAGNIMSRRA LYSYGLLLAH
NPQGNIGNGL GVTLASGYPS IIAPNKTITK TGEKMIIDGL EFDFLMTPGS EAPAEMHFYI
PALKALCTAE NATHTLHNFY TLRGAKTRDT SKWTEYLNET LDMWGNDAEV LFMPHTWPVW
GNKHINDYIG KYRDTIKYIH DQTLHLANQG YTMNEIGDMI KLPPALANNW ASRGYYGSVS
HNARAVYNFY LGYYDGNPAN LHPYGQVEMG KRYVQALGGS ARVINLAQEA NKQGDYRWSA
ELLKQVITAN PGDQVAKNLQ ANNFEQLGYQ AESATWRGFY LTGAKELREG VHKFSHGTTG
SPDTIRGMSV EMLFDFMSVR LDSAKAAGKN ISLNFNMSNG DNLNLTLNDS VLNYRKTLQS
QADASFYISR EDLHAVLTGQ AKMADLVKAK KAKIIGNGAK LEEIIACLDN FDLWVNIVTP
N