Gene EcSMS35_1714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1714 
Symbol 
ID6144119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1721250 
End bp1723226 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content45% 
IMG OID641616590 
Productmetallo-beta-lactamase family protein 
Protein accessionYP_001743768 
Protein GI170683013 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0306095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTTA ATCATATTGT TAAAAGTCTG CTAATAACGG GATTATTCAC CACCAGTTCT 
TTGCCACTCC TCGCGGCGGA AGCCCCTAAA GACGCCACCG CAGCCACGCA ACAAGCAAAC
AATTTACTCT ACAACCAGTT GCCGTTTTCT GATAACACTG ACTTTACTGA TGCCCATAAA
GGTTTTGTTG CCCCTATCCC TCAAGACGTG ATTAAGGGCG AAAAAGGAAA TGTTATCTGG
GATCCACAAC AATACGCTTT TATTAAAGAA GGTGATAAAG CCCCTGATAC GGTGAATCCA
AGTTTATGGC GTCAGGCGCA ACTGATAAAT ATCAGCGGGT TATTCGAAGT AACTGAAGGT
GTCTATCAGA TTCGCAATCT CGATCTATCA AATATGACCA TCATTGAAGG CAAAGAGGGT
ATTACCATCG TTGATCCCCT TGTTTCAGCA GAAACAGCCA AAGTGGGTAT GGATCTCTAT
TACAAAAATC GCGACAAAAA ACCGGTAGTA GCGGTTATCT ACACACACAG TCATGTCGAT
CACTACGGTG GTGTTCGCGG CGTAATTGAT GAAGCTGACG TTAAATCAGG CAAAGTTAAA
ATCTACGCGC CAGCCGGATT TTTGGAGGAA GCCGTTTCCG AAAATATTAT GGCTGGCAAT
GTGATGAGCC GACGAGCCAG TTATATGTAT GGCAACTTAT TGAAACCTGA TGTTAAAGGC
CAGGTTGGCG CGGGACTGGG AACAACCACT TCAGCAGGAA CGGTCACGTT GATTGCTCCT
ACTAACTACA TCACCAAAAC CGGACAAAAA GAAACTATTG ACGGGTTGAC CTATGATTTC
TTGATGGCGC CAGGCTCTGA AGCGCCTTCT GAAATGCTGT GGTTTATTGA AGAAAAGAAA
CTGATCGAAA CCGCTGAAGA TGTCACGCAC ACCCTTCATA ACACCTATTC GCTGCGTGGG
GCTAAAATTC GCCAACCGCT GCCGTGGTCA AAATATATTA ACGAAGCTCT CAATTTATGG
GGAGATAAAG CCGAGATTAT TCTCGCTCAA CATCACTGGC CAACCTGGGG CAACGATAAC
GTGGTCAAAC TGCTTAAAAG CCAGCGTGAT TTGTATCGTT ACATCAATGA TCAAACTCTG
CGTATGGCAA ATCAAGGGAT GACCCGTGAT GAAATTGCCG CAAACTTTAA ACTGCCCTCT
TCACTGGCTA ATACCTGGGC GAATCGCGGC TATTACGGTT CGGTAAGTCA TGACGTGAAA
GCAACTTACG TGCTTTATCT CGGCTGGTTT GATGGTAACC CCGCGACGCT TGATGAACTG
CCGCCAGAAG AAGGCGCAAA AAAATTCGTC GAGTATATGG GCGGTGCTGA TGCCATTTTA
CAGAAAGCCA AACAGGATTA TGACCAGGGT AATTTCCGCT GGGTTGCCCA AGTTGTTAGT
AAGGTGGTGT TTACCGATCC AAACAACCAG GCAGCGAGAA ATCTGGAAGC TGATGCGCTT
GAGCAGTTAG GCTATCAGGC TGAATCTGGG CCGTGGCGTA ATTTCTATCT CACTGGCGCA
CAGGAATTGC GTAATGGCGT ACAAAAATTA CCAACACCAA ATACTGCCAG CCCAGACACC
GTGCGTGCGA TGACGCCGGA GATGTTCTTT GATTATCTTG CCGTCCATAT TAATGGTGAG
AAAGCGGCAG ATGCCAAAAC CGTGTTGAAT TTCGATTTTG GTGAAGATGG CGGCACCTAT
AAAGTGGAGC TTGAAAATGG TGTTCTCAAT CATACCGCTG GTGTAGAGGC TTCGGATGCT
GATGCCACTA TCACTCTGTC TCGTGATGTA TTGAACAAAA TTGTACTGAA AGAAGAGACG
CTGAAAGAAG CCACGGCCAA AGAAGATGTC AAAATTACTG GCAATGCGGA AAAACTCAAC
GAGCTGTTAG GTTATATGGA TAATTTTGAA TTTTGGTTCA ATATAGTGAC ACCATAA
 
Protein sequence
MQLNHIVKSL LITGLFTTSS LPLLAAEAPK DATAATQQAN NLLYNQLPFS DNTDFTDAHK 
GFVAPIPQDV IKGEKGNVIW DPQQYAFIKE GDKAPDTVNP SLWRQAQLIN ISGLFEVTEG
VYQIRNLDLS NMTIIEGKEG ITIVDPLVSA ETAKVGMDLY YKNRDKKPVV AVIYTHSHVD
HYGGVRGVID EADVKSGKVK IYAPAGFLEE AVSENIMAGN VMSRRASYMY GNLLKPDVKG
QVGAGLGTTT SAGTVTLIAP TNYITKTGQK ETIDGLTYDF LMAPGSEAPS EMLWFIEEKK
LIETAEDVTH TLHNTYSLRG AKIRQPLPWS KYINEALNLW GDKAEIILAQ HHWPTWGNDN
VVKLLKSQRD LYRYINDQTL RMANQGMTRD EIAANFKLPS SLANTWANRG YYGSVSHDVK
ATYVLYLGWF DGNPATLDEL PPEEGAKKFV EYMGGADAIL QKAKQDYDQG NFRWVAQVVS
KVVFTDPNNQ AARNLEADAL EQLGYQAESG PWRNFYLTGA QELRNGVQKL PTPNTASPDT
VRAMTPEMFF DYLAVHINGE KAADAKTVLN FDFGEDGGTY KVELENGVLN HTAGVEASDA
DATITLSRDV LNKIVLKEET LKEATAKEDV KITGNAEKLN ELLGYMDNFE FWFNIVTP