Gene EcSMS35_0015 details


Gene Information

Locus tag: EcSMS35_0015
Symbol:
ID: 6146080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 16831
End bp: 18324
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 46%
IMG OID: 641614916
Product: sulfatase
Protein accession: YP_001742132
Protein GI: 170683329
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
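
The coordinate and length fields above are related by simple arithmetic: the gene length should equal End bp - Start bp + 1, and for an unspliced CDS the protein length should be one less than the number of codons, since the stop codon is not translated. A minimal Python sketch of that consistency check follows. The function name check_gene_record and its return keys are hypothetical and not part of any database API.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and an unspliced CDS whose
    final codon is a stop codon (both are assumptions of this sketch).
    """
    span = end_bp - start_bp + 1      # inclusive coordinate span
    codons = gene_length_bp // 3      # total codons, including the stop
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_length": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
result = check_gene_record(16831, 18324, 1494, 497)
for name, ok in result.items():
    print(name, ok)
```

For this record all three checks pass: 18324 - 16831 + 1 = 1494, and 1494 / 3 - 1 = 497.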


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.689426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAAAA CATTAATGGC CAGTTTGATC GGCCTTGCAG TTTGCACAGG GAATGCTTTT 
AATCCAGCCG TAGCCGCCGA AACTAAACAA CCCAATTTAG TCATCATTAT GGCGGATGAT
TTAGGCTATG GTGATTTGGC AACATATGGA CATCAGATCG TTAAAACGCC CAATATAGAC
AGGCTTGCCC AGGAAGGCGT CAAATTTACC GACTATTATG CACCTGCCCC CTTAAGTTCT
CCTTCACGGG CGGGACTATT AACAGGGCGG ATGCCATTTC GTACCGGCAT ACGCTCATGG
ATCCCAACGG GAAAAGATGT GGCATTAGGG CGTAATGAAC TCACGATTGC TAATCTACTC
AAAGCGCAAG GGTACGACAC GGCCATGATG GGTAAGCTGC ATCTGAATGC AGGCGGCGAT
CGCACCGATC AGCCGCAGGC AAAAGATATG GGCTTTGATT ACTCACTGGT TAATACGGCG
GGTTTTGTTA CCGACGCTAC TCTGGATAAT GCGAAGGAGC GTCCCCGTTT TGGCATGGTC
TATCCAACGG GCTGGTTGCG TAACGGGCAA CCCACACCAC GTTCCGATAA AATGAGTGGT
GAGTATGTCA GTTCGGAAGT CGTCAACTGG TTGGATAACA AAAAGGACAG TAAGCCTTTC
TTCCTTTATG TCGCTTTTAC CGAAGTGCAC AGTCCCCTGG CTTCGCCCAA AAAATACCTC
GACATGTACT CACAATATAT GAGCGACTAT CAGAAGCAGC ATCCTGATTT ATTTTATGGC
GACTGGGCGG ATAAACCCTG GCGTGGTACA GGAGAATATT ATGCCAACAT CAGTTATCTG
GATGCTCAGG TTGGAAAAGT ACTGGATAAA ATCAAAGCGA TGGGTGAAGA AGATAACACC
ATCGTTATTT TTACCAGTGA TAACGGACCA GTAACGCGTG AAGCGCGCAA AGTTTATGAA
CTGAATTTGG CAGGGGAAAC TGATGGATTA CGTGGTCGCA AGGATAATCT CTGGGAAGGT
GGCATCCGTG TTCCGGCGAT TATTAAATAC GGAAAGCATC TTCCAAAGGG AATGGTTTCA
GATACGCCTG TTTATGGTCT GGACTGGATG CCTACGCTGG CGAAAATGAT GAACTTCAAA
TTACCGACGG ACCGGACTTT TGATGGCGAA TCGTTGGTTC CTGTCCTTGA GAACAAAGCG
CTAAAACGTG AAAAGCCATT AATCTTCGGA ATTGACATGC CATTCCAGGA TGATCCAACC
GACGAATGGG CGATACGTGA TGGTGACTGG AAAATGATCA TCGATCGTAA CAATAAGCCA
AAGTACCTAT ACAACCTCAA AACCGATCGT TTTGAGACCA TTAATCAGAT AGGTAAAAAT
CCAGACATTG AAAAACAAAT GTATGGTAAG TTCTTAAAGT ATAAAGCCGA TATTGATAAT
GATTCATTAA TGAAAGCCAG AGGTGATAAA CCGGAAGCGG TAACCTGGGG CTAA
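
The GC content reported in the gene information can in principle be recomputed from the gene sequence as printed above, in space-separated blocks of ten bases. The following is a rough Python sketch only. The helper name gc_fraction is hypothetical, and how the source database rounds its percentage is not known, so a small difference from the reported value is possible.

```python
def gc_fraction(sequence_text):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored, so
    the sequence can be pasted in the blocked layout used above.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Only the first two blocks of the gene sequence are used here, for
# brevity; paste the full sequence to compare with the reported value.
example = "ATGCAGAAAA CATTAATGGC"
print(f"{gc_fraction(example):.0%}")
```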
 
Protein sequence
MQKTLMASLI GLAVCTGNAF NPAVAAETKQ PNLVIIMADD LGYGDLATYG HQIVKTPNID 
RLAQEGVKFT DYYAPAPLSS PSRAGLLTGR MPFRTGIRSW IPTGKDVALG RNELTIANLL
KAQGYDTAMM GKLHLNAGGD RTDQPQAKDM GFDYSLVNTA GFVTDATLDN AKERPRFGMV
YPTGWLRNGQ PTPRSDKMSG EYVSSEVVNW LDNKKDSKPF FLYVAFTEVH SPLASPKKYL
DMYSQYMSDY QKQHPDLFYG DWADKPWRGT GEYYANISYL DAQVGKVLDK IKAMGEEDNT
IVIFTSDNGP VTREARKVYE LNLAGETDGL RGRKDNLWEG GIRVPAIIKY GKHLPKGMVS
DTPVYGLDWM PTLAKMMNFK LPTDRTFDGE SLVPVLENKA LKREKPLIFG IDMPFQDDPT
DEWAIRDGDW KMIIDRNNKP KYLYNLKTDR FETINQIGKN PDIEKQMYGK FLKYKADIDN
DSLMKARGDK PEAVTWG