Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0015 |
Symbol | |
ID | 6146080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 16831 |
End bp | 18324 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641614916 |
Product | sulfatase |
Protein accession | YP_001742132 |
Protein GI | 170683329 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.689426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAA CATTAATGGC CAGTTTGATC GGCCTTGCAG TTTGCACAGG GAATGCTTTT AATCCAGCCG TAGCCGCCGA AACTAAACAA CCCAATTTAG TCATCATTAT GGCGGATGAT TTAGGCTATG GTGATTTGGC AACATATGGA CATCAGATCG TTAAAACGCC CAATATAGAC AGGCTTGCCC AGGAAGGCGT CAAATTTACC GACTATTATG CACCTGCCCC CTTAAGTTCT CCTTCACGGG CGGGACTATT AACAGGGCGG ATGCCATTTC GTACCGGCAT ACGCTCATGG ATCCCAACGG GAAAAGATGT GGCATTAGGG CGTAATGAAC TCACGATTGC TAATCTACTC AAAGCGCAAG GGTACGACAC GGCCATGATG GGTAAGCTGC ATCTGAATGC AGGCGGCGAT CGCACCGATC AGCCGCAGGC AAAAGATATG GGCTTTGATT ACTCACTGGT TAATACGGCG GGTTTTGTTA CCGACGCTAC TCTGGATAAT GCGAAGGAGC GTCCCCGTTT TGGCATGGTC TATCCAACGG GCTGGTTGCG TAACGGGCAA CCCACACCAC GTTCCGATAA AATGAGTGGT GAGTATGTCA GTTCGGAAGT CGTCAACTGG TTGGATAACA AAAAGGACAG TAAGCCTTTC TTCCTTTATG TCGCTTTTAC CGAAGTGCAC AGTCCCCTGG CTTCGCCCAA AAAATACCTC GACATGTACT CACAATATAT GAGCGACTAT CAGAAGCAGC ATCCTGATTT ATTTTATGGC GACTGGGCGG ATAAACCCTG GCGTGGTACA GGAGAATATT ATGCCAACAT CAGTTATCTG GATGCTCAGG TTGGAAAAGT ACTGGATAAA ATCAAAGCGA TGGGTGAAGA AGATAACACC ATCGTTATTT TTACCAGTGA TAACGGACCA GTAACGCGTG AAGCGCGCAA AGTTTATGAA CTGAATTTGG CAGGGGAAAC TGATGGATTA CGTGGTCGCA AGGATAATCT CTGGGAAGGT GGCATCCGTG TTCCGGCGAT TATTAAATAC GGAAAGCATC TTCCAAAGGG AATGGTTTCA GATACGCCTG TTTATGGTCT GGACTGGATG CCTACGCTGG CGAAAATGAT GAACTTCAAA TTACCGACGG ACCGGACTTT TGATGGCGAA TCGTTGGTTC CTGTCCTTGA GAACAAAGCG CTAAAACGTG AAAAGCCATT AATCTTCGGA ATTGACATGC CATTCCAGGA TGATCCAACC GACGAATGGG CGATACGTGA TGGTGACTGG AAAATGATCA TCGATCGTAA CAATAAGCCA AAGTACCTAT ACAACCTCAA AACCGATCGT TTTGAGACCA TTAATCAGAT AGGTAAAAAT CCAGACATTG AAAAACAAAT GTATGGTAAG TTCTTAAAGT ATAAAGCCGA TATTGATAAT GATTCATTAA TGAAAGCCAG AGGTGATAAA CCGGAAGCGG TAACCTGGGG CTAA
|
Protein sequence | MQKTLMASLI GLAVCTGNAF NPAVAAETKQ PNLVIIMADD LGYGDLATYG HQIVKTPNID RLAQEGVKFT DYYAPAPLSS PSRAGLLTGR MPFRTGIRSW IPTGKDVALG RNELTIANLL KAQGYDTAMM GKLHLNAGGD RTDQPQAKDM GFDYSLVNTA GFVTDATLDN AKERPRFGMV YPTGWLRNGQ PTPRSDKMSG EYVSSEVVNW LDNKKDSKPF FLYVAFTEVH SPLASPKKYL DMYSQYMSDY QKQHPDLFYG DWADKPWRGT GEYYANISYL DAQVGKVLDK IKAMGEEDNT IVIFTSDNGP VTREARKVYE LNLAGETDGL RGRKDNLWEG GIRVPAIIKY GKHLPKGMVS DTPVYGLDWM PTLAKMMNFK LPTDRTFDGE SLVPVLENKA LKREKPLIFG IDMPFQDDPT DEWAIRDGDW KMIIDRNNKP KYLYNLKTDR FETINQIGKN PDIEKQMYGK FLKYKADIDN DSLMKARGDK PEAVTWG
|
| |