Gene Information |
Locus tag | Smed_3805 |
Symbol | |
ID | 5318103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 257084 |
End bp | 258742 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640775617 |
Product | sulfatase |
Protein accession | YP_001312550 |
Protein GI | 150375954 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
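The coordinates and lengths in the table above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(257084, 258742)
print(length, protein_length_aa(length))  # prints: 1659 552
```

Under these assumptions the result matches the Gene Length and Protein Length rows of the record.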
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCTTTC CAGATGCTGA CAAGCGCAAA AATGCGCAGG GTTCAATCTC CATAGACCGC CGAAGCCTTT TGCTGGGAGG CACCATCCTT GCGGCTGCCG CTGCAGCGAA CGGAGCCGTG GCCGTCGGCA GCGCGAAGGC TCAGGAGCAA TCTTCGGCAG GTAGCGGCAA GACTCCGAAC ATCCTGGTCA TCTTCGGCGA CGACATCGGC ATTCCGCAAA TCAGCGCCTA CACCATGGGC CTGATGGGCT ATCGCACGCC GAACATCGAT CGTATCGCCG CCGAGGGCGC GATCTTCACC GATGCCTATG GCCAGCAGAG CTGCACCGCA GGGCGCGCTT CCTTCATACT CGGCCAGGAG CCGTTCCGCA CGGGGCTGCT GACGATCGGC ATGCCCGGCG ATCCGCACGG CATTCAGGAC TGGATGCCGA CCATCGCCGA CGTCATGAAG TCGAAGGGAT ATGCAACCGG CCAGTTCGGC AAGAATCACC TGGGCGACCG CGACGAGCAC CTGCCGACGA ACCATGGCTT CGACGAGTTC TTCGGCAACC TCTACCACCT GAATGCCGAA GAGGAGCCGG AAGGCTACTT CTATCCGAAG GACGAGGAAT TCCGGAAGAA TTTCGGGCCG CGCGGCGTGA TCAAGTCCAG CGCGGACGGA AAGATCGAGG ATACCGGAGC GCTCAACACC AAACGGATGG AGACGGTCGA CGAAGAGTTC CTCGCGGCAG CGAAGGATTT TATCGATCGC CAAGCGAAGG CCGACAAGCC GTTCTTCTGC TGGTTCAACT CGACACGCAT GCACGTGTTT ACCCACCTGA AGCCGGAATC CATGGGCAAG ACCGGTAAGG GCATCCACGC CGACGGCATG GTGGAGCATG ACGGCCATGT CGGTCAGCTG CTGCAGCAGC TCGACGACCT CGGCATCACG GAAAACACGA TCGTGCTCTA CACGACCGAC AACGGTGCCG AGCTCGCTTT GTGGCCGGAT GGCGCGATGA CCATGTTCCA TGGCGAAAAG GGAACGACGT GGGAAGGTGG CTTCCGCATT CCCATGATGG TGCGCTGGCC CGGCGTGGTG AAGCCGGGCA CGCAAATCAA CGACCCCGTG ACGCTGATGG ACTGGATGCC GACCTTCGCC ACCGCCGCCG GAATCCCTGA TGTCAAGGAG GAGATGAAGA CGGGCTTCAA GTCCGGCGAC AAGACCTTCA AGGTCCATCT CGACGGTTAT GACTTGACCG CGCTCCTTAA GGGTGAGGCC GAGGAGCCGC CTCGCGAAGC AGTCTACTAC TTCGACCAGG GCGGCAATCT GAATGCCATA AGGTGGAATG ACTGGAAGCT CAGCTTCGCA GTCAACAGCG AAGGCAACAT CGCCACCGCC ACGCGCGAGA CGCCGAGTTG GGCCAATATC GCCAATCTGA GAATGGACCC TTATGAACGG GGGACGAAAG AAGGCGGAGG GGCCATGGAG TTCATAGCCC GAAACATGTG GCTCCTGGTG CCGATCCAGA GCAAGATCAA GGAGTTCTTC CAGGATTTTG ACCAGTACCC CTACCAGCCC GGCAGCACGC TCAACGCCAG CGGCATAAAC TACAACCTGT TGCAGCAGCA GGCTGCGCTG AAGCGGCTCG GCGACCTGGA GCGTCTCGCG CCGCGCTGA
Protein sequence | MSFPDADKRK NAQGSISIDR RSLLLGGTIL AAAAAANGAV AVGSAKAQEQ SSAGSGKTPN ILVIFGDDIG IPQISAYTMG LMGYRTPNID RIAAEGAIFT DAYGQQSCTA GRASFILGQE PFRTGLLTIG MPGDPHGIQD WMPTIADVMK SKGYATGQFG KNHLGDRDEH LPTNHGFDEF FGNLYHLNAE EEPEGYFYPK DEEFRKNFGP RGVIKSSADG KIEDTGALNT KRMETVDEEF LAAAKDFIDR QAKADKPFFC WFNSTRMHVF THLKPESMGK TGKGIHADGM VEHDGHVGQL LQQLDDLGIT ENTIVLYTTD NGAELALWPD GAMTMFHGEK GTTWEGGFRI PMMVRWPGVV KPGTQINDPV TLMDWMPTFA TAAGIPDVKE EMKTGFKSGD KTFKVHLDGY DLTALLKGEA EEPPREAVYY FDQGGNLNAI RWNDWKLSFA VNSEGNIATA TRETPSWANI ANLRMDPYER GTKEGGGAME FIARNMWLLV PIQSKIKEFF QDFDQYPYQP GSTLNASGIN YNLLQQQAAL KRLGDLERLA PR
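The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch for joining such a display string into a contiguous sequence and computing its GC percentage. The helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence, used only as a small example.
prefix = join_blocks("ATGAGCTTTC CAGATGCTGA")
print(prefix, len(prefix))  # prints: ATGAGCTTTCCAGATGCTGA 20
```

Applying `gc_percent` to the full joined gene sequence should give a value comparable to the GC content row, assuming the database computes it over the same span.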