Gene Information |
Locus tag | Smed_2065 |
Symbol | |
ID | 5322924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2116263 |
End bp | 2117891 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640791002 |
Product | sulfatase |
Protein accession | YP_001327733 |
Protein GI | 150397266 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
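The length fields above follow from the coordinates. A minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon is assumed not to encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2116263, 2117891)
print(length)                  # 1629
print(protein_length(length))  # 542
```

Both values agree with the Gene Length (1629 bp) and Protein Length (542 aa) fields of this record.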
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0577602 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCAA CCGAGAGAGC CTTGATGAGC AAGCGCCCCA ATATCGTCCT CGTACTGGCT GATGACATGG GTTTTTCCGA CCTCGGCTGC TATGGCGGAG AAATCTCGAC GCCCAATCTG GACAGCCTCG CGCGTCGCGG CGCGCGCTTC ACGCAGTTCT ATAACACGGC GCGATGCAGC CCATCGCGTG CGTCGCTTCT GACCGGGCTT CATCCTCACC AGACCGGCAT CGGTATTCTG ACCAACAACG ACTTGCCGCG AGGCTATCCG GGTAACCTGA ACCTGCGATG CGCCACGCTG GCGGAAATGC TGAAAGCTGC CGGATATGCG ACATGCCTCT CGGGGAAATG GCACCTGGCG AGCGAAATGC ACGAACCGAA CGATACCTGG CCGACGAGAC GCGGTTTCGA CCGGTTCTTC GGCACGCTCA CCGGCTGCGG CAGCTTCTAT ACGCCCGGAA CGCTGACCCG CGGCGAATGC GACGCCTCGG CCGAAGCACT CGACCCGGCA TTCTTCTATA CCGACGCCAT CGCCTCTCAT GCCGCGGAAT TCGTCACCGA ACAGTCCGCG GCAGGCAATC CGTTCTTTCT CTACGCCGCC TTCACCGCTC CCCATTGGCC GCTTCATGCA CATCCCGGCG ATATCGACCG TTATCGGGGG CGCTTTGACG AAGGCTGGGA CGTTCTGCGC GAAAAGCGGA TGAAGCGGCT GGTCGAGGAA GGAATTCTTA CGGCGAGCAC CGCAATCAGC GCGCGCGATC CCACGCAGCC CGCCTGGTCT GATACGAAGG AAAAGGCTTG GCAAGTCAAA CGAATGCAGG CCTATGCTGC CCAAATCGAG CGAATGGACC GCGGGATCGG CAAAATCATC GAGGCACTTA AGACCGGCGG CACCTTCGAA AACACGGCTT TCATCTTCCT GTCCGACAAT GGAGCATCGC CGGAGGATCT GCCGCAATTC GACGCCGAAA AATTCATGCG GCGAACGGAC ATTCTTCCAC GGGCGACGCG CGATGGACTA CCGATGCGTG TCGGCAATAC TCCCGATATC TGCCCTGGTG CCGAAGACAC ATATTCCAGC TATGGCCGTG CCTGGGCAAA CCTGTCCAAT ACGCCCTTCC GCTTCTACAA ACGGTGGGTG CATGAAGGCG GCATCGCCAC GCCATTGATT GTCCATTGGC CCGCAGGAGG ACTCGATTGC GGCGCGATCC TCGATCAACC CGCCCAGCTC GTCGATATCG CCCCAACCAT TCTGGAAGTG ACCGGTGCAA GCTATCCGCT TCAGGCTATC GGCCGGGAAA TCGCTCCGCT GGAAGGCTGC AGCCTGCTTC CTGCATTGAA GGGCGAAATA CTCTTCGAGA GGCCGCTCTA CTGGGAACAC ACGGGTAATG CCGCAATCCG CCTCGGACGA TGGAAGCTTG TTCGCGAAGA GCCAAATGGC TGGGAACTTT ACGACCTTGC AGCCGATCGC ACAGAGCTGA ACGACGTGGC GCCGGGCAAC CCTGAGGTCG TCGCGGACCT CCGGGCAAAA TGGGAAGCCT GGGCAGAGCG CATCGGCGTC ATCCCCTGGG AGGTAACGCT CGGCATTTAC GAGGAACGCG GTCTGCATCC GACCTGGGCA GCCGGCTGA
|
Protein sequence | MQPTERALMS KRPNIVLVLA DDMGFSDLGC YGGEISTPNL DSLARRGARF TQFYNTARCS PSRASLLTGL HPHQTGIGIL TNNDLPRGYP GNLNLRCATL AEMLKAAGYA TCLSGKWHLA SEMHEPNDTW PTRRGFDRFF GTLTGCGSFY TPGTLTRGEC DASAEALDPA FFYTDAIASH AAEFVTEQSA AGNPFFLYAA FTAPHWPLHA HPGDIDRYRG RFDEGWDVLR EKRMKRLVEE GILTASTAIS ARDPTQPAWS DTKEKAWQVK RMQAYAAQIE RMDRGIGKII EALKTGGTFE NTAFIFLSDN GASPEDLPQF DAEKFMRRTD ILPRATRDGL PMRVGNTPDI CPGAEDTYSS YGRAWANLSN TPFRFYKRWV HEGGIATPLI VHWPAGGLDC GAILDQPAQL VDIAPTILEV TGASYPLQAI GREIAPLEGC SLLPALKGEI LFERPLYWEH TGNAAIRLGR WKLVREEPNG WELYDLAADR TELNDVAPGN PEVVADLRAK WEAWAERIGV IPWEVTLGIY EERGLHPTWA AG
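The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how the Gene sequence field could be cleaned, checked for GC content, and translated; the helper names are hypothetical. It uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle alternative start codons (not needed here, since this gene starts with ATG):

```python
from itertools import product

# Codon table in the conventional TCAG order; amino acid assignments are
# those of the standard code, which table 11 shares for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field of this record.
print(translate("ATGCAACCAA CCGAGAGAGC CTTGATGAGC"))  # MQPTERALMS
```

The output matches the first block of the Protein sequence field. Passing the full Gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.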