Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3146 |
Symbol | |
ID | 5324025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3296658 |
End bp | 3298127 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640792094 |
Product | sulfatase |
Protein accession | YP_001328805 |
Protein GI | 150398338 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAAGC CGAATGTCCT GTTCATATTC TCTGATCAGC ACGCGCAGAA AGTAGCGGGC TGTTATGGCG ATGATGTCGT CCGAACGCCC AATATCGACC GGCTCGCGCA GGAAGGTGTG CGCTTCGACA ATGCCTATTG CCCGTCACCG ATCTGCACGC CAAGCCGGAT GTCGATGCTG ACCGCCCGCT GGCCGCACAG GCAGGAATGC TGGACGAATG ACGACATGCT TCGTTCGGAT GTCCCGACAT GGCTGCACCG GGCGGGAGAG GCCGGCTACC GCCCCGCGCT GATCGGCCGG ATGCATTCGA TAGGCCCGGA CCAGCTTCAC GGCTATGCCG AGCGTGGAAT CGGGGATCAT ACGCCGAATT TCGCGGGGAT CGCACGATTT CCCATGGGTG TGCTTGAAGG CACCAACGAA CCGGATTCCG TATCCCTGAC GCAAAGTGGA GCGGGAATGG CGATCTACCA GCGGAAAGAT CAGGACGTCG TGGATGCCGC TGCGGCCTGG CTTCGGGATA AGGGAGCGGC CAGAAACGCC GCCGGGCAGC AATTTTGCCT GACGGTCGGG CTGATGACGC CGCATGCCCC CTATGTCGTC GATCGCGAGG CCTTCGACCA TTATCACGGG CAAGTACCGC CGCCCCGTCT GGATGTGCCG CAGGACGAGC ATGACTGGCA TCGGTGGTGG CGTCACGACC GCGGCATCGG CGAAGTGTCT GATGCAGTCA GGGATCGCGC CCGTGCCGCC TATTGGGGGC TTGTGCAGCG CACCGATGAA ATGATCGGGC AGGTTCTCGA CGCGCTCAAG GAAATCGGCG CGATGGACGA TACCCTGATC GTCTATGCGT CCGATCACGG CGACCATGTC GGAGAACGTG GCCTGTGGTG GAAACATACA TTCTTTGAAG AATCCGTGAA GTTTCCGCTG GTGATGCGGC TTCCCGGCGC CATTCCGGCG GGCGAAAGCC GGGATCAGGT CGTCAATCTG GTCGATCTCA GCCAGACGAT GATCGAGGTC ATGGGGGCTC AGCCTTTGCC CTATGCGGAT GGCAAAAGTT TCTGGGCCGT TGCCTGTGAT CGCGAAGCGC CATGGGAGAA CGAGACATTC AGCGAATACT GTACGGATCC GGTACCATCA TGGACCGGAG GACGGGCTGT GCAGCAGCGG ATGATCCGGT CAGGTTCGTG GAAATTGTCC GTCTATGACG GCGAGCCGCC GCTTTTGTTT GATCTGTCCA CAGATCCGGA TGAGCGTATC AATCGCGCCG AGGACCCGGA TTGTGCGGAA ATGTTTCAAC GATTGTCGGC GCGCCTGGCC CATGACGGTT GGCGGCCCGA AACGGTTGCC GCCCGCATGC GGGAACGACG AGCCGAGAAA GACATTCTCG CCGCTTGGGC GCGTGAGGTT CAGCCAGCGC AGACCCATGT CTGGGAATAC ACCGCCGATA TGAACAGACT CGATAACTAA
|
Protein sequence | MNKPNVLFIF SDQHAQKVAG CYGDDVVRTP NIDRLAQEGV RFDNAYCPSP ICTPSRMSML TARWPHRQEC WTNDDMLRSD VPTWLHRAGE AGYRPALIGR MHSIGPDQLH GYAERGIGDH TPNFAGIARF PMGVLEGTNE PDSVSLTQSG AGMAIYQRKD QDVVDAAAAW LRDKGAARNA AGQQFCLTVG LMTPHAPYVV DREAFDHYHG QVPPPRLDVP QDEHDWHRWW RHDRGIGEVS DAVRDRARAA YWGLVQRTDE MIGQVLDALK EIGAMDDTLI VYASDHGDHV GERGLWWKHT FFEESVKFPL VMRLPGAIPA GESRDQVVNL VDLSQTMIEV MGAQPLPYAD GKSFWAVACD REAPWENETF SEYCTDPVPS WTGGRAVQQR MIRSGSWKLS VYDGEPPLLF DLSTDPDERI NRAEDPDCAE MFQRLSARLA HDGWRPETVA ARMRERRAEK DILAAWAREV QPAQTHVWEY TADMNRLDN
|
| |