Gene Information |
Locus tag | Smed_1130 |
Symbol | |
ID | 5321976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1197841 |
End bp | 1199454 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640790071 |
Product | sulfatase |
Protein accession | YP_001326816 |
Protein GI | 150396349 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
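The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table above.
START_BP = 1197841
END_BP = 1199454


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1614
print(protein_length(length_bp))  # 537
```

Under these assumptions both results agree with the Gene Length (1614 bp) and Protein Length (537 aa) rows.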
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.216257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGC AGCCAAATAT TCTGTTCATA ATGTCGGACG ACCATGCCGC CCGGGCGATA TCGGCTTACG GCTCGGGCCT GAACAGCACG CCCAACATCG ACCGCATCGC CAATGAGGGG ATGCGGCTCG ATCGGTGTTA TGTCACCAAT TCGATCTGCA CGCCCAGCCG CGCCGCTATT TTGACCGGTA CCTATAACCA TGTGAACATG GTCACCACGC TCGACACCCA TATCGATAAT CGGCTCCCAA ACGTCGCCAA GCACTTGCGC GCCGGTGGTT ATCAGACGGC GATCTTCGGC AAGTGGCATC TGGGCGAGGG AAAGGCGCAC GAGCCTTCGG GTTTCGATGA ATGGTCCGTG GTTCCCGGCC AGGGTGAGTA TTTCGATCCG GTGATGATCG ACCCGAGCGG CTCCCGCATG GAGAAGGGCT ACGCCACCGA CATCATCACC GACAAATGCC TCGATTTTCT CAGCAGGCGC GATATCGGAA GACCCTTCTT CCTGATGTGC CATCACAAGG CCCCACATCG GAGCTTCGAG CCGCATCCAC GCTACAAACA ACTTTATGCA GACGGGAATC TGCCGGTTCC GGAAACATTT TCCGACGACT ATTCCAACCG CGCCGCCGCT GCCGCCGCCG CGAAAATGCG CGTCCGCTCC GACATGACGT ATAAGGACCT CGGTCTCGTC CAGCCCGAAG GCGGCGAGGA GACTGGCGAA CTGCTCCTGC CGGGCTGGAC GCAGCGCAAG GTCCCCGATA TCGCGGAAGG TGGATCGTTG CGTCTAATCG ACGGGGCGAC AGGTGAGAAC TACCTCTTCA CCGATCCGCA GAAGCTGGCT CTTTTCAAAT ATCAGCGCTA CATGATGCGT TATCTGCAGA CCATCGCCGC CGTGGACGAC AATGTCGGCC GATTGCTCGA CTATCTCGAC GCGGAAGGAC TGCGCGGCGA CACCATCGTC ATCTACACGT CCGACCAGGG CTTCTTTCTC GGCGAACACG GCTGGTTCGA CAAGCGCTTC ATGTACGAAG AATCAATGCA GATGCCTTTC CTGATCCGCT ATCCGCAAGG CATCGAGGCC GGCGTGCAGG CCAGCCACAT CGCGACCAAT GTCGACTTTG CGCCGACCTT TCTCGATTAT GCCGGCTTGC AGATCCCCAG TTACATGCAG GGACGGAGCA TGCGCCCGAT TTTCGACCGA ACGGCAGATG ACGGCGACAC GGGCCTCGCC TATCACAGAT ATTGGATGCA CAAGGACGAG TTCCACAACG CGTTCGCGCA TTACGGCGTT CGTGACGCGC GCTATAAGCT CATCTATTGG TATAACGATC CGCTCGGGCA ACTCGGCGCC TTTTCCGGTG TTGAACCTCC GGAGTGGGAG CTGTTCGATT GTGAGAAGGA CCCATTCGAA CTTCATAACC GCGCGAACGA CCCGGCCTAT TCGGAAATTT TCGAAGAATT GCTTGCCAAG CTCGATGCGC GCATGGCCGA GATCGGCGAT ACTCCGGAGC ATAGGAGCGC TGAGGTGCTG GCTGGTCTGC GAAGCAGGGG TCTGAGCCTC GACATGCAAT CGGATCAAGC GAACGACAGG CGGTGGGCAA TAGCTGCCCC TTAA
|
Protein sequence | MTKQPNILFI MSDDHAARAI SAYGSGLNST PNIDRIANEG MRLDRCYVTN SICTPSRAAI LTGTYNHVNM VTTLDTHIDN RLPNVAKHLR AGGYQTAIFG KWHLGEGKAH EPSGFDEWSV VPGQGEYFDP VMIDPSGSRM EKGYATDIIT DKCLDFLSRR DIGRPFFLMC HHKAPHRSFE PHPRYKQLYA DGNLPVPETF SDDYSNRAAA AAAAKMRVRS DMTYKDLGLV QPEGGEETGE LLLPGWTQRK VPDIAEGGSL RLIDGATGEN YLFTDPQKLA LFKYQRYMMR YLQTIAAVDD NVGRLLDYLD AEGLRGDTIV IYTSDQGFFL GEHGWFDKRF MYEESMQMPF LIRYPQGIEA GVQASHIATN VDFAPTFLDY AGLQIPSYMQ GRSMRPIFDR TADDGDTGLA YHRYWMHKDE FHNAFAHYGV RDARYKLIYW YNDPLGQLGA FSGVEPPEWE LFDCEKDPFE LHNRANDPAY SEIFEELLAK LDARMAEIGD TPEHRSAEVL AGLRSRGLSL DMQSDQANDR RWAIAAP
|
| |
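The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the blocks can be joined and the GC percentage computed as shown below. The helper names are hypothetical, and only the first three blocks of the gene sequence are used, so the result is not expected to match the 58% reported for the full gene. How the database rounds its GC value is not stated here.

```python
def clean_sequence(grouped: str) -> str:
    """Join space-separated display blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three display blocks of the gene sequence above.
first_blocks = "ATGACCAAGC AGCCAAATAT TCTGTTCATA"
seq = clean_sequence(first_blocks)
print(len(seq))         # 30
print(gc_percent(seq))  # GC percentage of this 30-base prefix only
```

Passing the complete Gene sequence value to `clean_sequence` and then to `gc_percent` gives the figure to compare against the GC content row.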