Gene Information |
Locus tag | Smed_1239 |
Symbol | |
ID | 5322086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1327067 |
End bp | 1328731 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640790180 |
Product | sulfatase |
Protein accession | YP_001326924 |
Protein GI | 150396457 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
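The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention explicitly. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Smed_1239.
length = gene_length(1327067, 1328731)
print(length)                           # 1665
print(expected_protein_length(length))  # 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields.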
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0342863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.143694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGTA GAATTATCTG CGGCGTTGGA GCATTGGCTG CTTCCACCGT CCTTTGGGGC GCGCTTGCCC CAGTGCAAGC TCAGGAGACG AGAAAGCCCA ACATCCTGTT CATCGTGTCC GACGATACCG GCTACGGCGA TCTCGGCCCT TACGGCGGCG GCGAGGGCCG CGGCATGCCT ACGCCGAACA TCGACAAGCT GGCGGATGAA GGCATGACCT TCTTTTCCTT CTACGCCCAA CCAAGTTGCA CGCCCGGCCG CGCTGCCATG CAGACCGGGC GCATTCCGAA CCGCAGCGGC ATGACCACGG TCGCCTTCCA GGGCGAGGGC GGTGGGCTGC CGGCAGCCGA ATGGACGTTG GCGTCTGTGC TGAAACGCGG CGGGTACCAG ACCTATTTCA CCGGCAAGTG GCATCTCGGC GAGGCAGACT ACGCGCTGCC GATCGCACAG GGCTACGATG AAATGAAATA TGTCGGCCTC TATCATCTCA ATGCCTACAC TTATGCGGAC GCAACCTGGT TCCCGGACAT GGACCCGGAA CTCAGGGCCA TGTTCCAGAA AGTGACGAAA GGCTCGCTTT CCGGCAAAGC CGGCGGGGAG GTGACGGAGG ACTTCAAAAT CAACGGACAA TACGTCGACA CTCCCGTGAT CGACGGTAAG CCCGGTGTCG TCGGGATCCC GTTCTTCGAC GGCTATGTCG AAAAGGCGGC AATCGAATTC CTCGATGCCG CCGCCAAAAA GCCGGATCAG CCGTTCTACA TCAACGTGAA CTTCATGAAG GTGCACCAGC CGAATCTTCC TGCCCCGGAG TTCCAGCATA AGTCCTTGTC AAAGACCAAA TATGCGGACT CCGTCGTGGA GCTCGACACG CGCATCGGCA GGATCCTGGA CAAGCTGCGT GAAACCGGCA TGGACAGGAA CACCCTGGTT TTCTACACGA CCGACAACGG GGCCTGGCAG GACGTCTATC CCGACGCCGG CTATACGCCG TTTCGCGGTA CCAAGGGCAC CGTGCGCGAG GGCGGCAATC GCGTTCCCGC CATCGCGGTC TGGCCCGGCA AGATCAAGCC GAGTTCGAAG AATCACGACA TCGTCGGCGG CCTTGACCTG ATGGCGACGT TCGCGGCAGC CGGCGGCGTC CCGCTTCCTG ACAAGGACCG CGAAGACAAG CCGATCGTCT TCGACAGCTA CGACATGTCG CCGGTTCTTC TCGGCACCGG CAAGTCGGCG CGCAAGTCCT GGTTCTACTT CACCGAGAAC GAGTTGACGC CGGGCGCCGC CCGTGTCGGC AACTACAAGG CGGTGTTCAA CCTGCGGGGT GACAACGGGC AATCGACCGG AGGATTGGCA GTCGATTCCA ATCTCGGCTG GAAGGGGGCG GAGAGCTATG TGGCGACAGT CCCGCAGGTC TTTGATCTCT GGCAGGATCC GCAGGAACGC TACGACATTT TCATGAACAA CTATACGGAG CACACCTGGA CGATGGTGAG CATCAGCAAC GCCATCGAGG AGTTGATGAA GACCTATGTG CAGCATCCGC CGCGCAAGCT GCAGAGCGAA TCTTACAGCG GACCGCTTAC GATCACCAGT TACCAGCGCT TTGAGTGGGC GCGCCAGCAA CTTGGGAAGG AAGGCGTGAA CATCCCGCTA CCGACAGGCA ACTGA
|
Protein sequence | MNRRIICGVG ALAASTVLWG ALAPVQAQET RKPNILFIVS DDTGYGDLGP YGGGEGRGMP TPNIDKLADE GMTFFSFYAQ PSCTPGRAAM QTGRIPNRSG MTTVAFQGEG GGLPAAEWTL ASVLKRGGYQ TYFTGKWHLG EADYALPIAQ GYDEMKYVGL YHLNAYTYAD ATWFPDMDPE LRAMFQKVTK GSLSGKAGGE VTEDFKINGQ YVDTPVIDGK PGVVGIPFFD GYVEKAAIEF LDAAAKKPDQ PFYINVNFMK VHQPNLPAPE FQHKSLSKTK YADSVVELDT RIGRILDKLR ETGMDRNTLV FYTTDNGAWQ DVYPDAGYTP FRGTKGTVRE GGNRVPAIAV WPGKIKPSSK NHDIVGGLDL MATFAAAGGV PLPDKDREDK PIVFDSYDMS PVLLGTGKSA RKSWFYFTEN ELTPGAARVG NYKAVFNLRG DNGQSTGGLA VDSNLGWKGA ESYVATVPQV FDLWQDPQER YDIFMNNYTE HTWTMVSISN AIEELMKTYV QHPPRKLQSE SYSGPLTITS YQRFEWARQQ LGKEGVNIPL PTGN
|
| |
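The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate the coding sequence. It assumes that the displayed gene sequence is already the coding strand (it starts with ATG even though the gene lies on the minus strand) and that the standard codon assignments apply, which translation table 11 shares with the standard code apart from its alternative start codons. The helper names are hypothetical, and the translation does not handle alternative start codons or ambiguous bases.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAATCGTA GAATTATCTG CGGCGTTGGA")
print(len(prefix))        # 30
print(translate(prefix))  # MNRRIICGVG
```

The translated prefix matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.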