Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1567 |
Symbol | |
ID | 5322425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1662853 |
End bp | 1664790 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640790511 |
Product | sulfatase |
Protein accession | YP_001327243 |
Protein GI | 150396776 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.116076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.327443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGG AGACGCCGTT GGCCCGTATC GATTCCGCCG TGAGCAGCTC TGACGTCTAC GCCAATGGGC AAGTCTCTTC CGCAAGATAC GCCAGAACCC TGTCGAGCGC CCGCAGCGCC CTATTCACGC TTCTGCTCGC GATCTCGGTC GTCTTCACCA TCGAACTTAT CGTGCGCTGG TCGTGGCCCG ACACCGTCGC CTATTTTACC GATCCCATGC GGCCGGCCTG GACCACGGTT GCCGTCTTCT TCCTCGCAAT GCTCGGCGTC GATGCCCTGT TCGGCCGGGA ACACAAGGCA GCACTGCTCG TTGCGCCGCT TGCCGTGGTA CCCGCCTTCA TCAGTCAGCA GAAGCAGGTC TTCCTGTCCG ATCCACTCTA CCCGACCGAT TTCCTCTTTG GCCGGCAGAT CATGGAACTG ATGCCGGTTC TCGTAAAGGA TAGGCCCTGG ACTGCCGTCG GCGTCGTGGC CGGAATCATA ATCGCGATCG TCGTTTCCGT CCTCCTTCTG CGATTTGCCT GGCGGAACTT CCCCAAGCTG ACCCGCCGTG AACGTATGGC GCGCATCGCA TTTGCCTTGC CGTTGCTGGT AGCGTTCTGG AACATCATGG ACTACAACCA GTTCTCCTGG ATTCGTGACC GGCTGCGGGT CATCCCCATC ATGTGGGACC AGACCGAGAA CTATCGCCAC AACGGCTTTG CCCTGGCTTT CGCCATCAAC CTCCCCATGG CCAATGTAAG CGCGCCGGCT GGCTACATGG CGGATGCGAT CGAGCGGATT CCGGTCAAGC CGCTTCCCGC CGGTACGAGC CATCGCGGCA AGCCGGACGT GATCGTGCTC ATGAGCGAAT CCTTCTGGGA CCCCACCCGT CTTCCCAAGG TGAAGCTGAC ACCCGATCCC ATGCCGACGA TCCGCGAACT GCAGGGCGGC AACGTATTTT CTCCGGAGTT CGGTGGCATG ACAGCCAATG TCGAATTCGA GGCGGTGACG GGTTTTTCCA ACGCGTTCCT TCCCTATGGC AGCATTCCCT ACCAGCAATA TATACGAAAT CCGATCCCCT CGCTTGCCAC CTTCTTCCGC AGTGAAGGTT ACGTCTCACG CGCCATTCAT CCTTTTCAGG GATGGTTCTG GAACCGCAAT GCCGTCTACA AAGCCTTCGG TTTCGATATG TTCAAGTCGG AGGAGAACAT GCCGCCGATG GCCAAGCGTG GCATCTTCGC CTCTGACGAG TCGTTGACGA AGGAGATCAT CCGCCAGGCA GACGAGCTGG AAGACCCTTT CTTCTTCTTC GCCGTAACCC TGCAGGGCCA TGGTCCCTAT GAGGCCAACC GATACGCGAA GAACACGATC AAGGTCGAAG GCGAGCTCTC CGACGCCGAT CGTCAGGTAC TTGCGACCTA TGCTCAAGGC GTGAAGGAAG CCGATGACAG CCTCAAGATG CTGATGGACT GGGCGAAAGA ACGGGACCGG GAGACGATCA TCGTTCTCTT CGGCGATCAC CTGCCGCCGC TGAACACCGT CTATTCCAGC ACCGGCTACA TGAAGGGAAT CACGGCCGAG CGGAAGGGAC CGAAGGATCA GATGAAGGCC GAGCACGAAA CACCGCTCGT CGTCTGGTCG AACAAGACAG GTCCGAAAAA GAAGATCGGC ACGATCAGCC CGGCCTTTCT TTCCTATCAG ATTCTGAAGC AGGCCGGATA TGAGCACCCC TACTACACCG GTTTCCTTGG AAAGGTTTAT GATCACTACC GCGTCCTCGA CCGTTACATG CTGATCCGCA AGAACGGCAA GGATGTCGCC GACTGGCTCC GCCAACGGAA GATACCGGCA TCGTTGCGTG ACTACCGCTT CCTGCAGCAC GACATGATGT TCGGCAAGCG CTACAGCACC GAGCGCTTCT TCCAGTCCCA CGCCGATCTC TACAGCGCCG GTTTGTAA
|
Protein sequence | MKLETPLARI DSAVSSSDVY ANGQVSSARY ARTLSSARSA LFTLLLAISV VFTIELIVRW SWPDTVAYFT DPMRPAWTTV AVFFLAMLGV DALFGREHKA ALLVAPLAVV PAFISQQKQV FLSDPLYPTD FLFGRQIMEL MPVLVKDRPW TAVGVVAGII IAIVVSVLLL RFAWRNFPKL TRRERMARIA FALPLLVAFW NIMDYNQFSW IRDRLRVIPI MWDQTENYRH NGFALAFAIN LPMANVSAPA GYMADAIERI PVKPLPAGTS HRGKPDVIVL MSESFWDPTR LPKVKLTPDP MPTIRELQGG NVFSPEFGGM TANVEFEAVT GFSNAFLPYG SIPYQQYIRN PIPSLATFFR SEGYVSRAIH PFQGWFWNRN AVYKAFGFDM FKSEENMPPM AKRGIFASDE SLTKEIIRQA DELEDPFFFF AVTLQGHGPY EANRYAKNTI KVEGELSDAD RQVLATYAQG VKEADDSLKM LMDWAKERDR ETIIVLFGDH LPPLNTVYSS TGYMKGITAE RKGPKDQMKA EHETPLVVWS NKTGPKKKIG TISPAFLSYQ ILKQAGYEHP YYTGFLGKVY DHYRVLDRYM LIRKNGKDVA DWLRQRKIPA SLRDYRFLQH DMMFGKRYST ERFFQSHADL YSAGL
|
| |