Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3791 |
Symbol | |
ID | 5317992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 239958 |
End bp | 241604 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640775604 |
Product | sulfatase |
Protein accession | YP_001312537 |
Protein GI | 150375941 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.425915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG AACCGATTGC GGACCAGCGT GATCCACGCG AGCGCGCCAA TATCACGCGG CGCAGCATTC TGCTTGGCGG GACTGCCATT GCCGCGGCAT CGACCGTACT GACAACCGGC GGCGCGCAGA CCGCCCAGGC GCAGACGCAA ACGCAAGCGC CAGATGCCGG CGGGTCCGCC AAGCAGCCGA ACATCCTGGT CATCTGGGGC GATGACATCG GCACCTGGAA TATCAGCCAT AATAATCGTG GCATGATGGG CTATAAGACG CCGAACATCG ACCGGATCGC TCAGGAAGGC TTGTCCTTCA CGGATTATTA CGGACAGCAG AGCTGCACGG CCGGGCGCGC CGCTTTCATT GGCGGTAATG TACCGGTGCG CACCGGCATG ACCAAGGTCG GCCTCCCCGG CGCGAAGGAA GGCTGGCAGG AGACGGATGT CACGATGGCA ACCGTTCTCA AGAGCCAGGG CTACGCGACG GGCCAGTTCG GCAAGAACCA CCAGGGGGAC AGGGACGAGC ACCTGCCGAC CAATCACGGC TTCGACGAGT TTTTCGGCAA CCTGTATCAC CTCAATGCGG AGGAGGAGCC GGAGAACCGC GACTATCCGA AGGACCCGGA ATTCCGCCGT CGATTTGGCC CGCGCGGCGT GATCCATTCC TTCGCCGACG GCAAGATCGA GGATACCGGC GCGCTCACCA AAAAGCGGAT GGAAACGATT GACGAGGAGT CCCTTGCTGC GGCCAAGGAT TTCATTACGC GCCAGAACCA GGCAGGCACG CCCTTCTTCG TCTGGTGGAA CGGCACGCGC ATGCATTTCA GAACACATGT CAAGGCGGAA CATGCCGGCA TATCGGGGCC AAGCGGCGAC GAGTATCACG ATGGTATGGT CGAACATGAC ATGCATGTCG GCGAGTTGCT CAAGCTCCTC GACGAACTCG GTATCGCCGA AAATACTGTG GTCATGTATT CGACCGACAA TGGACCTCAT TTCAACACAT GGCCGGATGC CGCCACGACG CCGTTCAGGA GCGAAAAGAA CTCGAACTGG GAAGGCGCCT ATCGCGTTCC GGCTTTCGTG CGCTGGCCGG CCCGGTTCCC GGCAGGCAAG ACTCTCAACG GTATCGTTGC TCATGAGGAT TGGCTGCCTA CGTTTGGCGC GATTGCCGGT GCACCGGACG TCAAGGAAAA GCTCCTGGAG GGTGTCGAAC TCAACGGCCG CCGCTATCGC AACTATATCG ACGGCTACAA TCAGCTCGAA TATCTGGAAG GCAAGACCGA CCAGTCGCCG CGTCACGAGT TCTGGTACGT GAATGACGAC GGGCAGGTCG TCGCCGCGCG CTACGACGAC TGGAAAGTGG TATTCCTCGA GAATCGCGGC GAAGCCTTCG GCGTCTGGCG CGAACCCTTT ACCGAATTGC GCGTGCCGCT CTTGTTCAAC CTGCGACGCG ACCCCTTCGA GAAGGCGCAG CACAACGCCA ACACCTATGA TGACTGGTTC CTTGAACGGG CCTTCGTCGT GGTGCCGATC CAGGGTCTGG CGGCGAAGTT CCTGCAGACG ATGAAGGAAT ATCCGCCAAG CCAGTCACCC GGTTCCTTCA ATCTCACCAA AATCGAAGAG AGCCTGAAGG CCGGGATGAG AAACTAA
|
Protein sequence | MKKEPIADQR DPRERANITR RSILLGGTAI AAASTVLTTG GAQTAQAQTQ TQAPDAGGSA KQPNILVIWG DDIGTWNISH NNRGMMGYKT PNIDRIAQEG LSFTDYYGQQ SCTAGRAAFI GGNVPVRTGM TKVGLPGAKE GWQETDVTMA TVLKSQGYAT GQFGKNHQGD RDEHLPTNHG FDEFFGNLYH LNAEEEPENR DYPKDPEFRR RFGPRGVIHS FADGKIEDTG ALTKKRMETI DEESLAAAKD FITRQNQAGT PFFVWWNGTR MHFRTHVKAE HAGISGPSGD EYHDGMVEHD MHVGELLKLL DELGIAENTV VMYSTDNGPH FNTWPDAATT PFRSEKNSNW EGAYRVPAFV RWPARFPAGK TLNGIVAHED WLPTFGAIAG APDVKEKLLE GVELNGRRYR NYIDGYNQLE YLEGKTDQSP RHEFWYVNDD GQVVAARYDD WKVVFLENRG EAFGVWREPF TELRVPLLFN LRRDPFEKAQ HNANTYDDWF LERAFVVVPI QGLAAKFLQT MKEYPPSQSP GSFNLTKIEE SLKAGMRN
|
| |