Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Reut_B5764 |
Symbol | |
ID | 3613792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia eutropha JMP134 |
Kingdom | Bacteria |
Replicon accession | NC_007348 |
Strand | + |
Start bp | 2587911 |
End bp | 2589359 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637695196 |
Product | sulfatase |
Protein accession | YP_299951 |
Protein GI | 73539584 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCA AGAATGTTGT GGTCATCATG TCCGATGAGC ACGACCCGAG AATGATGGGC TGCTCCGGGC ATCCCTTCGT GAAAACTCCC AACCTGGATG CACTGGCCGC CCGCGGCGTG CGCTTCTCCA GCGCCTACAC ACCCAGCCCG ATCTGCGTCC CGGCCCGTGC AGCCTTTGCC ACCGGCCGGC GGGTACATCA GGTCCGGCTT TGGGACAACG CCATGCCGTA TACGGGCGAG CAGCGCGGCT GGGGCCATGT GCTGCAGGAC CGGGGCATCC GCGTCGAAAG TATCGGCAAG CTGCATTACC GCAATGAAGA AGATCCGGCG GGCTTCGATG CGGAGCACCT GCCGATGCAC GTGGTCGGTG GCCACGGCAT GGTCTGGGCG TCGATCCGCA ATCCGTTTCG TCCACGCGAA AACGGCCCGC GTATGCTGGG CGAGCACATC GGCCCCGGCG AGTCGTCGTA CACGCAATAC GACCGGGCGG TAACGCAACG GGCGGTGCAA TGGCTGCAGG AGGCGGCACA GCGCCAGGAA GCCGGCTTTG TGCTGTACGT CGGCCTGGTC GCGCCGCATT TCCCCTTCGT CGTACCTGAA GAGTTTTACA GCCTCTACCC GACCGACGGC CTGCCGGAGC CGAAGCTGCA CCCGCGTACC GGCTACGAAC AGCATCCCTG GGTCAGGGAG TATTGCGACT TCATGGCGTC GGAACGGCAG TTCGCCGATG CTGACGAACG CCTGCGCGCC TTCGCCGCGT ACTACGGGCT CTGCACCTGG CTCGACCACA ACGTCGGCCA GATCCTCGGG GCGCTGCGCG ACAACGGATT GGAAGACACC ACGCACATCG TCTACACCTC CGACCACGGG GACAACCTCG GCGCGCGTGG TGTCTGGGGC AAGTCGACCC TCTATGAAGA GAGCGTCAAG GTGCCGATGC TGCTGGCCGG CCCGATTGTC ACGCCCGGTG TGTGCAACAC CCCTGTCGAC CTGCTGGACC TGTTCCCGAC GATCCTGCAA GGCGCGGGTG TCGATCCGGC GACGGAGATC GATGAACGGC CCGGCCGCTC CCTGTTCGAG CTTGCGCGCT CGGCGCCGGA ACCGGATCGG GTCATCCTCA GCGAGTACCA CGCGGCAGGC AGCAATGCGG GTGGCTTCAT GCTGCGCAAG GGACGCTGGA AGTACCACCA TTACGTTGGC TTCCGCCCGG AGTTGTTCGA CCTGGAGTCA GACCCCGAGG AACTGACCGA TCTGGCTGGC GATCCGGCAT ACGCCCCGGT CCTTGCCAGC ATGCACGAGG CCCTGCTGGC GATCTGCGAC CCCGATGCAG TTGATCGGCA GGCCAAGTGT GATCAAGCCG CCCTGATCGA GCACTACGGC GGCCCCGATA TGGCCCATAC GCTCGGCTCG TCCACATCCA CGCCCGTGAC GGCAAAGACC GCTGCATAA
|
Protein sequence | MASKNVVVIM SDEHDPRMMG CSGHPFVKTP NLDALAARGV RFSSAYTPSP ICVPARAAFA TGRRVHQVRL WDNAMPYTGE QRGWGHVLQD RGIRVESIGK LHYRNEEDPA GFDAEHLPMH VVGGHGMVWA SIRNPFRPRE NGPRMLGEHI GPGESSYTQY DRAVTQRAVQ WLQEAAQRQE AGFVLYVGLV APHFPFVVPE EFYSLYPTDG LPEPKLHPRT GYEQHPWVRE YCDFMASERQ FADADERLRA FAAYYGLCTW LDHNVGQILG ALRDNGLEDT THIVYTSDHG DNLGARGVWG KSTLYEESVK VPMLLAGPIV TPGVCNTPVD LLDLFPTILQ GAGVDPATEI DERPGRSLFE LARSAPEPDR VILSEYHAAG SNAGGFMLRK GRWKYHHYVG FRPELFDLES DPEELTDLAG DPAYAPVLAS MHEALLAICD PDAVDRQAKC DQAALIEHYG GPDMAHTLGS STSTPVTAKT AA
|
| |