Gene Information |
Locus tag | Reut_B4569 |
Symbol | |
ID | 3613533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia eutropha JMP134 |
Kingdom | Bacteria |
Replicon accession | NC_007348 |
Strand | - |
Start bp | 1248703 |
End bp | 1250331 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637693998 |
Product | arylsulfatase |
Protein accession | YP_298763 |
Protein GI | 73538396 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
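The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative and assumes that the coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1248703
end_bp = 1250331
reported_gene_length = 1629      # bp
reported_protein_length = 542    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1629 0 542
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.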
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.149859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATGCACA CATTGAAGCG GCTGGTGGCC GTTACGGCCA CCGTCGCCGC CATGAGTCCG TTCGCCGCCG GGGCACAGCA GACGCGGCCA AATATCCTCG TCATCTGGGG CGACGACATC GGCTGGGAGA ACGTCAGCGC CTACGGCATG GGCGTGATGG GCTACACGAC ACCGAATATC GACAGCATCG GCATGGAGGG CATCCGCTTT ACCGACCAGT ACGCCCAGCC TTCCTGCACG GCGGGCCGCG CGGCGTTCAT TACCGGCCAG TATCCGATCC GCTCGGGCAT GACCACCGTT GGCCAGCCCG GCGACAAACT TGGCTGGCAG CCGGCATCGC CAAGCCTTGG CGAAGTGATG AAGCAAGCCG GCTACCGCAC CGGTTTCTTT GGAAAGAGCC ATATGGGCGA CCGCAACTCG CACCTGCCCA CGGTCCACGG CTTTGACGAG TTCTTTGGCA ACCTCTACCA CCTGAATACC GAAGAGCTTC CCGAGAACCA CGACTACCAG GCTTATGCGA ACGGCTACCC CGGTGGTGAC AAGGCCTTCG CGCAGAAATT CGCACCGCGC GGCGTACTGC ACACCTATGC CACGGACAAC GACGATCCGA CCGACATGCC CCGCTTCGGT CCCGTCGGCA AACAGAAGAT CGAGGACACC GGCCCGCTGA CGAAAAAGCG GATGGAGGAC TTCGACGCCG CCGAGGTCAT TCCCAAAGCC ATCGACTTCA TGCAGGGCGC GAAGCAGAAG GACAAGCCTT TCTTTGTCTG GCTCAACACC AGCCGCATGC ACCTCTATAC CCACCTGAAC GACAAGTGGC GGTACGCGGC GGCCAAGTAC ACGCATGAAG ACGACATGCA GGGCAGCGGC ATGCTGCAGC ATGACCACGA CATCGGCCTC GTGCTGGAAT ACCTCAAGCG CAGCGGCCTC GACAAAAACA CCATCGTCTG GTACTCGACC GACAACGGGC CGGAACATGT CTCATGGCCA CACGGATCGA CCACGCCGTT CCGCGGCGAA AAGATGACGA CCTATGAAGG CGGCGTGCGC GTGGTCTCCA TGCTGCGCTG GCCCGGTGTC ATCAAGCCGG GCCAGATCAA GAACGGCATC CAGGCGCACC AGGACATGTT CACGACATTC GCTGCGATTG CGGGCGTGCC GGACGTCGTC GGGCAGATGA AGCGCGAGAA GCATCAGTAC ATCGACGGCA TCAACAACCT CGACTACTGG ACGGGAAAGA CCGCTGACAG TGCACGCAAG GACTTCCTGT ACTACTACGA GAACAAGCTC ACGGCCGTGC GCATGGGCCC ATGGAAGCTG CACTTCTCGC TCAAGGAGGA TTACTACGGC ACGCTCCAGC CGCGCAGCGT CACGATGCTC TTCAACCTGC GCAGCGACCC GTTCGAAAGC TATGACAGCA AGGACGCCTA TGGTCACCTG CTGCAAAAGG CCCAGTGGAT CTCCGGCCCG ATGAATGAAC TGATCGCCAG TCACCTCAAG ACCATCGCGG ACTATCCGCC GGTGCAGCCT GCCAAGTCGT TCGACCGTTC AAACATGGTC CAGGACTTTC TGCAGCAGCA ACAGCAGTTG CGGCAGCGGC AGCAGCAACA GGCGGCGATC AGGGAGTAA
Protein sequence | MMHTLKRLVA VTATVAAMSP FAAGAQQTRP NILVIWGDDI GWENVSAYGM GVMGYTTPNI DSIGMEGIRF TDQYAQPSCT AGRAAFITGQ YPIRSGMTTV GQPGDKLGWQ PASPSLGEVM KQAGYRTGFF GKSHMGDRNS HLPTVHGFDE FFGNLYHLNT EELPENHDYQ AYANGYPGGD KAFAQKFAPR GVLHTYATDN DDPTDMPRFG PVGKQKIEDT GPLTKKRMED FDAAEVIPKA IDFMQGAKQK DKPFFVWLNT SRMHLYTHLN DKWRYAAAKY THEDDMQGSG MLQHDHDIGL VLEYLKRSGL DKNTIVWYST DNGPEHVSWP HGSTTPFRGE KMTTYEGGVR VVSMLRWPGV IKPGQIKNGI QAHQDMFTTF AAIAGVPDVV GQMKREKHQY IDGINNLDYW TGKTADSARK DFLYYYENKL TAVRMGPWKL HFSLKEDYYG TLQPRSVTML FNLRSDPFES YDSKDAYGHL LQKAQWISGP MNELIASHLK TIADYPPVQP AKSFDRSNMV QDFLQQQQQL RQRQQQQAAI RE
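The relationship between the two sequences can be spot-checked by translating the gene sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the Strand field is "-"), and it relies on translation table 11 assigning codons to amino acids the same way as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon assignments in TCAG order. NCBI translation table 11 uses the same
# codon-to-amino-acid mapping as the standard code (it differs in which
# codons are permitted as starts).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
gene_prefix = "ATGATGCACA CATTGAAGCG GCTGGTGGCC"
print(translate(gene_prefix))  # prints: MMHTLKRLVA
```

The translated prefix matches the first group of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.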