Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5415 |
Symbol | |
ID | 4042276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 2158500 |
End bp | 2160488 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637980833 |
Product | putative alkyl sulfatase |
Protein accession | YP_587543 |
Protein GI | 94314334 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.297499 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATGCA AGACCGCCGC GTTTCACGCG GCCATTTCCG TGCTGGTGGG CAGCCTGTTC CCGATGAGCG CCATGGCCGC GCCCACCGAG TCCATTGCCA CCAACGACGC AACGGCCGCC ACGCGCGATG CCAATGCGGA CGTCCTGAAG CGCCTGCCTT TCGCCAACCG GCAGGACTTC GAAGATGCCC AACGTGGCTG GGTCGGATCG CTCGACAGTG GCGAGATCCG CAATGCCGAT GGTCGCGTGG TCTGGAACCT CGACGCCTAT GCCTTCCTGC GTGACGATGC CTCACCCGCC TCGGTCAATC CGAGCCTCTG GCGCCAGGCG CAGCTCAACC TGAAGCACGG CCTGTTCAAG GTCACCGACC GCATCTATCA GGTACGTGGC TTCGACCTCT CGAACATGAC CATCGTCGAG GGCGACAGTG GCCTTATCGT GATCGATCCG CTCCTGACCG CCGAAACCGC CCGCGCGGCG ATCGACGTCT ACTACAAGTA CCGTCCGAAG AAGCCCATCG TCGCGGTGAT CTACTCACAC AGCCACGTGG ACCACTTCGG CGGCGTGAAG GGCGTGGTCA GTCAGGATGA CGTCAAGTCG GGCAAGGTGA AGATCTACGC GCCCGAAGGC TTTATGGAAG AGGCCATCAG CGAGAACATC TTCGCCGGCA ATGCCATGAG CCGCCGCGCG CAGTACATGT ACGGCGCCCT GCTGCCAAAG GGCCCGCAGG GGCAGGTGGA CGCCGGCCTC GGCAAGACCG TTTCGCTCGG CACGATCACA CTGATTCCGC CCACCGACCT GATCGGCAAG ACCGGCGAAA CCCGCACGAT CGACGGCGTG CGGATCGAAT TCCAGATGGC TCCCGGCTCC GAGGCGCCGG CCGAGATGCT GATGTACTTC CCGCAGTGGC GGGCACTGTG CGCGGCAGAG GACGCCACGC ATAACCTGCA CAACCTCTAC ACCATCCGCG GCGCCCAGGT GCGCGACGCC AACCAGTGGT GGCGCGCGCT CGACGAGACC ATCGACCGCT ACGGCAACCG CACTGACGTC ATCTTCGCGC AGCACCACTG GCCAAAGTGG GGCCAGCAGA GCATTACCGG CTTCCTCTCG CGGCAGCGCG ACGCCTACAA GTTCATCCAC GACCAGACAC TGCGCCTGGC CAACCAGGGC TACACGATGA CCGAGGTAGG CGAGCGCGTG AAGCTGCCGC CGTCTCTGGC CAGCCAATGG GACCTTCGCG ACTACTACGG CACGGTGAAT CACAACGCCA AGGCCGTCTA CCAGCGGTAC CTCGGCTGGT ACAGCGGTGA CCCGGCCGAC CTGCACCCGC TGCCACCGGA AGAGTCCGCG CAACGCTACG TGCAGTACAT GGGCGGCGCC GACAAGATCC TCGCTCAGGC CAGCAAGTCG TACGCCCAAG GCGATTACCG CTGGGTGGCG CAGGTGGTCA AGCACGTGGT CTACGCCGAT CCGTCGAACC TGGCGGCCCG CAAGCTCGAG GCCGACGCGC TTGAACAGCT CGGCTACCAG ACCGAGGCCG CAAGCTGGCG CAGTGCCTAT CTGGTGGGCG CCTACGAGTT GCGCAATGGT GTGCCCAAGC TGCAGGGCAC CCAGACTGCC AGCCCAGACA TGATCGGGGC CATGACCGAC ACGATGTTCC TGGACTTCCT GGCCGTGCGT CTCAATGGCG AGCGCGCCGC CGGCCACGAC CTGAAGTTCA ACTGGGTACA ACCCGATACT GGCAAGCGCT ATGCGCTGTC GGTGGAAAAC GGTGTCTTCC TCTATAAGCC GGAGCGCCAG TTCGACGACG CCGGTGCCAC GTTGACGATG CCGCGCAGCG CGCTGATCGG CTCGCTGCTG GGCCAGACCA CGCTGCCCGC GGAACTCTCG GCCGGGCGCG CCAAGGTGGA CGGCGATCCG GCTGTACTGA AGTCATGGAT GGGAATGCTG GACAAGTTCG ACCCGCAGTT CAATATCGTG ACGCCTTGA
|
Protein sequence | MQCKTAAFHA AISVLVGSLF PMSAMAAPTE SIATNDATAA TRDANADVLK RLPFANRQDF EDAQRGWVGS LDSGEIRNAD GRVVWNLDAY AFLRDDASPA SVNPSLWRQA QLNLKHGLFK VTDRIYQVRG FDLSNMTIVE GDSGLIVIDP LLTAETARAA IDVYYKYRPK KPIVAVIYSH SHVDHFGGVK GVVSQDDVKS GKVKIYAPEG FMEEAISENI FAGNAMSRRA QYMYGALLPK GPQGQVDAGL GKTVSLGTIT LIPPTDLIGK TGETRTIDGV RIEFQMAPGS EAPAEMLMYF PQWRALCAAE DATHNLHNLY TIRGAQVRDA NQWWRALDET IDRYGNRTDV IFAQHHWPKW GQQSITGFLS RQRDAYKFIH DQTLRLANQG YTMTEVGERV KLPPSLASQW DLRDYYGTVN HNAKAVYQRY LGWYSGDPAD LHPLPPEESA QRYVQYMGGA DKILAQASKS YAQGDYRWVA QVVKHVVYAD PSNLAARKLE ADALEQLGYQ TEAASWRSAY LVGAYELRNG VPKLQGTQTA SPDMIGAMTD TMFLDFLAVR LNGERAAGHD LKFNWVQPDT GKRYALSVEN GVFLYKPERQ FDDAGATLTM PRSALIGSLL GQTTLPAELS AGRAKVDGDP AVLKSWMGML DKFDPQFNIV TP
|
| |