Gene Rmet_5415 details


Gene Information

Locus tag: Rmet_5415
Symbol:
ID: 4042276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 2158500
End bp: 2160488
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 65%
IMG OID: 637980833
Product: putative alkyl sulfatase
Protein accession: YP_587543
Protein GI: 94314334
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2015] Alkyl sulfatase and related hydrolases
TIGRFAM ID:
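
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
print(gene_length(2158500, 2160488))  # 1989
print(protein_length(1989))           # 662
```

Under these assumptions the computed values agree with the Gene Length (1989 bp) and Protein Length (662 aa) fields of the record.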


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.297499
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATGCA AGACCGCCGC GTTTCACGCG GCCATTTCCG TGCTGGTGGG CAGCCTGTTC 
CCGATGAGCG CCATGGCCGC GCCCACCGAG TCCATTGCCA CCAACGACGC AACGGCCGCC
ACGCGCGATG CCAATGCGGA CGTCCTGAAG CGCCTGCCTT TCGCCAACCG GCAGGACTTC
GAAGATGCCC AACGTGGCTG GGTCGGATCG CTCGACAGTG GCGAGATCCG CAATGCCGAT
GGTCGCGTGG TCTGGAACCT CGACGCCTAT GCCTTCCTGC GTGACGATGC CTCACCCGCC
TCGGTCAATC CGAGCCTCTG GCGCCAGGCG CAGCTCAACC TGAAGCACGG CCTGTTCAAG
GTCACCGACC GCATCTATCA GGTACGTGGC TTCGACCTCT CGAACATGAC CATCGTCGAG
GGCGACAGTG GCCTTATCGT GATCGATCCG CTCCTGACCG CCGAAACCGC CCGCGCGGCG
ATCGACGTCT ACTACAAGTA CCGTCCGAAG AAGCCCATCG TCGCGGTGAT CTACTCACAC
AGCCACGTGG ACCACTTCGG CGGCGTGAAG GGCGTGGTCA GTCAGGATGA CGTCAAGTCG
GGCAAGGTGA AGATCTACGC GCCCGAAGGC TTTATGGAAG AGGCCATCAG CGAGAACATC
TTCGCCGGCA ATGCCATGAG CCGCCGCGCG CAGTACATGT ACGGCGCCCT GCTGCCAAAG
GGCCCGCAGG GGCAGGTGGA CGCCGGCCTC GGCAAGACCG TTTCGCTCGG CACGATCACA
CTGATTCCGC CCACCGACCT GATCGGCAAG ACCGGCGAAA CCCGCACGAT CGACGGCGTG
CGGATCGAAT TCCAGATGGC TCCCGGCTCC GAGGCGCCGG CCGAGATGCT GATGTACTTC
CCGCAGTGGC GGGCACTGTG CGCGGCAGAG GACGCCACGC ATAACCTGCA CAACCTCTAC
ACCATCCGCG GCGCCCAGGT GCGCGACGCC AACCAGTGGT GGCGCGCGCT CGACGAGACC
ATCGACCGCT ACGGCAACCG CACTGACGTC ATCTTCGCGC AGCACCACTG GCCAAAGTGG
GGCCAGCAGA GCATTACCGG CTTCCTCTCG CGGCAGCGCG ACGCCTACAA GTTCATCCAC
GACCAGACAC TGCGCCTGGC CAACCAGGGC TACACGATGA CCGAGGTAGG CGAGCGCGTG
AAGCTGCCGC CGTCTCTGGC CAGCCAATGG GACCTTCGCG ACTACTACGG CACGGTGAAT
CACAACGCCA AGGCCGTCTA CCAGCGGTAC CTCGGCTGGT ACAGCGGTGA CCCGGCCGAC
CTGCACCCGC TGCCACCGGA AGAGTCCGCG CAACGCTACG TGCAGTACAT GGGCGGCGCC
GACAAGATCC TCGCTCAGGC CAGCAAGTCG TACGCCCAAG GCGATTACCG CTGGGTGGCG
CAGGTGGTCA AGCACGTGGT CTACGCCGAT CCGTCGAACC TGGCGGCCCG CAAGCTCGAG
GCCGACGCGC TTGAACAGCT CGGCTACCAG ACCGAGGCCG CAAGCTGGCG CAGTGCCTAT
CTGGTGGGCG CCTACGAGTT GCGCAATGGT GTGCCCAAGC TGCAGGGCAC CCAGACTGCC
AGCCCAGACA TGATCGGGGC CATGACCGAC ACGATGTTCC TGGACTTCCT GGCCGTGCGT
CTCAATGGCG AGCGCGCCGC CGGCCACGAC CTGAAGTTCA ACTGGGTACA ACCCGATACT
GGCAAGCGCT ATGCGCTGTC GGTGGAAAAC GGTGTCTTCC TCTATAAGCC GGAGCGCCAG
TTCGACGACG CCGGTGCCAC GTTGACGATG CCGCGCAGCG CGCTGATCGG CTCGCTGCTG
GGCCAGACCA CGCTGCCCGC GGAACTCTCG GCCGGGCGCG CCAAGGTGGA CGGCGATCCG
GCTGTACTGA AGTCATGGAT GGGAATGCTG GACAAGTTCG ACCCGCAGTT CAATATCGTG
ACGCCTTGA
 
Protein sequence
MQCKTAAFHA AISVLVGSLF PMSAMAAPTE SIATNDATAA TRDANADVLK RLPFANRQDF 
EDAQRGWVGS LDSGEIRNAD GRVVWNLDAY AFLRDDASPA SVNPSLWRQA QLNLKHGLFK
VTDRIYQVRG FDLSNMTIVE GDSGLIVIDP LLTAETARAA IDVYYKYRPK KPIVAVIYSH
SHVDHFGGVK GVVSQDDVKS GKVKIYAPEG FMEEAISENI FAGNAMSRRA QYMYGALLPK
GPQGQVDAGL GKTVSLGTIT LIPPTDLIGK TGETRTIDGV RIEFQMAPGS EAPAEMLMYF
PQWRALCAAE DATHNLHNLY TIRGAQVRDA NQWWRALDET IDRYGNRTDV IFAQHHWPKW
GQQSITGFLS RQRDAYKFIH DQTLRLANQG YTMTEVGERV KLPPSLASQW DLRDYYGTVN
HNAKAVYQRY LGWYSGDPAD LHPLPPEESA QRYVQYMGGA DKILAQASKS YAQGDYRWVA
QVVKHVVYAD PSNLAARKLE ADALEQLGYQ TEAASWRSAY LVGAYELRNG VPKLQGTQTA
SPDMIGAMTD TMFLDFLAVR LNGERAAGHD LKFNWVQPDT GKRYALSVEN GVFLYKPERQ
FDDAGATLTM PRSALIGSLL GQTTLPAELS AGRAKVDGDP AVLKSWMGML DKFDPQFNIV
TP
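
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic CDS sanity check. All helper names are hypothetical, and the check accepts only ATG as a start codon, which is a simplification (translation table 11 also permits alternative start codons).

```python
def clean_sequence(blocked: str) -> str:
    """Join a space- and newline-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, standard stop codon."""
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# First two blocks of the gene sequence above, used only as a small example.
snippet = clean_sequence("ATGCAATGCA AGACCGCCGC")
print(len(snippet))          # 20
print(gc_fraction(snippet))  # 0.6
```

To check the whole gene, pass the full Gene sequence block to `clean_sequence` and compare `len(...)` and `gc_fraction(...)` with the Gene Length and GC content fields.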