Gene Rmet_5402 details


Gene Information

Locus tag: Rmet_5402
Symbol:
ID: 4042263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 2130727
End bp: 2132709
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 62%
IMG OID: 637980820
Product: putative alkyl sulfatase
Protein accession: YP_587530
Protein GI: 94314321
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2015] Alkyl sulfatase and related hydrolases
TIGRFAM ID:
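
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2130727
END_BP = 2132709
GENE_LENGTH_BP = 1983
PROTEIN_LENGTH_AA = 660


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))      # compare with Gene Length
    print(coding_length_aa(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions both results agree with the listed values (1983 bp and 660 aa), which is consistent with the record describing an unspliced CDS.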


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACA GCCGATTTCG CGCGAGCGTC GTGACGCTCG TCTCCGCCGC GGTGCTCTCC 
CAGTTCGGCG CGGCCTATGC CCAGGACGCG CGGAAGGACG CCACTAGCGC GACAAAGGCC
GCCAACCAGA AGCTCCTCAG CGAATTGCCT TTCTCGGATC GTTCCGATTT CGATGACGCG
CACCGCGGCT TTGTCGCACC ACTGCCGAAC ACGGTGATCA AGGGGTCGTC GGGTAACGCC
ATCTGGAACC CGAACCAGTA CGCCTTTATC AAGGAAGGTT CCCAGGCACC GGGCACGGTC
AACCCGAGCC TCTGGCGTCA GGCTCAACTG ATCAACATCA GCGGCCTGTT CCAGGTGACC
GATGGCATCT ATCAGGTTCG GAATCAGGAC CTGTCGAACA TGACCATCAT CGAGGGCAAG
GAAGGCATCA CGGTGGTCGA CCCCCTCGTC TCGGAAGAAA CAGCCAAGGT CGCAATCGAT
CTCTACTATG CCAATCGTGG CCGCAAGCCC GTGAAGGCGG TGATCTACAC ACACAGCCAC
GTTGACCACT ATGGCGGCGT GCGTGGCGTG ATCAGCCAGG ACGACGTCAC CTCCGGGAAG
GTCAAGATCT ACGCGCCTGA GGGCTTCCTG GAAGCCGCGG TGGCGGAGAA CGTGATGGCC
GGCAATGCCA TGAGCCGCCG CGCCAGCTAC ATGTACGGCA ATCTGCTGCC CGCCGATGAA
AAGGGCCAGG TCGGCGCCGG TCTCGGCACC ACGACCTCGG CCGGCACCGT CACGCTGTTG
TCTCCGACGG ACACGATCAC CAAGACCGGT GAAAAGCGCG TCATCGACGG CCTGACCTAC
GAGTTCCTGA TGGCTCCTGG CTCGGAAGCT CCGGCCGAGA TGCTGTGGTA CATCGAGGAA
AAGCGCGCGA TCTCCGCAGC CGAGGATTGC ACACACACGC TGCACAACAC GTATTCGCTG
CGCGGCGCCA AGATCCGCGA GCCGCTGCCC TGGTCGAAGT ACCTGAATCA GGCGCTGACC
ATGTGGGGCG CCAAGGCAGA TGTCATGTTC GCCCAGCACC ACTGGCCGAG TTTCGGCCAG
AAGAACGTGG TACACCTGCT GCGCCAGCAG CGCGACCTGT ATCGCTACAT CAATGACGAG
ACGCTGCGCC TGGCCAACCA GGGCGAGACG ATGGTCGAGA TCGCGGACAA GTTCAAGCTG
CCGCCCGATC TCGCCAACAT GTGGGCCAAC CGTGGCTACT ACGGCTCAGT CAGCCACGAC
GTGAAGGCTA CCTACGTCCT TTACCTCGGC TGGTTCAATG GCAACCCCGC CACGCTGGAC
GAACTGACGC CGGTGGAAGC CAGCAAGCGC TACGTCGAGT TCATGGGCGG CGCCAATGCC
GTGCTGTCGA AGGCCAAGCA GGCGTACGAC AAGGGTGAGT ATCGCTGGGT GGCCCAGGTG
GTCAATCACG TGGTGTTTGC CGACCCGTCG AACAAGGCGG CCAAGAACCT CCAGGCCGAC
GCGCTCGAGC AACTGGGCTA CCAGGCCGAA TCCGGCCCGT GGCGCAACTT CTACCTGACC
GGTGCCAAGG AATTGCGCGA AGGCGTGAAG AAGCTGCCCA CGCCGAACAC CGCGAGCGGC
GATACGGTGA AGGCCATGAC GCCGGAGATG TTCTTCGACT ATCTCAGCGT TCGCGTGAAC
CGCGCCAAGG CGGCCAATGC GAAGATCGCG CTGAATGTCG ACTTCGGCAA GGAAGGCGGC
AAGTATCTGC TCGAACTTGA GAACGGCGTG CTCAACCACA CGGCAGGCGT TGAATCCGCA
AACGCTGATG CCTCCGTGGC AATGTCACGC GACACGCTGA ACGGCATCAT CCTGCAACAG
ACGAAGCTGG CCGACGCGAT CAAGAACGGA TCGGCGAAGG TCACTGGCAA TCAGGCCAAG
CTCGACGAAC TGGTGAGCTA CCTCGACAAC TTCGAGTTCT GGTTCAATAT CGTCACGCCG
TAA
 
Protein sequence
MKNSRFRASV VTLVSAAVLS QFGAAYAQDA RKDATSATKA ANQKLLSELP FSDRSDFDDA 
HRGFVAPLPN TVIKGSSGNA IWNPNQYAFI KEGSQAPGTV NPSLWRQAQL INISGLFQVT
DGIYQVRNQD LSNMTIIEGK EGITVVDPLV SEETAKVAID LYYANRGRKP VKAVIYTHSH
VDHYGGVRGV ISQDDVTSGK VKIYAPEGFL EAAVAENVMA GNAMSRRASY MYGNLLPADE
KGQVGAGLGT TTSAGTVTLL SPTDTITKTG EKRVIDGLTY EFLMAPGSEA PAEMLWYIEE
KRAISAAEDC THTLHNTYSL RGAKIREPLP WSKYLNQALT MWGAKADVMF AQHHWPSFGQ
KNVVHLLRQQ RDLYRYINDE TLRLANQGET MVEIADKFKL PPDLANMWAN RGYYGSVSHD
VKATYVLYLG WFNGNPATLD ELTPVEASKR YVEFMGGANA VLSKAKQAYD KGEYRWVAQV
VNHVVFADPS NKAAKNLQAD ALEQLGYQAE SGPWRNFYLT GAKELREGVK KLPTPNTASG
DTVKAMTPEM FFDYLSVRVN RAKAANAKIA LNVDFGKEGG KYLLELENGV LNHTAGVESA
NADASVAMSR DTLNGIILQQ TKLADAIKNG SAKVTGNQAK LDELVSYLDN FEFWFNIVTP
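
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard genetic code for every codon (the tables differ only in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The helper names are hypothetical, and only the first 30 and last 33 nucleotides of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 shares for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Sample input: the first 30 and the last 33 nucleotides of the gene sequence.
FIRST_30 = "ATGAAGAACA GCCGATTTCG CGCGAGCGTC"
LAST_33 = "TTCGAGTTCT GGTTCAATAT CGTCACGCCG TAA"

if __name__ == "__main__":
    print(translate(FIRST_30))  # MKNSRFRASV
    print(translate(LAST_33))   # FEFWFNIVTP
```

The two outputs match the first and last ten residues of the protein sequence listed above. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein and the listed GC content.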