Gene Rmet_5422 details

Gene Information

Locus tag: Rmet_5422
Symbol: sumF
ID: 4042283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 2168098
End bp: 2169156
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 68%
IMG OID: 637980840
Product: Sulfatase-modifying factor 1 precursor (C-alpha-formylglycine-generating enzyme 1); putative exported protein
Protein accession: YP_587550
Protein GI: 94314341
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID:
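
The start, end, and length fields above agree with one another. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2168098, 2169156)
print(length_bp, protein_length(length_bp))  # 1059 352
```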


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGTGA ACGTGTTGCG AAACAGGGGA CGCTGGCGCA CGCCGCTGGC GGTAGCTGGG 
CTGGCATTCG CGGTGGGTGT CATCGCGTCG CTGGGCGTGC ATGCGGTGCC GGGGGCCGCC
ATGCCTGACG GCGCCACACT CGGTTCGGTG CAGCGATGCG CCGCCTACTC GGGTTTGCCG
GCCGGCTGGG GCAAGTCGCG GACGGCTGGC ATGGCTCGGG TGACGGGCGG TGAGTTCGTG
CCCGGTACCA CGCTTGGCTA TCCGGACGAG CGGCCCGCAG GAAAAACGCG CGTCGGCAGC
TTCTGGATCG ACCGTACCGA GGTGACGGTG GCCCAGTTCG CGGCGTTCGT GCAGGCCACC
GGTTACGTCA CCGATGCCGA GCGACAAGGT GCGGCGGTGG TCTTCCACAA GCCGACCGAC
GCAGAACTGG GTCAACGCCC CTACGCGTGG TGGACGATGG TGACAGGCGC CAACTGGCGG
CATCCGGAAG GGCCGGCCGC TGCCAATTCA CATGGCTACG ATCACCGACG TGACAACCAG
CCGGTGACGC TCGTCACGCA GGCCGATGCC AGGGCCTATG CGAACTGGCT CGGCCATGAC
CTGCCCACGG AGGACGAATG GGAGTTCGCG GCCAAGGCGG GACGTAGTGA TGCCGGACTG
GAAACGGCAC CGCAAACGGC CGAAGGAACG CCCACCGCCA ACTACTGGCA GGGCGTGTTC
CCGGTGCTCA ATACCTCGCG CGATGGGTTC GCGGGGCTCG CGCCGGTGGG GTGCTACACC
GCCAATGCGC TCGGCTTGTT CGACATGATC GCCAACGCCT GGGAGTGGAC TGGCGACGCC
TACACCGGCC CGCGTCAGTC GCACGCCAAT GGCGATACGG CAGTCGTGGC GGCAGCGTCA
CGGTCGCGCA AGCCGGCGGC GACCAGTGTA ATCAAGGGCG GATCGTTCCT GTGCGCGCCG
GACTTCTGCG TGCGCTACCG TGCTTCGGCG CGGGAGTCTG CCGAAAGCGA CTTGCCGACG
TCGCATATCG GCTTCCGAAC GGTGCTGCGC GATGGCTGA
 
Protein sequence
MVVNVLRNRG RWRTPLAVAG LAFAVGVIAS LGVHAVPGAA MPDGATLGSV QRCAAYSGLP 
AGWGKSRTAG MARVTGGEFV PGTTLGYPDE RPAGKTRVGS FWIDRTEVTV AQFAAFVQAT
GYVTDAERQG AAVVFHKPTD AELGQRPYAW WTMVTGANWR HPEGPAAANS HGYDHRRDNQ
PVTLVTQADA RAYANWLGHD LPTEDEWEFA AKAGRSDAGL ETAPQTAEGT PTANYWQGVF
PVLNTSRDGF AGLAPVGCYT ANALGLFDMI ANAWEWTGDA YTGPRQSHAN GDTAVVAAAS
RSRKPAATSV IKGGSFLCAP DFCVRYRASA RESAESDLPT SHIGFRTVLR DG
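
The protein sequence can be recovered from the gene sequence by codon-wise translation. The sketch below is one way to do that and to compute GC content, not the method used to build this record. It relies on translation table 11 sharing the standard code's codon-to-amino-acid assignments (the tables differ in their permitted start codons, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGGTCGTGA ACGTGTTGCG AAACAGGGGA"))  # MVVNVLRNRG
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field of this record.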