Gene Rmet_5052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5052 
Symbol 
ID4041914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1741825 
End bp1742883 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content67% 
IMG OID637980473 
Productputative epoxide hydrolase 
Protein accessionYP_587183 
Protein GI94313974 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.905889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGA TCGTCAATTT GCCTCGACGG GGCTTTATGG GAGTTGTGGC GGCGACGCTG 
GCCGCGGCGC AATTCGGGGG GGCGCAGACG GCCCGGGCCC AGACCAGCCA GTCGAAGGCA
ACGCCACTGC CCCAGCCACT GCCCCGGCCG GGCACTCATA CGAGCTTTGC GCCGATCAAG
CAGATCAAAG CCGGCGTGTT GAACGTCGGC TATGCGGAGG CTGGCCCGAC CAACGGGCCC
GTCGCGCTGT TGCTGCATGG CTGGCCGTAC GACATCTACA GCTTCGTCGA CGTGGCACCG
CTACTGGCGG CACGGGGCTA TCGCGTGATC ATGCCCTACC TGCGCGGCTA CGGCACCACC
AGCTTTCTGT CCTCCGACAC CATGCGAAAT GGTCAGCCCT CGGCGATTGC CGCCGACATG
ATCGCGCTGC TGGATGCGCT CGGCATCCAG AACGCCGTGG TGGCCGGCTT CGACTGGGGC
GCGCGCACCG CCGACATCAT GGCCGCGCTG TGGCCAGACC GCTGCCGGGG ACTGGTATCG
GTCAGCGGCT ACCTGATCGG CAGCCAGGCC GGCGGCAAGG TGCCCCTGCC GCCCGAGGCC
GAGCTTCAGT GGTGGTACCA GTTCTACTTC GCCACGGACC GTGGCCGCGC CGGCTATCAG
AAGTACACCC ACCAGTTCGC CAAGCTCATC TGGAAACTGG CATCACCGAA GTGGGATTTC
GATGACGCCA CGTTCGAACG CAGCGCCGCC GCGTTCGACA ACCCCGACCA TGTCGACATC
ACGGTGCACA ACTATCGCTG GCGGCAAGGG CTGGCCACCG GAGAAGCCCG GTTCGACGAT
ATCGAGAAAC GGCTCGCCAC CGCGCCCACC ATCAGCGTGC CCACCATCAC CATGGAAGGC
GACGCCAACG GTGCGCCGCA TCCCGAGCCA AGCGCCTATG CCAGCAAGTT CACGGGACGG
TACGAACACC GCAACGTGAC CGGCGGGATC GGCCATAACC TGCCGCAGGA AGCGCCCGCC
GCGTTCGCGC AGGCGGTGCT CGACGTGGAC CGGCGCTAA
 
Protein sequence
MPEIVNLPRR GFMGVVAATL AAAQFGGAQT ARAQTSQSKA TPLPQPLPRP GTHTSFAPIK 
QIKAGVLNVG YAEAGPTNGP VALLLHGWPY DIYSFVDVAP LLAARGYRVI MPYLRGYGTT
SFLSSDTMRN GQPSAIAADM IALLDALGIQ NAVVAGFDWG ARTADIMAAL WPDRCRGLVS
VSGYLIGSQA GGKVPLPPEA ELQWWYQFYF ATDRGRAGYQ KYTHQFAKLI WKLASPKWDF
DDATFERSAA AFDNPDHVDI TVHNYRWRQG LATGEARFDD IEKRLATAPT ISVPTITMEG
DANGAPHPEP SAYASKFTGR YEHRNVTGGI GHNLPQEAPA AFAQAVLDVD RR