Gene Rmet_5544 details

Gene Information

Locus tag: Rmet_5544
Symbol:
ID: 4042405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 2290151
End bp: 2291611
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 56%
IMG OID: 637980962
Product: aldehyde dehydrogenase
Protein accession: YP_587672
Protein GI: 94314463
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
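
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2290151, 2291611)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Under these assumptions, both results agree with the Gene Length and Protein Length values listed in the record.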


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATACC TAACTGCGGA CGCAATTTCT GAAGCCTTGC GAGGCAATCC GGTCCGGCAA 
CTGATTAACG GTTCGATGGT TGATGGGGCC GAAACGCTTG AAGTCATTAA CCCAGCTACC
GGCGAGGCTT GCGCCATTGC GCCCGTGGCT TCTTTGCGAC AGCTTGACGA AGCTGTCGAC
GCTGCCCGGC GTTCGCAACA AAGCTGGGGT GGCTTGCCAT TGACAGAGCG CAGAACGGCT
CTAAAGGGGT TAGCCACGAT TCTTCGGGAG CATGTTGCGG AGTTGGCTGC GCTGCTGACG
CTAGAGCAAG GCCGACCTCT CGCCCAGACT GAAGCGGAAG TGATGCGTGC CGCCATGCTG
CTCGAGGCCA TGCTCACGAT TGACATCGAC GACGAGATTC TTCGCGAAGA TGAATCTGGT
CGGGTCATTC TGCAACACAA GCCGATTGGC GTTGTCGGTG CCATTGCCCC TTGGAACGTT
CCCATCGGAC TCGCCGTTCC GAAGATCACC CATGCACTCT ACGCTGGCAA TACTGTTGTA
CTAAAGCCGT CCCAATACAC TCCTCTGGCC ACGCTGCGAC TTGGTGAGTA CGCATCGAAC
CTGTTCCCCC CTGGAGTGCT GAACGTCTTG AACGGTGGAA ATGATCTTGG GGAAAGGATT
TGCACGCACC CGGACATCGC CAAGATCTCC CTGACTGGAT CTGTGCCAAC AGGGAAACGA
GTTATGGCGT CCGCTGCAGC CTCGCTGAAG CGCTTAACGC TTGAGCTAGG TGGAAACGAC
GCCTGCATCG TCCGCCAGGA TGCGGACGTT GACAAGATTG CACCCGCGCT GTTCGCTGCG
GCGTTCATCA ACAGTGGTCA GGTTTGCATG GCGATAAAGC GCCTTTTCGT ACATCAGGAT
CTTCATGAGC GCCTGGTTGA AAAGCTGGGA GGCCTAGCTG CTAAAGCCAA AGTAGGCGAT
GGCTTTGACT CGACGAGCCA AATGGGGCCG GTTCAGAATC GTGCGCAATA CGAGTCGGTA
AAAGCAGTTC TGGCCGAAGT GGCCGCAGAC CCGGCAGCAA TCATTGTCGC GGGTGGCGAA
GCGTTGAGCC GCCAGGGATT CTTCATTGCT CCCACAGTCG TATCGGGCGT CAGAGAAGGA
AATTCCCTTG TCGACAAGGA GACGTTTGGG CCAGTGCTTC CAGTCCTATC TTTTCAAACC
GATGAGGAAG CGATCGAGCG TGCGAATGCC GGATCGATGG GATTGGGTGC GTCTGTGTGG
GGCAATGATC TCAAAATGGC AGAGCACGTA GCGCGGCAGT TGGTAGCTGG CACCGTATGG
ATAAACAGAC ATGTGGGCGT TGACCCCTTG GTGCCGTTTG GCGGAGCAAA GGAATCCGGT
CTTGGACGGC AGTTCGGAAA AGCAGGGTTG CTAGAGTTCA CCGAAACATC CGCGCTGTTT
GTTCCCAGAG CCAACAAATA G
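
The GC content reported in the gene information can be recomputed from the sequence above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole gene; `gc_content` is a hypothetical helper name, and the database may round or compute the value differently.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence, used here only as example input.
first_line = "ATGAAATACC TAACTGCGGA CGCAATTTCT GAAGCCTTGC GAGGCAATCC GGTCCGGCAA"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives a figure that can be compared against the reported 56%.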
 
Protein sequence
MKYLTADAIS EALRGNPVRQ LINGSMVDGA ETLEVINPAT GEACAIAPVA SLRQLDEAVD 
AARRSQQSWG GLPLTERRTA LKGLATILRE HVAELAALLT LEQGRPLAQT EAEVMRAAML
LEAMLTIDID DEILREDESG RVILQHKPIG VVGAIAPWNV PIGLAVPKIT HALYAGNTVV
LKPSQYTPLA TLRLGEYASN LFPPGVLNVL NGGNDLGERI CTHPDIAKIS LTGSVPTGKR
VMASAAASLK RLTLELGGND ACIVRQDADV DKIAPALFAA AFINSGQVCM AIKRLFVHQD
LHERLVEKLG GLAAKAKVGD GFDSTSQMGP VQNRAQYESV KAVLAEVAAD PAAIIVAGGE
ALSRQGFFIA PTVVSGVREG NSLVDKETFG PVLPVLSFQT DEEAIERANA GSMGLGASVW
GNDLKMAEHV ARQLVAGTVW INRHVGVDPL VPFGGAKESG LGRQFGKAGL LEFTETSALF
VPRANK
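
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table from the standard 64-letter amino-acid string in TCAG order. Table 11 shares these assignments with the standard code and differs in its permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and codons containing unexpected characters are reported as `X` by assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAATACC TAACTGCGGA CGCAATTTCT"))  # MKYLTADAIS
print(translate("GTTCCCAGAG CCAACAAATA G"))           # VPRANK
```

The two example inputs are the first 30 bases and the last 21 bases of the gene sequence; their translations match the first and last residues of the protein sequence.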