Gene Rmet_5096 details


Gene Information

Locus tag: Rmet_5096
Symbol: aldH
ID: 4041957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1786872
End bp: 1788359
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 66%
IMG OID: 637980514
Product: 2-ketoglutarate semialdehyde dehydrogenase / NADP-dependent aldehyde dehydrogenase
Protein accession: YP_587224
Protein GI: 94314015
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
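
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1786872
END_BP = 1788359


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1488
print(protein_length(length_bp))  # 495
```

Both results agree with the listed gene length (1488 bp) and protein length (495 aa).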


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.845756
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.435826
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAGCT TTCGTGCTCC GCTCGCTTCT CCGCTTTTCC GTCAGACTCC CTCGTTTCCA 
TTCATGCACG CTTCCGAACA GAACCAAGTC TTCGCGATCA TCGACCAGCA CGCCACGCGC
GCCCACGAGG CCAGCGCGGC ATGGGCGCAT TCGAGTCCCA CGGTCCGCAG TGGCGTGCTG
CGTGCGCTGG CCGACGCCCT CGACGCGCGC GCGAATGAAC TCGCGGTGAT GGCCGACGAG
GAAACCGGCC TCGGCATCCC ACGGCTGACG GGGGAGATCG CCCGCACATC GTTCCAGTTG
CGGGGCTTCG CGGATGAAGT CGTGGCGGGC GTCCCCTACG TCCAGGTGGA TGACGAAGCC
ATTCCCGGCC CGCCTCCGTC GGGCCGTCCG CGTCTGACCC GTGTCATGGT CCCGATTGGT
CCGGTTGCCG TATTTGCAGC CAGCAACTTC CCGTTCGCGT TCTCGGTCCT GGGCGGAGAC
ACCGCATCGG CGTTGGCTGC GGGTTGTCCG GTCATCGTGA AGCCGCATGG CGGACATCCG
CGCCTGTCAA TCGAGGTGGC GAAGATTGCC GCAGAGGTCC TGCGCGCGCA AGGTGTGGAA
GAGGGCGTGT TCTCGATGCT CGACTGCGAC ATGTCGCGCG ACGCCAATAT CCATCTGGTC
AAGCATCCGC TGATCGCGGG CGTGGCGTTC ACGGGCTCCT ATCAGGGTGG CGTGGCCTTG
TGGCGCGCCG CGAATGAGCG CGAGATCCCG ATTCCCTTCT TCGGCGAACT CGGTTCGATC
AATCCGGTGG TGGTGCTCCC TGCGGCACTG CATGACGACC CGGTTGCGCC GGCCAAGGTG
CTGGCGGCGT CGATGCTGCA AGGCTGCGGC CAGTTCTGTA CCAGCCCCGG CGTTATCGTG
GTGGTCGATG ACCAGCGCGG CCGCGAGTTT GTCTCGGCGC TGGGCGACTC GCTGGCGGGG
CAGGCCACCC ATCGGATGCT GAGCGAAGGC ATCCGCAATG GTTTTGAAGA TGCCGTCGCG
CGTATCAAGG CGAATGCCAA TATCGAGGTC GTACTGGAAG GGAATGAGGC TGGGGCCGGC
CCGGCTCCCC GCCTGTTCCA GACCACCGGC GCTGCCTTTA TCGCGGATGC CAGCCTCCAC
GAGGAAATGT TCGGTCCAGC GGCCGTAGTG GTCATGGTGC CATCCGTCGA TGCGATTCCC
GATGTACTCT CCGCCGTCAA GGGATCGCTC ACGGTCACGA TCTGGGGGGG CGACCAGGAC
AACCACGAGA ATCGCGAGAT CGTGCGCGTG GCCCAGAACA TTGGGGGCCG CGTGCTGTTC
TCTGGTGTAC CGACCGGCGT CGCCGTGACG CGCGCGCAGC AACATGGCGG CCCCTGGCCG
GCGTCCACCG ATCCGAAGAC CACGTCGGTC GGGTATGCTG CCTTGCAGCG TTTTCTTCGG
CCCGTGGCAC TTCAGGATGC GCCGGAGTGG CTGCTCCAGC GCGCGTGA
 
Protein sequence
MQSFRAPLAS PLFRQTPSFP FMHASEQNQV FAIIDQHATR AHEASAAWAH SSPTVRSGVL 
RALADALDAR ANELAVMADE ETGLGIPRLT GEIARTSFQL RGFADEVVAG VPYVQVDDEA
IPGPPPSGRP RLTRVMVPIG PVAVFAASNF PFAFSVLGGD TASALAAGCP VIVKPHGGHP
RLSIEVAKIA AEVLRAQGVE EGVFSMLDCD MSRDANIHLV KHPLIAGVAF TGSYQGGVAL
WRAANEREIP IPFFGELGSI NPVVVLPAAL HDDPVAPAKV LAASMLQGCG QFCTSPGVIV
VVDDQRGREF VSALGDSLAG QATHRMLSEG IRNGFEDAVA RIKANANIEV VLEGNEAGAG
PAPRLFQTTG AAFIADASLH EEMFGPAAVV VMVPSVDAIP DVLSAVKGSL TVTIWGGDQD
NHENREIVRV AQNIGGRVLF SGVPTGVAVT RAQQHGGPWP ASTDPKTTSV GYAALQRFLR
PVALQDAPEW LLQRA
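
Both sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate a nucleotide sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names are hypothetical, and only the first row of the gene sequence is used as input here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGCAGAGCT TTCGTGCTCC GCTCGCTTCT CCGCTTTTCC GTCAGACTCC CTCGTTTCCA"
print(translate(first_row))  # MQSFRAPLASPLFRQTPSFP
print(gc_fraction(first_row))
```

The translated residues match the first 20 residues of the protein sequence listed above.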