Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5413 |
Symbol | calB |
ID | 4042274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 2155363 |
End bp | 2156796 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637980831 |
Product | coniferyl aldehyde dehydrogenase (CALDH) |
Protein accession | YP_587541 |
Protein GI | 94314332 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.073023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.273691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGTC TGTTCCAAGC GCAGCGGTCA GCCTTTGCCG CCGCGCCCTA CCCGGCGCTG GCCGACAGAA AGCAGGGCCT GCGGCGCTTG CGCGATGCAG TTCGCAAGCA CGCCGACAGC CTGGCGCAGG CGGCGGACCG GGACTTCGGC GGACGCGCCC ATGCCGAGAC GATGATGGTC GACGTGCTGC CGTCCGTGCT GCACATCAAC CACCTGCTTG GCGGGCTACG CCGGTGGATG CGACCGTCGC GCAGGAGTGT CGAACTGTTG TTCCTCGGGA ACAGCGCCAA GGTGGTGTAT CAACCCAAGG GCGTGGTTGG CATCGTGGTG CCCTGGAACT TCCCCGTCTA TCTGGCGCTC GGGCCACTGG CCACCGCGCT GGCGGCAGGC AACCGATGCC TGATCAAGAC CTCGGAGTTC GCGCCCCGGA CCTCTGCCGC GCTGCGCGCC TTGCTTGCTG AGGCGTTCAC CGAGGACGAA GTCGCCATCG TCGAAGGCGA CGCCGACGTG GCACGCCAGT TCACCGGCCT ACCGTTCGAT CATCTGGTGT TCACGGGTTC GCCCGAGGTC GGCCGCCACG TGATGCGTGC CGCCGCCGAA CACCTGACGC CCGTCACGCT CGAACTCGGT GGCAAATCGC CTGCCATCGT GTCGTCCAGC GCGAACCTGG CAACCGCGGC ACGGCGCATC GCCCACGGCA AAACGGTCAA CGCCGGTCAG ATCTGCGTGG CGCCGGACTA CGCCCTGGTG CCAGAGGCGC TCGCCGCCCG TTTTGCCACG GAAGTGCTGA ACGCATCCGC AGCGCTGTTC CCGGCGGATC GCGACGACTA CGCAAGCATC ATTCATGACC GCGCCCACCA GCGCATGAAC GATCTGCTCG ATGACGCGCG TGCCCACGGC GCGCAGGTCA TGCGCTCGGC CATGCCCGAG GATGGACGCC GCGTGCCACT GCACGTGGTA CTCGGCGTAA CGCCCCGGAT GCGCATCGCG CGCGAAGAAA TCTTCGGCCC GATCCTGCCG GTCTTCACCT ATCGCGCATT CGAGGACGTC GTGACGCACA TACGCGGCGG CACGCGCCCG CTCGCCATGT ACTACTTCGG CGACGACAAG GCCGAATCCA CCACCCTGCT GCAACGCACG CACGCGGGCG GCGTCACGCT GAACGACTGG GGCTGGCACG TGCTGAATCA CGACCTGCCG TTCGGTGGCA CCGGCACGTC CGGCATGGGC AACTATCACG GCGCCGAGGG CTTCCGCGAG CTATCGCACG CCAAGGCCGT GTTCTCCGAG CGGAAGTGGT TCCCGATCGA ATTGTTTCAT CCGCCCTACG GCACGTTCAT CCAGCGCCTG ACGCTGCGGA TGTTCCTGGG GCGGCCTACC GCAGCGCCCC CCGTGCAAAC CGAAGCCTCG CCAGTGACGG CATCCCAAGA CTAA
|
Protein sequence | MERLFQAQRS AFAAAPYPAL ADRKQGLRRL RDAVRKHADS LAQAADRDFG GRAHAETMMV DVLPSVLHIN HLLGGLRRWM RPSRRSVELL FLGNSAKVVY QPKGVVGIVV PWNFPVYLAL GPLATALAAG NRCLIKTSEF APRTSAALRA LLAEAFTEDE VAIVEGDADV ARQFTGLPFD HLVFTGSPEV GRHVMRAAAE HLTPVTLELG GKSPAIVSSS ANLATAARRI AHGKTVNAGQ ICVAPDYALV PEALAARFAT EVLNASAALF PADRDDYASI IHDRAHQRMN DLLDDARAHG AQVMRSAMPE DGRRVPLHVV LGVTPRMRIA REEIFGPILP VFTYRAFEDV VTHIRGGTRP LAMYYFGDDK AESTTLLQRT HAGGVTLNDW GWHVLNHDLP FGGTGTSGMG NYHGAEGFRE LSHAKAVFSE RKWFPIELFH PPYGTFIQRL TLRMFLGRPT AAPPVQTEAS PVTASQD
|
| |