Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4903 |
Symbol | |
ID | 6977997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 541435 |
End bp | 542874 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394060 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002278878 |
Protein GI | 209546960 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0679006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.531166 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCG TCACCCCAAT CGCCGCCGAT ACCAGTCCCG TTGCCATGAA GGCGCTGCTT GCCCGGCAAC GGGAGTCCTT CCTGAAGGAC GGGCCACCTG ATATCGACAC CCGCATCGAT CGCATCGACC GCGTCATCGG CCTTCTCGTC GATCATAAGG ATGCGATTGC CGCAGCGCTG TCGGAGGATT TCGGCAGCCG CAGTGTCGAG GCAAGCCTGC TGCTCGACGT TTTCACCTGC GTCGGTTCGC TCAAATATGC CAAGGCCCAT CTGGCCGAAT GGCTGAAGCC GGAAGAACAC GAGGCATTGT TCCCCGATGC GGTCGCCAAA GTGGTCTATC AGCCGAAGGG CGTGGTCGGG ATCCTCAGCC CCTGGAATTT TCCCTATCAG CTCGCCCTTG CGCCGCTTGC CGGCATTCTC GCCGCCGGCA ATCGCGCCAT GATCAAGCCC TCGGAGGTGA CGCCGGCTTC GTCATCACTG ATGGCGGAAC TCGTCGCCCG CGCTTTCGAC GACACGGAAA TCGCCGTCGT GCAGGGTGGA CCGGCGACGG GCACGGCCTT CACATCCCTT GCTTTCGACC ATCTGATCTT CACCGGCGGC ACGGCAATCG CCCATCATGT CATGCGGGCG GCAGCAGAGA ACTTGACACC GCTGACGCTT GAACTCGGCG GCAAGTCGCC CGTCGTCGTC GGCCGCTCGG CCGATCTCGC TGATGCGGCC CGCCGTGTCA TGACCGTCAA GACGCTGAAT GCCGGGCAGA TCTGCCTGGC ACCCGACCAT GTCTATGTGC CCGAGGAAAG CGTCGAAGCC TTTGCCGAGC ACGCGGTAGC GGCGGTCGCC GCAATGTATC CGACGCTGAA GGAAAATCCG GATTACACTT CGATCGTCAA TGCCCGCCAT CATGCCCGTG TTCAGGGGCT GATCGACGAC GCCAGGGCCA AGGGCGCCCG TATCGTCGAG ATCAACCCGG CGGCGGAAAA TTTCTCGCAG CAACCCGCCC ACCGCATGCC GCCGACCCTG ATCCTCGATC CCACCGAGGA TATGCGCGTG CTGCAGGAGG AAACCTTCGG GCCGGTGCTG CCGGTTCTTC CCTATCGCGA GATCGCCGAC GTCATCGATC GCATCAACGC CCGGCCGCGC CCGCTCGCTC TCTATTACTT CAGCCAGGAT GGCGAAGAGG AGGCCCGCGT TCTCGACAAT ACGACCTCCG GCGGCGTGAC GGTCAACGAC TGCATGAGCC ACGTCACGGC CGAAGGCCTG CCCTTCGGCG GTGTCGGCCA TTCCGGCATG GGCGCCTATC ACGGCAAGTT CGGCTTCCTC GCCTTCTCGC ATCCGCGCGC CGTCTATCAC CAGAGCAAGA TGGTGGAAGC GGAATATATG ATGCGGCCGC CCTTTGGCGA GGCGATGCGC GGCTTTCTGG CCGGGGCGAT CTGCAAGTAA
|
Protein sequence | MNTVTPIAAD TSPVAMKALL ARQRESFLKD GPPDIDTRID RIDRVIGLLV DHKDAIAAAL SEDFGSRSVE ASLLLDVFTC VGSLKYAKAH LAEWLKPEEH EALFPDAVAK VVYQPKGVVG ILSPWNFPYQ LALAPLAGIL AAGNRAMIKP SEVTPASSSL MAELVARAFD DTEIAVVQGG PATGTAFTSL AFDHLIFTGG TAIAHHVMRA AAENLTPLTL ELGGKSPVVV GRSADLADAA RRVMTVKTLN AGQICLAPDH VYVPEESVEA FAEHAVAAVA AMYPTLKENP DYTSIVNARH HARVQGLIDD ARAKGARIVE INPAAENFSQ QPAHRMPPTL ILDPTEDMRV LQEETFGPVL PVLPYREIAD VIDRINARPR PLALYYFSQD GEEEARVLDN TTSGGVTVND CMSHVTAEGL PFGGVGHSGM GAYHGKFGFL AFSHPRAVYH QSKMVEAEYM MRPPFGEAMR GFLAGAICK
|
| |