Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4844 |
Symbol | |
ID | 6977938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 486066 |
End bp | 487556 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643394005 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002278823 |
Protein GI | 209546905 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.256973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.013402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGCG AACTCTATAT CGACGGACAA TGGGTAAAGC CGGTCAAGGG CGGCACCTGC ACGGTGACCA ATCCCGCGAC GGAAGAGGTG ATCCAGACGA TCGGCGCCGC AACGCGCGAG GATGTCGATC TTGCCGTCAA CGCGGCGCGC CGCGCCTTCG ACAAGGACGG CTGGCCGAAG CTGACGGGAG CCCAGCGCGC GCGGTATCTC CGTGCGATCG CCGACGGCAT CCGCGCCCGG CAGGCCGAGA TCGCCCGCCT CGAAGTCCTC GACAACGGCA AGCCGTTCCC CGAGGCCGAT TGGGACGTTG CCGACGCGGC GGGCTGCTTC GATTTCTATG CGGGGCTCGC CGAGCAGCTC GACAACAATC CCGAGGAGGC GATCACGCTT CCCGATCAGC GCTTCACCTC CAAGGCGGTG CGTGAGCCGC TCGGCGTGGC CGGCGCGATC ATCCCCTGGA ATTATCCATT GCTGATGGCA GCCTGGAAGG TTGCTCCGGC ACTTGCCGCC GGCTGCACCG TGGTGCTGAA GCCCGCCGAA TTGACGTCGC TGACGGCGCT GGAACTGGCG GCGGTTGCCG ATGAGGCCGG GCTGCCGGCG GGCGTGCTCA ATATCGTCAC GGGAGCCGGG TCGGTCGCCG GGCAGGCAAT CATCGATCAC AAGCAGGTGG ACAAACTTGC CTTTACCGGC TCCGGGCCGG TCGGCTCGAA AATCATGGCG GCGGCGGCCC GCGACATCAA GCGTGTCAGT CTCGAACTCG GCGGCAAGTC GCCCTTCGTC GTCTTCGAAG ACGCCGATAT CGACAAAGCC GTCGAATGGA TCATGTTCGG CATCTTCTGG AACCAGGGCC AGGTCTGCTC GGCGACGTCG AGAGTCCTCG TGCAGGACTC CATATACGAG CGATTGCTTG CACGGCTCAT CGAGGAAACC AGCAAGATCA AGATCGGCAA CGGTCTGGAC GAGGGCGTCC TCCTCGGGCC GCTGGTTTCC AAGCGCCAGC ACGAGCAGGT CGTTGCCGCG ATCGAATCGG CCCGGCAGGC CGGCGCAACG GTCGCCTGCG GCGGAGCGCG CCCAGAAGGT TTTGACAAGG GCTACTACCT CCAGCCGACC ATTCTGACGG ATGTTCCGCT CGACAGCGCC GCCTGGGAGG AGGAGATCTT CGGGCCTGTC GTCTGCATAA GGCCGTTCAA GACCGAAGAG GAGGCGATCG CGCTCGCCAA TGATTCCCGC TTCGGGCTTG CCGCCGCCGT CATGTCGAAG GACGACATCC GGGCCGAACG TGTTGCGGCC GCCTTCCGCG CCGGCATCGT CTGGATCAAC TGCTCGCAGC CGACCTTCAC CGAGGCGCCC TGGGGCGGCT ACAAGGAATC CGGCATCGGC CGCGAACTCG GGCGCTGGGG CCTCGACAAT TATCTCGAGA CCAAGCAGAT CACCCGCTTC GCCAGCGAGG AGCCCTGGGG CTGGTACATC AAGCCGGAGG CGGCCGAATG A
|
Protein sequence | MRSELYIDGQ WVKPVKGGTC TVTNPATEEV IQTIGAATRE DVDLAVNAAR RAFDKDGWPK LTGAQRARYL RAIADGIRAR QAEIARLEVL DNGKPFPEAD WDVADAAGCF DFYAGLAEQL DNNPEEAITL PDQRFTSKAV REPLGVAGAI IPWNYPLLMA AWKVAPALAA GCTVVLKPAE LTSLTALELA AVADEAGLPA GVLNIVTGAG SVAGQAIIDH KQVDKLAFTG SGPVGSKIMA AAARDIKRVS LELGGKSPFV VFEDADIDKA VEWIMFGIFW NQGQVCSATS RVLVQDSIYE RLLARLIEET SKIKIGNGLD EGVLLGPLVS KRQHEQVVAA IESARQAGAT VACGGARPEG FDKGYYLQPT ILTDVPLDSA AWEEEIFGPV VCIRPFKTEE EAIALANDSR FGLAAAVMSK DDIRAERVAA AFRAGIVWIN CSQPTFTEAP WGGYKESGIG RELGRWGLDN YLETKQITRF ASEEPWGWYI KPEAAE
|
| |