Gene Rleg2_4844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4844 
Symbol 
ID6977938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp486066 
End bp487556 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content65% 
IMG OID643394005 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002278823 
Protein GI209546905 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.256973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.013402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGCG AACTCTATAT CGACGGACAA TGGGTAAAGC CGGTCAAGGG CGGCACCTGC 
ACGGTGACCA ATCCCGCGAC GGAAGAGGTG ATCCAGACGA TCGGCGCCGC AACGCGCGAG
GATGTCGATC TTGCCGTCAA CGCGGCGCGC CGCGCCTTCG ACAAGGACGG CTGGCCGAAG
CTGACGGGAG CCCAGCGCGC GCGGTATCTC CGTGCGATCG CCGACGGCAT CCGCGCCCGG
CAGGCCGAGA TCGCCCGCCT CGAAGTCCTC GACAACGGCA AGCCGTTCCC CGAGGCCGAT
TGGGACGTTG CCGACGCGGC GGGCTGCTTC GATTTCTATG CGGGGCTCGC CGAGCAGCTC
GACAACAATC CCGAGGAGGC GATCACGCTT CCCGATCAGC GCTTCACCTC CAAGGCGGTG
CGTGAGCCGC TCGGCGTGGC CGGCGCGATC ATCCCCTGGA ATTATCCATT GCTGATGGCA
GCCTGGAAGG TTGCTCCGGC ACTTGCCGCC GGCTGCACCG TGGTGCTGAA GCCCGCCGAA
TTGACGTCGC TGACGGCGCT GGAACTGGCG GCGGTTGCCG ATGAGGCCGG GCTGCCGGCG
GGCGTGCTCA ATATCGTCAC GGGAGCCGGG TCGGTCGCCG GGCAGGCAAT CATCGATCAC
AAGCAGGTGG ACAAACTTGC CTTTACCGGC TCCGGGCCGG TCGGCTCGAA AATCATGGCG
GCGGCGGCCC GCGACATCAA GCGTGTCAGT CTCGAACTCG GCGGCAAGTC GCCCTTCGTC
GTCTTCGAAG ACGCCGATAT CGACAAAGCC GTCGAATGGA TCATGTTCGG CATCTTCTGG
AACCAGGGCC AGGTCTGCTC GGCGACGTCG AGAGTCCTCG TGCAGGACTC CATATACGAG
CGATTGCTTG CACGGCTCAT CGAGGAAACC AGCAAGATCA AGATCGGCAA CGGTCTGGAC
GAGGGCGTCC TCCTCGGGCC GCTGGTTTCC AAGCGCCAGC ACGAGCAGGT CGTTGCCGCG
ATCGAATCGG CCCGGCAGGC CGGCGCAACG GTCGCCTGCG GCGGAGCGCG CCCAGAAGGT
TTTGACAAGG GCTACTACCT CCAGCCGACC ATTCTGACGG ATGTTCCGCT CGACAGCGCC
GCCTGGGAGG AGGAGATCTT CGGGCCTGTC GTCTGCATAA GGCCGTTCAA GACCGAAGAG
GAGGCGATCG CGCTCGCCAA TGATTCCCGC TTCGGGCTTG CCGCCGCCGT CATGTCGAAG
GACGACATCC GGGCCGAACG TGTTGCGGCC GCCTTCCGCG CCGGCATCGT CTGGATCAAC
TGCTCGCAGC CGACCTTCAC CGAGGCGCCC TGGGGCGGCT ACAAGGAATC CGGCATCGGC
CGCGAACTCG GGCGCTGGGG CCTCGACAAT TATCTCGAGA CCAAGCAGAT CACCCGCTTC
GCCAGCGAGG AGCCCTGGGG CTGGTACATC AAGCCGGAGG CGGCCGAATG A
 
Protein sequence
MRSELYIDGQ WVKPVKGGTC TVTNPATEEV IQTIGAATRE DVDLAVNAAR RAFDKDGWPK 
LTGAQRARYL RAIADGIRAR QAEIARLEVL DNGKPFPEAD WDVADAAGCF DFYAGLAEQL
DNNPEEAITL PDQRFTSKAV REPLGVAGAI IPWNYPLLMA AWKVAPALAA GCTVVLKPAE
LTSLTALELA AVADEAGLPA GVLNIVTGAG SVAGQAIIDH KQVDKLAFTG SGPVGSKIMA
AAARDIKRVS LELGGKSPFV VFEDADIDKA VEWIMFGIFW NQGQVCSATS RVLVQDSIYE
RLLARLIEET SKIKIGNGLD EGVLLGPLVS KRQHEQVVAA IESARQAGAT VACGGARPEG
FDKGYYLQPT ILTDVPLDSA AWEEEIFGPV VCIRPFKTEE EAIALANDSR FGLAAAVMSK
DDIRAERVAA AFRAGIVWIN CSQPTFTEAP WGGYKESGIG RELGRWGLDN YLETKQITRF
ASEEPWGWYI KPEAAE