Gene Rleg2_6113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6113 
Symbol 
ID6983186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp40250 
End bp41650 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content60% 
IMG OID643399136 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002283892 
Protein GI209551976 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.401659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.835317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA TTATTGAGAT CTTTTCTCCG TTCGATACGA GCCGGATTGG ACAGGTGGCA 
GCGGCAAGTC CAGTTGAGAT CGAGCGAGCA CTTGAAACAG CCTACGCGCT GTTCCGCGAC
CGTCGCCAGT GGCTATCCAA GCAGAAACGG ATTGAAATTC TCAAGCGCGC GGCAGCGATT
ATTACCCAGA GGCGCGAGGA ACTTGCTCGC CAAGCTGCGT CCGAGGGCGG GAAACCGCTA
CGCGACTCCC TTATCGAGGT CGATCGCGGT GTCGATGGCA TCCATACCTG CGTCGAGGAA
CTGCGGACCA AAGCCGGGCA AGTCGTGCCG ATGGACCTCA ACGCCACGTC TGCCGGGCGT
GTGGCGTTCA CCCAATACGA GCCGATCGGC GTTGTGGTCG GCGTCAGTGC GTTCAACCAT
CCCTTCAATC TGGTTGTGCA TCAGCTCGCC CCCGCGGTCG CCGTCGGCGC TCCCGTAATC
CTTAAGCCCG CGACGACGAC ACCTCTCTCC TGCAGGTCGC TTGTTGAAAT ATTCCGCGAA
GCTGGTCTGC CCGAGGGTTG GGCCCAGATG GTGGTACCCG AGACGAACGA GCTTGCGACC
CGCTTGGTAA CGGATCCACG CGTCGGGTTC TTCTCGTTCA TTGGTTCGGC GGCGGTGGGA
TGGTCCTTGC GCTCGAAGAT CGCTCCCGGG ACCAGGTGCG CACTCGAACA TGGCGGCGTC
GCTCCCGTCA TCGTGTCTCA CGATGCCGAT TTGGATATGG CGATGCCCAA GGTGGCGCGG
GGCGCTTTCT GGCATGCCGG CCAGGCATGT GTCGGTGTCC AGCGGGTATT CTGCCACAAC
AGCGTTGTCG ACGACGTGGC GGAGCGACTG GGCGCGCTGG GAGAGAGAAT GGTGATCGGA
GATCCTCTGT CGATGGCGAC CGAAGTAGGC CCACTGATTT CGCGAAAAGA GCTGGAGCGC
GTTGGCCGAT GGGTCGACGA CGCGGCAAGC GGAGGGGCCA AGCTTATTTC AGGCGGGAAA
AGGATTTCCG AAAGCTGCTA TTCCAATACG GTTCTTCTCA ATCCCCGACC CGATTCCGAT
GTCATGCTCA AGGAGGTGTT TGGGCCTGTC GTCTGCGTGT ACGGCTACGA CGACATCGAT
TCAGCCATCT CGCTTTCCAA TAGCCTGCCG TATTCATTCC AGGCGGCGGT CTTCACGAGC
AGGCTCGACA CCGCGATGCA CTGCTACCGG CACCTCGACG GCACCGCGAT AATGGTCAAC
GAGAACACGC TCTTCCGCGT CGACTGGATG CCGTTTTCCG GTGCTCGGCA ATCCGGACAC
GGTGTCGGCG GGATGCCCTA CACCATGCAT GAAATGCAGA CGGAAAAGAT GATGGTCTGG
CGCTCCGACG CGCTCGCCTA G
 
Protein sequence
MADIIEIFSP FDTSRIGQVA AASPVEIERA LETAYALFRD RRQWLSKQKR IEILKRAAAI 
ITQRREELAR QAASEGGKPL RDSLIEVDRG VDGIHTCVEE LRTKAGQVVP MDLNATSAGR
VAFTQYEPIG VVVGVSAFNH PFNLVVHQLA PAVAVGAPVI LKPATTTPLS CRSLVEIFRE
AGLPEGWAQM VVPETNELAT RLVTDPRVGF FSFIGSAAVG WSLRSKIAPG TRCALEHGGV
APVIVSHDAD LDMAMPKVAR GAFWHAGQAC VGVQRVFCHN SVVDDVAERL GALGERMVIG
DPLSMATEVG PLISRKELER VGRWVDDAAS GGAKLISGGK RISESCYSNT VLLNPRPDSD
VMLKEVFGPV VCVYGYDDID SAISLSNSLP YSFQAAVFTS RLDTAMHCYR HLDGTAIMVN
ENTLFRVDWM PFSGARQSGH GVGGMPYTMH EMQTEKMMVW RSDALA