Gene Rleg_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3171 
Symbol 
ID8014070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3173490 
End bp3174923 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content63% 
IMG OID644825737 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002976965 
Protein GI241205869 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.427314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.922652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTT ATCAAAACCT GATCGCCGGC GAATGGGTCG GCTCGAACGC GACGAAGAAT 
ATCAACCCAT CGGATACGAA TGAGGTCGTC GGCCTCTATG CCGATGGCAG CGCAGATGAC
ACGAGAAACG CCATTGCTGC CGCCAAGGCT GCCTTCCCAG CCTGGTCGCG CTCCGGCATC
TGGGAGCGCC ACGTCATCCT GAAGAAGACA GGCGACGAGA TCATGGCGCG CAAGGATGAG
CTCGGTGCGC TGCTTGCCCG CGAAGAAGGC AAGACGCTGC CGGAAGCCAC CGGCGAGGTG
ATCCGCGCGT CGCAGATTTT CGAATTCTTC GCCGGCGAGG CACTGCGGCT GGCCGGTGAG
GTCCTTCCGT CGGTTCGCCC GAATATCGGT GTCGAGATCA CCCGCGAGGC GCTCGGCGTC
ATCGGCATCA TTACGCCGTG GAACTTCCCG ATCGCCATTC CCGCCTGGAA GATCGCGCCG
GCGCTCTGCT ACGGCAACAC TATCGTCTTC AAGCCGGCCG AACTGGTGCC CGCCTGTTCC
TGGGCGATCG TCGATATCCT GCACCGCGCC GGCCTGCCGA AAGGCGTGCT GAACCTCGTC
ATGGGCAAGG GCTCGGTCGT CGGCCAGGCC ATGCTCGAAA GCCCCGACGT TCACGGCATC
ACCTTCACCG GTTCCACCGG CACCGGCAGA CGCGTCGCCG CCGCCTCCAT CGAGCATAAC
CGCAAGTTCC AGCTGGAAAT GGGCGGCAAG AACCCGATGG TCGTGCTCGA CGATGCCGAT
CTCAACGTCG CCGTCGAGGC GGCCGCCAAT TCCGGCTTCT TCTCGACCGG CCAGCGTTGC
ACCGCTTCCT CGCGGCTGAT CGTCACCGAA GGCATTCACG ACAAGTTCGT CGCAGCGCTG
ACCGATAAGC TGAAGACGCT GGTCGTCGAC AACGCCCTGA AGGCCGGCAC TCATATCGGC
CCCGTCGTCG ATGAGCGGCA GTTGAAGACC GATACCGACT ATATCGAGAT CGGCAAGTCG
GAAGGCGCCA AACTCGCCTT TGGCGGCGAG GTGATCTCCC GCGAAACGCC CGGCTTCTAT
CTGCAGCCGA CGCTGTTCAC CGAAGCGACC AACCAGATGC GGATCTCGCG CGAGGAGATC
TTCGGGCCCG TGGTGTCGGT AATCCGGGCG AAGGATTACG ACGAGGCGCT GGCGACGGCC
AATGACACGC CGTTCGGCCT TTCGGCCGGC ATCGCCACGA CTAGCCTGAA ACATGCCACG
CATTTCAAGC GCAATTCCGA GGCCGGCATG GTGATGGTCA ACCTGCCGAC GGCAGGCGTC
GATTTCCACG TGCCGTTCGG CGGCCGCAAG GGCTCGTCTT ACGGCCCGCG CGAGCAGGGC
AAGTACGCCG CCGAATTCTA CACAACCGTC AAGACCGCCT ACACCCTGGC TTGA
 
Protein sequence
MTIYQNLIAG EWVGSNATKN INPSDTNEVV GLYADGSADD TRNAIAAAKA AFPAWSRSGI 
WERHVILKKT GDEIMARKDE LGALLAREEG KTLPEATGEV IRASQIFEFF AGEALRLAGE
VLPSVRPNIG VEITREALGV IGIITPWNFP IAIPAWKIAP ALCYGNTIVF KPAELVPACS
WAIVDILHRA GLPKGVLNLV MGKGSVVGQA MLESPDVHGI TFTGSTGTGR RVAAASIEHN
RKFQLEMGGK NPMVVLDDAD LNVAVEAAAN SGFFSTGQRC TASSRLIVTE GIHDKFVAAL
TDKLKTLVVD NALKAGTHIG PVVDERQLKT DTDYIEIGKS EGAKLAFGGE VISRETPGFY
LQPTLFTEAT NQMRISREEI FGPVVSVIRA KDYDEALATA NDTPFGLSAG IATTSLKHAT
HFKRNSEAGM VMVNLPTAGV DFHVPFGGRK GSSYGPREQG KYAAEFYTTV KTAYTLA