Gene Rleg2_5500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5500 
Symbol 
ID6978594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1148832 
End bp1150328 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content66% 
IMG OID643394599 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002279417 
Protein GI209547499 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00406533 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCCG TTTTTGCCCG CCCCGCCTAT CATGATGCGC TGTCGCAGCT TGCCGACCGT 
CATCTCCTGC GCGATCTCGC CTATGTCGGC GGCCGCTGGA TCGCCGGCAA AACCGGTGAA
AGTTTCGAGG TTACCGATCC GGCCTCCTCC GCGACGCTGG CTTGGGTTGC AAGCCTTGGC
ACGGACGAGA CAGAGGCGGC GATCGATGCA GCGTCAGAGG CTTTTGCCGC CTGGCGCGCC
ATGCTGCCGC AGAACCGGGC GGCGATCCTG CGGAAGTGGC ACGAGTTGAT GCTCGCGGCC
AAGGAGGATC TGGCGCTGCT CATGACGCTC GAACAAGGTA AGCCGCTTTC GGAATCGCGT
GGCGAAATCG ACTATGCCGC CTCTTTCCTC GAATGGTATG CCGAGGAAGG CAAGCGGCTG
AATGCCGAAA GCGTCACCAG CCACCTGCCC GGGGCGGAAA TGATCGTCCG GCGCGAGGCG
CTCGGCGTCG TCGGTATCGT CACGCCCTGG AATTTTCCCT CGGCCATGGT CACGCGCAAG
GCTGCCGCCG CCCTTGCCGC CGGCTGCACG GTCGTCGCTC ACCCGTCCTC CGAGACGCCG
CTTTCCGCAC TTGCCCTTGC AGAACTCGGC GAACGGGCCG GCATTCCCCC AGGTGTCTTT
AATGTCGTGA CCGGCCATGC AGCAACGATC GTCGGACGAA TGTGCGCCGA CGCCCGCCTG
CGTGCGATCA GCTTCACCGG CTCCACCGAA GTCGGCCGCC TGATTGCCGC TCAATGCGCC
CCGACCATGA AGCGGCTGGT GATGGAACTC GGCGGCCACG CGCCGCTGAT CGTCTTCGAT
GATGCCGATA TCGCCAAGGC GGTCGAGATC GCCGTCGATG CCAAGTTTGC CACGTCGGGC
CAGGATTGCC TTGCCGCCAA CCGGATCTTC GTCCAGTGCG GCATCGCCGA TGCCTTCGCC
AAGGCTTTTG CCGCCCGCAT CGGGGAGCTC AAGGTCGGCG CCGGGCTCGA GGACGGCGCC
GAGATCGGAC CGCTGATGCA TGAGCGCGCC GTCGTCAAGG TCGAGGAGCA GGTCGCCGAC
GCGCTGGCGC AGGGCGCGCG CCTCGTCACC GGCGGCAAGC GCCATAAGGC CGGCCGGCTC
TTCTACGAGC CGACGCTTCT GACCGATGTG CCGGCGAGTG CGCTGATCAT GCGCGAGGAG
ACCTTCGGGC CTGTGGCGGC GCTGACCACC TTCGACACCG AAGAAGAGGT CGTCGCCCGC
GCCAATGATA CCGAATACGG CCTGGTCGCC TATGTCGTCA CCGAAAATGG CGCCCGCCAG
ATGCGCCTCG GCCGCGCGCT GGAATACGGC ATGGTGGCGG TCAACCGCGT GAAGATCACC
GGCGGCCCTA TTCCCTTCGG CGGCTGGAAG CAGTCCGGCC TCGGCCGCGA GGGCTCACGC
CACGGGCTCG AAGCCTTCAC CGAGCTCAAA TATCTCTGCA TCGACACCGC CGCTTAA
 
Protein sequence
MTAVFARPAY HDALSQLADR HLLRDLAYVG GRWIAGKTGE SFEVTDPASS ATLAWVASLG 
TDETEAAIDA ASEAFAAWRA MLPQNRAAIL RKWHELMLAA KEDLALLMTL EQGKPLSESR
GEIDYAASFL EWYAEEGKRL NAESVTSHLP GAEMIVRREA LGVVGIVTPW NFPSAMVTRK
AAAALAAGCT VVAHPSSETP LSALALAELG ERAGIPPGVF NVVTGHAATI VGRMCADARL
RAISFTGSTE VGRLIAAQCA PTMKRLVMEL GGHAPLIVFD DADIAKAVEI AVDAKFATSG
QDCLAANRIF VQCGIADAFA KAFAARIGEL KVGAGLEDGA EIGPLMHERA VVKVEEQVAD
ALAQGARLVT GGKRHKAGRL FYEPTLLTDV PASALIMREE TFGPVAALTT FDTEEEVVAR
ANDTEYGLVA YVVTENGARQ MRLGRALEYG MVAVNRVKIT GGPIPFGGWK QSGLGREGSR
HGLEAFTELK YLCIDTAA