Gene Rleg2_5410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5410 
Symbol 
ID6978504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1053807 
End bp1055324 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content66% 
IMG OID643394512 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002279330 
Protein GI209547412 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.263721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00137707 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCTGGA CCCCTTCAGG CAGGCATCTT ATCGCCGGCG AGTGGATTGC CGGAACGACG 
ACCTTTCGCT CCGAGCCGGC GCACGGCCCG GCTCATGATT TCGCCGTCGG CACCACGGAA
CTGGTCGACC GCGCCTGCCG CGCAGCCGAG GCCGCTTTTG CGGCGTTTTC CGCAAAGACA
TGCGAGGAGC GCGCCATTTT CCTCGAGATC ATCGCCGAGG AAATCGACAG ACGTGGCGAG
GCCGTCACCC TGATCGGAAC CGAGGAAACC GGGCTGCCGG AAGGCCGGCT CAATGGCGAG
CGCGCCCGCA CCACCGGTCA GCTCAAGCTG TTTGCCGAGC ATATCCGCAA GGGCGCGCAT
CTTGACGCTC GCATCGATGC GGCGCAACCC GATCGGCAAC CGGCGCCGCG GCCGGAGATC
CGTCTGGTGC AGCGGCCGAT CGGCCCGGTC GCCGTCTTCG GCGCCTCGAA TTTTCCGCTG
GCATTTTCGA CGGCCGGTGG TGATACGGCC GCTGCGCTTG CCGCCGGTTG CCCTGTCGTG
GTGAAAGGAC ATTCAGCCCA TCCCGGCACC GGTGAGATCA TTGCCGAGGC GATCGCAGCG
GCCGTCGAAC GCACCCAAAT GCCGGCCGGC GTCTTCAGCC TGATCCAGGG CGGGCGCCGC
GATGTCGGCA CCGGCCTGGT GACGCATCCC GCCATCAAGG CGGTCGGCTT TACCGGATCG
CTTGCCGGCG GTCGTGCGCT GTTCGACCTT TGCGCCCAGC GCCCCGAGCC GATCCCGTTT
TTCGGTGAAC TCGGCAGCGT CAATCCCATG TTCCTGCTGC CGGCGGCGAC CGCCGCCCGG
GCGGAGGCGA TCGGTTCAGG CTGGGCTGGC TCACTGACGC TTGGCGCCGG CCAGTTCTGC
ACCAAGCCCG GTATCGCCGT CGTGGTCGAT GGGCCGGAGG CGGACAGGTT TACCGGCGCT
GCCAAATCGG CTCTCGAAAA GGTGGCGCCA CAGACGATGT TGACCCAAGG CATCGCCGCC
GCCTATCACG ACGGTGTCGA GCGCATGCGG GCGAGCAATG CCGTCGCGCC GGTTCTTTCC
GTTGAGAGTG CTGGCCGGGA CGCCGCCCCG AACCTGTTCG AGACCAATGG CTCGGCCTGG
CTTGCCGATC ATTCGCTCAG CGAAGAGGTC TTCGGCTCTC TCGGTCTCGT CGTGCGGGTT
GGCTCGCCGG AAGAGATGCT CACCCTTGCC GAAAGCTTCC AGGGACAGCT GACTGCGACG
ATCCATATGG ATGACTCAGA TCTTGGCCTT GTCCGCGACC TGCTGCCGAT CCTCGAAAGG
AGGGCGGGCA GATTGCTGGT CAATGGCTTC CCAACCGGCG TCGAGGTTGT CGATTCCATG
GTGCATGGTG GACCTTATCC GGCCTCGACC AATTTCGGTG CGACCAGCGT CGGGACCATG
TCGATCCGCA GGTTTCTGCG CCCCGTCGCT TACCAGAATT TCCCCGCCGG CCTGTTGCCC
CAGGATCTGC GCAACTGA
 
Protein sequence
MSWTPSGRHL IAGEWIAGTT TFRSEPAHGP AHDFAVGTTE LVDRACRAAE AAFAAFSAKT 
CEERAIFLEI IAEEIDRRGE AVTLIGTEET GLPEGRLNGE RARTTGQLKL FAEHIRKGAH
LDARIDAAQP DRQPAPRPEI RLVQRPIGPV AVFGASNFPL AFSTAGGDTA AALAAGCPVV
VKGHSAHPGT GEIIAEAIAA AVERTQMPAG VFSLIQGGRR DVGTGLVTHP AIKAVGFTGS
LAGGRALFDL CAQRPEPIPF FGELGSVNPM FLLPAATAAR AEAIGSGWAG SLTLGAGQFC
TKPGIAVVVD GPEADRFTGA AKSALEKVAP QTMLTQGIAA AYHDGVERMR ASNAVAPVLS
VESAGRDAAP NLFETNGSAW LADHSLSEEV FGSLGLVVRV GSPEEMLTLA ESFQGQLTAT
IHMDDSDLGL VRDLLPILER RAGRLLVNGF PTGVEVVDSM VHGGPYPAST NFGATSVGTM
SIRRFLRPVA YQNFPAGLLP QDLRN