Gene Rleg2_5062 details


Gene Information

Locus tag: Rleg2_5062
Symbol:
ID: 6978156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 709450
End bp: 710871
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 60%
IMG OID: 643394200
Product: Aldehyde Dehydrogenase
Protein accession: YP_002279018
Protein GI: 209547100
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
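
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(709450, 710871)
print(length_bp)                  # 1422
print(protein_length(length_bp))  # 473
```

Both values agree with the Gene Length and Protein Length fields of the record.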


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.338258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCAGA ACGCGGTCCG CTCAACTCCG CTTCAATCAG ATGCCATCGT TGTCTGCGAC 
CCTTTCAACG GCGCGGCACT TGGATCGGTA ACCGAAACGA GCGCGAACGC CGTTCCGGAA
CTGCTTGATC GCGCTAAAAG CGGCGCCCGC ATCGCGCGCG CCCTGCCCAG GCACCAGCGC
TCGGCCATCC TCGAGAAGGC CGCATCACTG ATTGCCGCCG ATAAAGAAGA CTTCGCGACC
CTTATCGTCC GTGAAGCAGG CAAGACAATC ACACAGGCCC GCAAGGAAGT GACCCGCTGC
GTCAACACGC TGAAGCTTTC CGCGGACGAG GCGAAGCGCA ACGCGGGCGA AGTCATTCCA
TTTGACAGCT ACGCGGGTTC GGAGTCCCGT CAGGGCTGGT ATACCCGCGA ACCTCTTGGG
ATCATTGCGG CCATCACGCC CTACAATGAC CCTCTGAACC TCGTCGCCCA CAAGCTGGGC
CCCGCGATCG CGGGTGGAAA TGCGATTCTT CTCAAGCCAT CGGAACTCAC TCCGCTTTCC
GCCGTCAGGC TGGTGGGCAC CCTCGTGACG GCGGGCCTTC CGGAAGAAGT GATAACCGTA
GCTATCGGCG GCGCGGAGCT TGGTAAAGCC ATTGTCTCGG CGAAGGATAT CCGCATGATC
TCCTTCACTG GCGGCTTTGC AACCGGCGAA GCGATCTCGA AAACGGCAGG ACTAAAGAAG
TTTTCCATGG ACCTCGGCGG CAACGCCCCT GTTATCGTCA TGGAAGACAG CAATTTCGAC
GCGGCGGTTG CGGGTTGCAT ATCGGGTGCT TACTGGGCAG CTGGTCAGAA TTGCATCGGC
ACGCAGCGCA TCCTTGTTCA GCGCCGGATT TATAAACGCT TCCGGGACGA GTTCGTCGCG
CGGACGCTCA AGCTCAAGAC GGGCAACCCG TTGGAGAGTG ACACCGATGT CGGCCCCATG
ATTACCGAAA AGGCCGTCGA CCGCGCTGCC GCCATGGTCG AGCGTGCTAT TCAGGGAGGA
GCAACACTTC TCTGCGGTCA CCAGCCGTCC GGCAACTTGT ATCCGCCGAC CGTTCTCGAA
AACGTTCCGG CGACCTGCGA TGCATGGAGT GAGGAAGTGT TCGCACCAAT CGTCATCCTT
GAGCCGTTTG ACAGCATCGC AGAAGCCATA GACCTCGCCA ATAGCCCGGA ATATAGCCTG
CACGCGGGTA TCTTCACGAA CGATCTAGAA GATGCACTCG ACGCGGCCGA TCGCATCGAT
GCGGGCGGTG TCATGATCAA CGACTCCTCC GACTATCGCT TCGACGCGAT GCCCTTCGGT
GGCTTCAAGT ACGGCAGCAT GGGTCGCGAA GGCGTCCGCT TCGCCTACGA GGATATGACG
CAGCCAAAGG TCGTCTGCAT CAACAGGCTG AAGCGCTCGT GA
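
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be measured. A minimal sketch of that clean-up, together with a simple GC-fraction calculation, might look like this; the helper names are hypothetical, and the record does not say how its own GC content figure was computed.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and newlines used for 10-base block formatting."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above, used here only as sample input.
first_row = "ATGATGCAGA ACGCGGTCCG CTCAACTCCG CTTCAATCAG ATGCCATCGT TGTCTGCGAC"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))
```

Applying the same two functions to the full sequence text would allow a comparison with the Gene Length and GC content fields.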
 
Protein sequence
MMQNAVRSTP LQSDAIVVCD PFNGAALGSV TETSANAVPE LLDRAKSGAR IARALPRHQR 
SAILEKAASL IAADKEDFAT LIVREAGKTI TQARKEVTRC VNTLKLSADE AKRNAGEVIP
FDSYAGSESR QGWYTREPLG IIAAITPYND PLNLVAHKLG PAIAGGNAIL LKPSELTPLS
AVRLVGTLVT AGLPEEVITV AIGGAELGKA IVSAKDIRMI SFTGGFATGE AISKTAGLKK
FSMDLGGNAP VIVMEDSNFD AAVAGCISGA YWAAGQNCIG TQRILVQRRI YKRFRDEFVA
RTLKLKTGNP LESDTDVGPM ITEKAVDRAA AMVERAIQGG ATLLCGHQPS GNLYPPTVLE
NVPATCDAWS EEVFAPIVIL EPFDSIAEAI DLANSPEYSL HAGIFTNDLE DALDAADRID
AGGVMINDSS DYRFDAMPFG GFKYGSMGRE GVRFAYEDMT QPKVVCINRL KRS
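
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that relationship, assuming the standard amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs in which codons may initiate translation). It does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a CDS read in frame from its first base, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGATGCAGA ACGCGGTCCG CTCAACTCCG"))  # MMQNAVRSTP
```

The output matches the first block of the protein sequence listed above.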