Gene Rleg2_4997 details

Gene Information

Locus tag: Rleg2_4997
Symbol:
ID: 6978091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 642384
End bp: 643808
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 63%
IMG OID: 643394143
Product: Aldehyde Dehydrogenase
Protein accession: YP_002278961
Protein GI: 209547043
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
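
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the Gene Information fields above.
start_bp = 642384
end_bp = 643808
gene_length_bp = 1425
protein_length_aa = 474


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions the record is internally consistent: the span covers 1425 bp, which is 475 codons, one of which is the stop codon.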


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.201022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACAT ATCTGAAATT CTACATCGAC GGCGCATGGG TCGATCCTGT CGGCAGCGGG 
CGCCTTGCCG TCGTCGATCC CGCCACTGAG CAGCCTTTCG CCGAAATCGC CATGGGCGCG
GCCGAAGATG CCGAGCGTGC GATCCTGGCC GCGCGCCGGG CCTTCGTGGG TTTTTCGGTC
ACCACGTTCG AAGAACGCCT CGCGCTGCTC GAGCGCATCC TCGATCTCCT GAAGCTGCGC
AACGACGAGA TCGGCGACGT GATCTCGCGC GAAATGGGTG CGCCGCGTCA GATGGCGCGC
GACGAACAGG CCGGCATCGG GGTTGCACAT TTTTACGAGA CGATCCGAGC CATGCGGGAG
TTCCCGTTCG AATATATGCA GGGCACGACC CGCGTCATGC ATGAACCTGT CGGCGTCGTC
GGCATGATCA CGCCTTGGAA CTGGCCGATA AACCAGATCG CTTGCAAGGT AGCGCCGGCG
CTGGCTACCG GCTGCACCAT GGTGCTGAAG CCCTCGGAAA TTGCGCCAGT CAATGCGATC
CTGTTCGCCG AGATCCTGCA TGAGGCGGGC GTACCGAAAG GCGTGTTTAA TCTGGTGAAC
GGCGACGGCC CAACCGTGGG CGCCGTTCTG GCAGCTCATT CCGATGTCGA TATGATTTCC
TTTACCGGCT CGACCCGTGC CGGCATTTCG GTGGCGCGGG CGGCGGCCCC CACCGTCAAG
CGCGTCCATC AGGAGCTTGG CGGCAAGTCA CCCAACATTG TGCTCCGAAG CGCCGATCTC
AATGCAGCCG TCCGCGACGG CGTTGCCAAA TGCTTTGCCA ATTCCGGCCA GTCCTGCAAT
GCCCCGACCC GGCTTTTAGT ACCTGAAGAG AGCATGGATC GGGTTATCGA CGTCGCTCGC
GAGGCGGCCG AGGCGCTCCG TGTCGGGGCG CCGGGCGATG ATGCGACTGA TATTGGGCCG
GTCGCCAGCC GGATACAGTT CGACAAGATC CAGGGTCTGA TCAACAAAGG TGTCGAAGAG
GGTGCCGAAC TTGTGACCGG CGGACCGGGC CGGCCGGCGC ATCTCAATGC CGGCTATTAT
GTCCGGCCCA CGATCTTTGC TCGCGTGACC AACGACATGA CCATTGCCCG AGAGGAAATC
TTCGGGCCGG TCCTGTCGAT CCTGGGCTAT GGCAATGAGG TCGAGGCTGC GCAGATTGCA
AACGACACGC CCTATGGCCT GGCGTCCTAC ATCCAGGGCG AGCCCGCCGA GGCGCGCGCG
TTCGCACGGC GGCTGAGGAC AGGCATTGTC CGCCTCAACA ATTCGGCTTG GGATGGCGCA
GCACCCTTCG GCGGCTACAA GCAGTCCGGC AACGGCCGCG AATACGGCAA GTTCGGCCTG
CACGAATTTA CCGAAATCAA GGGCATCGTC GGCTTCGGCG ACTAA
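
A GC content figure such as the one listed under Gene Information can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming GC content is defined as the share of G and C among the unambiguous bases A, C, G and T; `gc_percent` is a hypothetical helper, and the exact definition and rounding used by the database are not stated in this record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks and any other characters are ignored, so the
    10-base blocks shown above can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Small demonstration on the first 10-base block of the gene sequence.
print(gc_percent("ATGGACACAT"))  # 40.0
```

Passing the full gene sequence to the same function gives the value for the whole gene, which can then be compared with the listed figure.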
 
Protein sequence
MDTYLKFYID GAWVDPVGSG RLAVVDPATE QPFAEIAMGA AEDAERAILA ARRAFVGFSV 
TTFEERLALL ERILDLLKLR NDEIGDVISR EMGAPRQMAR DEQAGIGVAH FYETIRAMRE
FPFEYMQGTT RVMHEPVGVV GMITPWNWPI NQIACKVAPA LATGCTMVLK PSEIAPVNAI
LFAEILHEAG VPKGVFNLVN GDGPTVGAVL AAHSDVDMIS FTGSTRAGIS VARAAAPTVK
RVHQELGGKS PNIVLRSADL NAAVRDGVAK CFANSGQSCN APTRLLVPEE SMDRVIDVAR
EAAEALRVGA PGDDATDIGP VASRIQFDKI QGLINKGVEE GAELVTGGPG RPAHLNAGYY
VRPTIFARVT NDMTIAREEI FGPVLSILGY GNEVEAAQIA NDTPYGLASY IQGEPAEARA
FARRLRTGIV RLNNSAWDGA APFGGYKQSG NGREYGKFGL HEFTEIKGIV GFGD
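
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons, so a standard codon table is enough for a sequence that starts with ATG, as this one does. The following is a minimal sketch; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration, and the function assumes the input is in frame and contains only A, C, G and T.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGACACAT ATCTGAAATT CTACATCGAC"))  # MDTYLKFYID
```

The output matches the first ten residues of the protein sequence. Running the function on the full gene sequence allows the remaining residues to be compared in the same way.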