Gene Rleg2_4441 details


Gene Information

Locus tag: Rleg2_4441
Symbol:
ID: 6977535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 73791
End bp: 75308
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 65%
IMG OID: 643393619
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_002278437
Protein GI: 209546519
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
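
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 73791
END_BP = 75308
GENE_LENGTH_BP = 1518
PROTEIN_LENGTH_AA = 505


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons hold for this record.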


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.411822
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAC TGCAGGAAAA CATCGCGAAG GCGGAAACCT ACCTTGCCGG CTTCAAGGAG 
CGCGGCGTCC TCAACCGCAT CGGCGGCGAG GATGTCGGAG GGGGTGATGG CGCGACCTTC
GAGACCCTCT CGCCGGTCGA TCTGAAGCCG CTCGCCACCG TCGCCCGCGG CAATGCCGCC
GATATCGACC GTGCGGCCAA GGTTGCCAAA TCAGCCTTTG GCGAATGGGC CGCCATGCCG
GGCGATGCAC GCAAGAAGCT GCTGCACAGA ATAGCCGACG CGATCGTCGC GCGCGCCGAA
GAGATCGCCT TTGTCGAATG CATGGACACC GGACAGTCGC TGAAGTTCAT GGCGAAGGCG
GCGCTGCGCG GTGCGGAAAA CTTCCGCTTC TTCGCCGATC GCGCGCCGGA GGCCCGGGAC
GGCAAGGCGC TGCGCGCCGA CGGCCAGGTG AACCTGACGA CGCGTGTGCC GATCGGCCCG
GTCGGCATCA TCACGCCGTG GAACACACCC TTCATGCTGT CGACCTGGAA GATCGCGCCC
GCCCTTGCCG CCGGCTGCAC CATCGTCCAC AAGCCGGCCG AGTTCTCGCC GCTGACGGCG
CGACTGCTGG TCGAGATCGC CGAAGAGGCC GGTCTGCCCA AGGGTGTATG GAACCTCGTC
AACGGCTTCG GCGAGGATGC CGGCAAGGCG CTGACCGAAC ATCCGCTGAT CAAGGCGATC
GGCTTCGTCG GCGAGAGCCG CACCGGCTCG ATGATCATGA AACAGGGCGC CGACACGCTG
AAGCGGGTGC ATTTTGAGCT CGGCGGCAAG AACCCGGTCA TCGTCTTTGC CGATGCCGAT
CTTGAGCGTG CCGCCGACGC CGCCGTCTTC ATGATCTATT CGCTGAACGG CGAGCGCTGC
ACCTCGTCCT CGCGCCTGCT GGTCGAAGAC AGCGTCTATG ACAGGTTCAC CGCACTCGTC
GCCGAAAAGG CCAGACGCAT CAAGGTCGGC CACCCCCTCG ATCCCGAGAC GGTCATCGGC
CCGCTCATCC ATCCCGTGCA CGAAAAGAAG GTGCTGGAAT ATATCGCGAT CGGCCGCTCC
GAGGGCGCGA CGCTTGCTGC CGGCGGCGAG AAGTTCGATG GCCCGGGCGG CGGCTGCTAT
GTCTCCCCCA CCCTCTTTAC CGGCGCCGAC AATAAGATGC GCATCGCCCA GGAAGAGATC
TTCGGGCCGG TGCTGACGGC CATCCCCTTC AAGGACGAAG CCGATGCGCT GGCGCTGGCC
AACGACGTCC AGTACGGGCT CACCGGTTAT CTCTGGACCT CCGACGTCAC CCGCGCCTTC
CGTTTCACCG ACCATCTCGA TGCCGGGATG ATCTGGGTGA ACTCGGAAAA CGTCCGCCAC
CTGCCGACGC CTTTCGGCGG CGTCAAGAAC TCCGGCATCG GCCGCGACGG CGGCGACTGG
TCCTTCGATT TCTACATGGA AACCAAGAAC GTCGCCTTCG CCACCAAGCC ACACGCCATC
CAGAAACTCG GCGGCTGA
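
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip the whitespace and compute a GC fraction that can be compared with the GC content field of this record. The helper names are hypothetical, and the rounding used by the database for its reported percentage is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence in this record, used as a short example.
    first_line = "ATGTCCAAAC TGCAGGAAAA CATCGCGAAG"
    print(len(clean_sequence(first_line)))
    print(round(gc_fraction(first_line), 3))
```

Pasting the full gene sequence into `gc_fraction` gives the value for the whole gene.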
 
Protein sequence
MSKLQENIAK AETYLAGFKE RGVLNRIGGE DVGGGDGATF ETLSPVDLKP LATVARGNAA 
DIDRAAKVAK SAFGEWAAMP GDARKKLLHR IADAIVARAE EIAFVECMDT GQSLKFMAKA
ALRGAENFRF FADRAPEARD GKALRADGQV NLTTRVPIGP VGIITPWNTP FMLSTWKIAP
ALAAGCTIVH KPAEFSPLTA RLLVEIAEEA GLPKGVWNLV NGFGEDAGKA LTEHPLIKAI
GFVGESRTGS MIMKQGADTL KRVHFELGGK NPVIVFADAD LERAADAAVF MIYSLNGERC
TSSSRLLVED SVYDRFTALV AEKARRIKVG HPLDPETVIG PLIHPVHEKK VLEYIAIGRS
EGATLAAGGE KFDGPGGGCY VSPTLFTGAD NKMRIAQEEI FGPVLTAIPF KDEADALALA
NDVQYGLTGY LWTSDVTRAF RFTDHLDAGM IWVNSENVRH LPTPFGGVKN SGIGRDGGDW
SFDFYMETKN VAFATKPHAI QKLGG