Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4441 |
Symbol | |
ID | 6977535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 73791 |
End bp | 75308 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643393619 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_002278437 |
Protein GI | 209546519 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.411822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAAC TGCAGGAAAA CATCGCGAAG GCGGAAACCT ACCTTGCCGG CTTCAAGGAG CGCGGCGTCC TCAACCGCAT CGGCGGCGAG GATGTCGGAG GGGGTGATGG CGCGACCTTC GAGACCCTCT CGCCGGTCGA TCTGAAGCCG CTCGCCACCG TCGCCCGCGG CAATGCCGCC GATATCGACC GTGCGGCCAA GGTTGCCAAA TCAGCCTTTG GCGAATGGGC CGCCATGCCG GGCGATGCAC GCAAGAAGCT GCTGCACAGA ATAGCCGACG CGATCGTCGC GCGCGCCGAA GAGATCGCCT TTGTCGAATG CATGGACACC GGACAGTCGC TGAAGTTCAT GGCGAAGGCG GCGCTGCGCG GTGCGGAAAA CTTCCGCTTC TTCGCCGATC GCGCGCCGGA GGCCCGGGAC GGCAAGGCGC TGCGCGCCGA CGGCCAGGTG AACCTGACGA CGCGTGTGCC GATCGGCCCG GTCGGCATCA TCACGCCGTG GAACACACCC TTCATGCTGT CGACCTGGAA GATCGCGCCC GCCCTTGCCG CCGGCTGCAC CATCGTCCAC AAGCCGGCCG AGTTCTCGCC GCTGACGGCG CGACTGCTGG TCGAGATCGC CGAAGAGGCC GGTCTGCCCA AGGGTGTATG GAACCTCGTC AACGGCTTCG GCGAGGATGC CGGCAAGGCG CTGACCGAAC ATCCGCTGAT CAAGGCGATC GGCTTCGTCG GCGAGAGCCG CACCGGCTCG ATGATCATGA AACAGGGCGC CGACACGCTG AAGCGGGTGC ATTTTGAGCT CGGCGGCAAG AACCCGGTCA TCGTCTTTGC CGATGCCGAT CTTGAGCGTG CCGCCGACGC CGCCGTCTTC ATGATCTATT CGCTGAACGG CGAGCGCTGC ACCTCGTCCT CGCGCCTGCT GGTCGAAGAC AGCGTCTATG ACAGGTTCAC CGCACTCGTC GCCGAAAAGG CCAGACGCAT CAAGGTCGGC CACCCCCTCG ATCCCGAGAC GGTCATCGGC CCGCTCATCC ATCCCGTGCA CGAAAAGAAG GTGCTGGAAT ATATCGCGAT CGGCCGCTCC GAGGGCGCGA CGCTTGCTGC CGGCGGCGAG AAGTTCGATG GCCCGGGCGG CGGCTGCTAT GTCTCCCCCA CCCTCTTTAC CGGCGCCGAC AATAAGATGC GCATCGCCCA GGAAGAGATC TTCGGGCCGG TGCTGACGGC CATCCCCTTC AAGGACGAAG CCGATGCGCT GGCGCTGGCC AACGACGTCC AGTACGGGCT CACCGGTTAT CTCTGGACCT CCGACGTCAC CCGCGCCTTC CGTTTCACCG ACCATCTCGA TGCCGGGATG ATCTGGGTGA ACTCGGAAAA CGTCCGCCAC CTGCCGACGC CTTTCGGCGG CGTCAAGAAC TCCGGCATCG GCCGCGACGG CGGCGACTGG TCCTTCGATT TCTACATGGA AACCAAGAAC GTCGCCTTCG CCACCAAGCC ACACGCCATC CAGAAACTCG GCGGCTGA
|
Protein sequence | MSKLQENIAK AETYLAGFKE RGVLNRIGGE DVGGGDGATF ETLSPVDLKP LATVARGNAA DIDRAAKVAK SAFGEWAAMP GDARKKLLHR IADAIVARAE EIAFVECMDT GQSLKFMAKA ALRGAENFRF FADRAPEARD GKALRADGQV NLTTRVPIGP VGIITPWNTP FMLSTWKIAP ALAAGCTIVH KPAEFSPLTA RLLVEIAEEA GLPKGVWNLV NGFGEDAGKA LTEHPLIKAI GFVGESRTGS MIMKQGADTL KRVHFELGGK NPVIVFADAD LERAADAAVF MIYSLNGERC TSSSRLLVED SVYDRFTALV AEKARRIKVG HPLDPETVIG PLIHPVHEKK VLEYIAIGRS EGATLAAGGE KFDGPGGGCY VSPTLFTGAD NKMRIAQEEI FGPVLTAIPF KDEADALALA NDVQYGLTGY LWTSDVTRAF RFTDHLDAGM IWVNSENVRH LPTPFGGVKN SGIGRDGGDW SFDFYMETKN VAFATKPHAI QKLGG
|
| |