Gene Rleg2_4998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4998 
Symbol 
ID6978092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp643862 
End bp645331 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content63% 
IMG OID643394144 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_002278962 
Protein GI209547044 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAC TCAAAGACCC CACGTTATTC ATCCAACAGG CGCTGATCGG CGGTGACTGG 
GCCGATGCCG CCGAAGGTGC GACGCTGACC GTCGACAACC CTGCGACCGG TGCCACGATC
GGCACGGTGC CGAACTGCAG CGCTGAGGAT ACGGAGCGGG CGATTGCGGC CGCTGCGGAT
GCGTTCAGGA CATGGAGGCT GACGACTGCT TCCGAGCGGG CGCGGCTGCT GCAGCGCTGG
CATGACCTCA TGATCGAGAA TGCTGACGAT CTCGCTCTGA TCATGACCTT GGAACAAGGC
AAGCCGCTTT CCGAGGCGCG AGGCGAAGTC CTTTATGGCG CGACCTTTAT CAAGTGGTTT
GCCGAGGAGG CGCGCCGCAG CTACGGCATG ACAATCCCCG CTCCCACGAC CGATCGCCGT
ATCCTGGTCA GCAAGGAGCC GGTCGGGGTT GCCGCGGTCA TCACACCGTG GAATTTTCCG
AATGCGATGA TCACGCGAAA ATGCGCGCCG GCGCTTGCGG CAGGCTGCAC CGTCGTGGTC
AAGCCATCGG AACTGACGCC CTATTCAGCA TTGGCGCTGG GCCTGCTGGC CGAGCGCGCC
GGCATCCCGG CCGGTGTCAT CAACATCGTC ACCGGCCTGC CCAAGACGAT TGGTGCAACG
CTGACGGCAA GCCCCGATGT CCGCAAGGTA TCCTTCACCG GCTCGACGGC CGTGGGTTCG
CTGCTGATGG CACAATGCGC ACCGACCGTG AAGCGGCTCA GCCTCGAACT CGGCGGCAAT
GCGCCCTTCA TCGTCTTCGA CGATGCGGAT CTCGATGAGG CGGTCGAGGC GGCTCTCGTC
TCGAAATTCC GCAATGGTGG CCAGACCTGT GTCTGCGCCA ACAGGCTGCT CGTTCAGGCC
GGCATCTATG ATACTTTTGC CGGAAAGCTC GCGGCGCGCG TCAGCGCCAT GCGGATCGGC
TCGGGCGTGG ATGATGGCAT CCAGATCGGT CCGATGATCA ATGTCGCGGC CGTCGGCAAG
ATCAGCGCTC ACATCGAGGA TGCGCTAGAG CTGGGCGCCA GAAATATCAC ACCGCGAAAC
GACCTGCCGG CAGGTGAGCG TTTCGTTGCG CCGACCGTGC TCACAGGTGC GACCACCGCC
ATGCGGCTTG CAAGCGAAGA GACCTTCGGT CCGGTTGCGC CGCTGTTCCG CTTTGAGACC
GAGGAGGAGG CGATCGCGAT CGCCAATGCA ACGCCCTATG GTCTGGCCGC CTATTTCTTC
ACCGAGAACT TGCACCGCGC CTGGCGCGTT GGCGAGGCGT TGGAATTCGG AATGGTCGGT
CTCAATACGG GCAGCGTATC GATGGAAGTG GCGCCTTTTG GCGGTGTCAA GCAGTCCGGC
CTCGGCCGCG AGGGAGGTCC GACCGGAATG GAGGAATACC TGGAGGTAAA GGCCTTCCAT
CTCGGTGGCC TGAAGGCGCA GCGGGTCTGA
 
Protein sequence
MLELKDPTLF IQQALIGGDW ADAAEGATLT VDNPATGATI GTVPNCSAED TERAIAAAAD 
AFRTWRLTTA SERARLLQRW HDLMIENADD LALIMTLEQG KPLSEARGEV LYGATFIKWF
AEEARRSYGM TIPAPTTDRR ILVSKEPVGV AAVITPWNFP NAMITRKCAP ALAAGCTVVV
KPSELTPYSA LALGLLAERA GIPAGVINIV TGLPKTIGAT LTASPDVRKV SFTGSTAVGS
LLMAQCAPTV KRLSLELGGN APFIVFDDAD LDEAVEAALV SKFRNGGQTC VCANRLLVQA
GIYDTFAGKL AARVSAMRIG SGVDDGIQIG PMINVAAVGK ISAHIEDALE LGARNITPRN
DLPAGERFVA PTVLTGATTA MRLASEETFG PVAPLFRFET EEEAIAIANA TPYGLAAYFF
TENLHRAWRV GEALEFGMVG LNTGSVSMEV APFGGVKQSG LGREGGPTGM EEYLEVKAFH
LGGLKAQRV