Gene Rleg_5232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5232 
Symbol 
ID8007406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp643434 
End bp644930 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content65% 
IMG OID644822140 
ProductSuccinate-semialdehyde dehydrogenase 
Protein accessionYP_002973400 
Protein GI241113565 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.88595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.882474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG TTTTTGCCCG CCCCGCCTAT CATGACGCGC TGTCGCGGCT CGCCGACCGT 
CATCTCCTGC GCGATCTGGC CTATGTCGGC GGCCGGTGGA TCGCCGGCAA ATCAGGGAAA
AGTTTCGAGG TCACCGATCC CGCCTCTTCG GCGACGCTGG CCTGGGTTGC TAGCCTTGAC
GCCGATGAGA CGGCAGTGGC GATCGATGCT GCGTCGGAGG CTTTTGCCGG CTGGCGCGCA
ATGCTGCCGC AGAGCCGCGC GGCGATCCTG CGCAAATGGT TTGAGCTGAT GCTTGCGGCC
AAGGAGGATC TGGCGCTGAT CATGACGCTC GAACAGGGCA AGCCGCTTGC GGAATCGCGC
GGCGAGATCG ATTACGCCGC CTCCTTCGTC GAATGGTATG CCGAGGAAGG CAAACGGCTG
AACGCCGAAA GCGTCACCAG CCATCTGCCC GGCGCGGAAA TGATCGTCCG GCGTGAGGCG
CTCGGCATCG TCGGCATCGT CACGCCCTGG AATTTCCCCT CTGCCATGCT CACCCGGAAG
GCTGCCGCGG CGCTGGCCGC CGGTTGCACG GTCGTCGCCC ACCCCTCCTC AGAAACGCCG
CTTTCGGCAC TGGCGCTTGC CGAGCTCGGC GAGCGGGCAG GCATTCCCAC CGGCGTCTTC
AACGTGGTCA CCGGCAACGC CGCAACGATC GTCGGACGGA TGTGTGCCGA TGTCCGCGTG
CGCGCCATGA GCTTCACCGG CTCCACCGGA ATCGGACGGC TGATCGCCGC CCAATGCGCC
CCGACCCTGA AGCGGCTGGT GATGGAACTC GGCGGCCACG CCCCGCTGAT CATCTTCGAT
GACGCTGATA TCGAAAAGGC GGTCGAGATC GCCGTCAACG CCAAATTTGC CACATCAGGC
CAGGATTGCC TCGCCGCCAA TCGCATTTTC GTCCAGCGGG GGATCGCCGA TGGCTTCGCC
AAGGCCTTCG CAGACCGCAT TGCCGAACTG AAAGTTGGTC CGGGTCTTGA GGATGGCGCC
GAGATCGGGC CGCTCATGCA TGAACGCGCC GTCGCCAAGG TCGAAGAACA GGTCGCCGAC
GCGCTGGCGC GCGGCGCGCG GCTCGTTACC GGCGGCAAGC GCCATAAGGC CGGCCGGCTT
TTTTATGAGC CGACGCTGCT GAGCGACGTG CCGGCGGATG CGCTGATCAT GCACGAGGAG
ACCTTCGGCC CTGTAGCGGC CATCACCGCC TTCGATACGG AAGACGAGGT CATCACTCGC
GCCAACGATA CCGAATACGG CCTTGTCGCC TATATCGTCA CGGAAAACGG CGCCCGGCAG
ATGCGCCTCG GCCGCGCGCT CGAATACGGC ATGGTCGCCG TCAACCGCGT GAAAATCACC
GGCGCTCCCA TTCCCTTCGG CGGCTGGAAG CAGTCCGGCC TCGGCCGCGA GGGTTCACGC
CATGGGCTCG AGGCCTTCAC CGAGCTCAAA TATCTCTGCA TCGACACGGC CGCCTGA
 
Protein sequence
MTAVFARPAY HDALSRLADR HLLRDLAYVG GRWIAGKSGK SFEVTDPASS ATLAWVASLD 
ADETAVAIDA ASEAFAGWRA MLPQSRAAIL RKWFELMLAA KEDLALIMTL EQGKPLAESR
GEIDYAASFV EWYAEEGKRL NAESVTSHLP GAEMIVRREA LGIVGIVTPW NFPSAMLTRK
AAAALAAGCT VVAHPSSETP LSALALAELG ERAGIPTGVF NVVTGNAATI VGRMCADVRV
RAMSFTGSTG IGRLIAAQCA PTLKRLVMEL GGHAPLIIFD DADIEKAVEI AVNAKFATSG
QDCLAANRIF VQRGIADGFA KAFADRIAEL KVGPGLEDGA EIGPLMHERA VAKVEEQVAD
ALARGARLVT GGKRHKAGRL FYEPTLLSDV PADALIMHEE TFGPVAAITA FDTEDEVITR
ANDTEYGLVA YIVTENGARQ MRLGRALEYG MVAVNRVKIT GAPIPFGGWK QSGLGREGSR
HGLEAFTELK YLCIDTAA