Gene Rleg2_4025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4025 
Symbol 
ID6982795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4197499 
End bp4198980 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content65% 
IMG OID643398754 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_002283513 
Protein GI209551596 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.577209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTCA CCACCGCGCT GACCAGACAC GTTCCCTTCT CTTCACCGTT CCTGCGCGCT 
GCCGGCTATA TCAACGGCGT CTGGACATCG GGCGATGCCG CCGGGACTTT CGACGTGCTC
AACCCGGCAA CCGGCGAGCT GCTTGCCTCG CTGCCCGATA TGGGGGCTGC CGAGACGCGG
GCGGCGATCG ATGCGGCCTA TGCCGCCCAA CCGGCCTGGG CCGCCCGCCC GGCCAAGGAG
CGCAGCGTCA TCCTGCGCAA ATGGTTCGAC CTGATGGTCG CCAATGCCGA CGAACTCGCC
GCGATCCTGA CCGCCGAAAT GGGCAAGCCC TTCGCCGAAG CGCGCGGCGA GATCCTTTAT
GCCGCTGCCT ATATCGAATG GTATGCTGAG GAGGCAAAGC GGATCTATGG CGAGACGATC
CCCGCGCCGT CCAATGACAA GCGCATGATC GTCATCAAGC AGCCGGTCGG CGTCGTCGGC
ACGATCACGC CATGGAATTT CCCGGCGGCG ATGATTGCCC GCAAGATTGC GCCGGCGCTG
GCCGTCGGCT GCACGGTCGT GTCGAAACCG GCCGAACAGA CGCCGTTGAC GGCGATCGCC
CTTGCCGTGC TTGCCGAACA GGCCGGCATC CCCGCCGGCG TCTTCAACCT CATCGTCGGC
ATCGACGGCC CGGCGATCGG CCGCGAGCTC TGCGGCAATG ACAAGGTGCG CAAGATCAGT
TTCACCGGCT CGACGGAAGT CGGCCGCATC CTGATGCGGC AGTGCGCCGA TCAGATCAAG
AAGGTGAGCC TTGAGCTCGG CGGCAATGCG CCGTTCATCG TCTTCGACGA TGCCGATCTC
GACGCCGCCG TCGAAGGCGC GATCGCCTCC AAATACCGCA ATGCCGGCCA GACCTGCGTT
TGCGCCAACC GTCTCTACAT CCAGTCGGGC GTCTATGACG CCTTCGCGGC CAAGCTTGCC
GCCAAGGTCG CCGCAATGTC GGTCGGCGAC GGCTTCCAGC CGGGTGTCGA GATCGGGCCG
CTGATCGACG AACAGGGCCT TGCCAAGGTG GAAGACCATG TCGGTGACGC GCTTGCCAAG
GGCGCCAAGG TGCTGACCGG CGGCAAGCGC ATCGACGGCG CCGGCACTTT CTTTGCGCCG
ACGGTGCTGA CCGGCGTTGC CCGCGGCATG AAGGTGGCGC GCGAGGAAAC CTTCGGGCCG
GTGGCGCCGC TTTTCCGCTT CGAGACGGCC GAAGATGTCA TCGCTCAAGC CAATGATACG
GAATTCGGCC TTGCCGCCTA TTTCTATGCC GGCGACCTGA AGAAGGTCTG GCGGGTGGCG
GAAGCGTTGG AATACGGCAT GATCGGCATC AATACCGGCA TCATGTCGTC CGAGACGGCG
CCTTTCGGCG GCATCAAACA ATCCGGCCTC GGCCGCGAGG GCTCGCGCCA CGGCGCCGAC
GATTATCTCG AAATGAAATA TCTCTGCATC GGCGGCGTCT GA
 
Protein sequence
MAFTTALTRH VPFSSPFLRA AGYINGVWTS GDAAGTFDVL NPATGELLAS LPDMGAAETR 
AAIDAAYAAQ PAWAARPAKE RSVILRKWFD LMVANADELA AILTAEMGKP FAEARGEILY
AAAYIEWYAE EAKRIYGETI PAPSNDKRMI VIKQPVGVVG TITPWNFPAA MIARKIAPAL
AVGCTVVSKP AEQTPLTAIA LAVLAEQAGI PAGVFNLIVG IDGPAIGREL CGNDKVRKIS
FTGSTEVGRI LMRQCADQIK KVSLELGGNA PFIVFDDADL DAAVEGAIAS KYRNAGQTCV
CANRLYIQSG VYDAFAAKLA AKVAAMSVGD GFQPGVEIGP LIDEQGLAKV EDHVGDALAK
GAKVLTGGKR IDGAGTFFAP TVLTGVARGM KVAREETFGP VAPLFRFETA EDVIAQANDT
EFGLAAYFYA GDLKKVWRVA EALEYGMIGI NTGIMSSETA PFGGIKQSGL GREGSRHGAD
DYLEMKYLCI GGV