Gene Rleg_4355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4355 
Symbol 
ID8015130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4481318 
End bp4482799 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content65% 
IMG OID644826931 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_002978134 
Protein GI241207038 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00805728 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTTTCA CCAGCGCACT GACCAAGCAC GTTCCCTTCT CCTCGCCCCT GCTGCGCGAT 
GCCGGCTATA TCGACGGCGT CTGGACATCA GGCGATGCCA CTCGGACTTT CGACGTGCTG
AACCCGGCAA CCGGCGAGTT GCTCGCCTCA CTGCCCGATA TGGGCGCGGC CGAGACGCGG
ACGGCAATCG ATGCGGCCCA TGCCGCCCAG CCGGGCTGGG CGGCCCGTCC GGCCAAGGAG
CGCAGCACGA TCCTGCGCAA ATGGTTCGAC CTGATGGTCG CCAATGCCGA CGAACTCGCG
GCGATCCTGA CCGCCGAAAT GGGCAAGCCG TTCCCGGAAG CGCGCGGCGA GATCCTTTAT
GCCGCGGCCT ATATCGAATG GTATGCGGAA GAGGCCAAAC GCATCTATGG CGAGACGATC
CCCGCGCCTT CCGACGATAA ACGGATGATC GTCATCCGGC AGCCAGTCGG CGTCGTCGGT
ACGATCACGC CGTGGAACTT CCCGGCGGCG ATGATCACCC GCAAGATCGC CCCGGCGCTT
GCCGTCGGCT GCACCGTGGT CTCGAAGCCG GCCGAACAGA CGCCGCTGAC GGCGATCGCG
CTTGCCGTGC TCGCCGAGCA GGCCGGCATT CCGGCCGGCG TCTTCAACGT CATCGTCGGC
GTGGATGGCC CGGCGATCGG CCGCGAACTC TGCGGCAATG AAAAGGTGCG CAAGATCAGC
TTCACCGGCT CGACCGAGGT CGGCCGTATC CTGATGCGGC AGTGCGCCGA CCAGATCAAG
AAGGTGAGCC TGGAGCTCGG CGGCAACGCG CCCTTCATCG TCTTCGACGA TGCCGATCTC
GACGCTGCCG TCGAAGGCGC GATCGCCTCC AAATACCGCA ATGCCGGCCA GACCTGCGTC
TGCGCCAACC GCCTCTACGT CCAGTCGAAC GTCTATGACG CCTTCGCCGC CAAGCTTGCC
GCCAAGGTCG CCGAGATGTC GGTCGGCGAC GGCTTCAAGC CGGGTGTCGT GATCGGGCCG
CTGATCGACG AGCAAGGCCT TGCCAAGGTG GAGGACCATG TCAGCGACGC GCTTGCCAAG
GGCGCCAAGG TACTGACCGG CGGCAAGCGC ATCGACGGCG CCGGCACCTT CTTCACGCCG
ACGGTCCTGA CAGGCGTTGC GCGCGGCATG AAGGTAGCGC GCGAGGAGAC CTTCGGGCCG
GTGGCGCCGC TCTTTCGCTT CGAGACGGTC GAGGATGTCA TCGCCCAAGC CAATGATACG
GAATTCGGCC TCGCCGCCTA TTTCTACGCC GGCGACCTGA AGAAGGTCTG GCGGGTGGCG
GAAGCGCTGG AATACGGCAT GATCGGCATC AATACCGGCC TGATGTCATC CGAGACGGCA
CCCTTCGGCG GCATCAAGCA ATCCGGCCTC GGCCGCGAGG GCTCGCGGCA CGGCGCCGAC
GACTATCTGG AAATGAAATA TCTCTGCATC GGCGGCGTCT GA
 
Protein sequence
MAFTSALTKH VPFSSPLLRD AGYIDGVWTS GDATRTFDVL NPATGELLAS LPDMGAAETR 
TAIDAAHAAQ PGWAARPAKE RSTILRKWFD LMVANADELA AILTAEMGKP FPEARGEILY
AAAYIEWYAE EAKRIYGETI PAPSDDKRMI VIRQPVGVVG TITPWNFPAA MITRKIAPAL
AVGCTVVSKP AEQTPLTAIA LAVLAEQAGI PAGVFNVIVG VDGPAIGREL CGNEKVRKIS
FTGSTEVGRI LMRQCADQIK KVSLELGGNA PFIVFDDADL DAAVEGAIAS KYRNAGQTCV
CANRLYVQSN VYDAFAAKLA AKVAEMSVGD GFKPGVVIGP LIDEQGLAKV EDHVSDALAK
GAKVLTGGKR IDGAGTFFTP TVLTGVARGM KVAREETFGP VAPLFRFETV EDVIAQANDT
EFGLAAYFYA GDLKKVWRVA EALEYGMIGI NTGLMSSETA PFGGIKQSGL GREGSRHGAD
DYLEMKYLCI GGV