Gene Rleg2_4443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4443 
Symbol 
ID6977537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp76375 
End bp77826 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content64% 
IMG OID643393621 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_002278439 
Protein GI209546521 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.094172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTGA AAGACGCAAC ATTGTTCCGG CAGGCCGCAT TGGTCGGCGG CGATTGGATC 
GAGGCGGGGG ACAATGGGAT CGCGGTTGAT AATCCCGCGA CCGGCGAGAT CATCGGCCGC
GTTCCCAATC TCGGCGCAGC CGAGACCAAG GCGGCGATTG CCGCAGCCGA GATTGCGCAG
AAGGAATGGG CCGCCCGCAC CGCCAAGGAA CGGTCGGTCA TCCTGCGCCG CTGGTTCCAG
CTGATGATGG ACAATCAGGA CGATCTCGGC CGCATCCTGA CGGCAGAACA GGGCAAGCCG
CTTGCCGAGG CCAAAGGCGA GATCGCCTAT GGCGCAAGCT TCATCGAATG GTTTGCCGAG
GAGGCGCGGC GCGTCTATGG CGACATCGTT CCCGGCCATC AGAAGGATAA GCGCATCCTG
GTGATGAAGC AGCCGATCGG CGTCGTTGCC GCCATCACCC CGTGGAATTT CCCCAATGCG
ATGATCACCC GCAAGGCTGG ACCCGCCTTT GCCGCCGGCT GCGCCATGGT GCTGAAGCCG
GCCTCGCAGA CGCCGTTTTC GGCGATCGCG ATCGCCATCC TCGCCGAGCG GGCCGGTTTC
CCCAAGGGCC TGTTCAGCGT TCTCACCGGT TCGGCCCGCG CAATCGGCGG CGAGATGACC
GCAAGCTCCG TCGTGCGCAA GCTGACCTTT ACCGGCTCGA CCGAAGTCGG CGCCGAGCTC
TACCGGCAGA GTGCCCCGAC CATCAAGAAG CTCGGGCTGG AACTCGGCGG CAATGCACCC
TTCATCGTCT TCGACGACGC CGATCTCGAT GCGGCCGTGG AAGGCGCGCT GATCGCCAAA
TTCCGCAACA ATGGCCAGAC CTGCGTCTGC GCCAACCGCC TCTATGTGCA GGAGGGCGTC
TATGACGCTT TTGCCGAGAA GCTGTCGAAG GCCGTCGGCG CGTTGAAGAC CGGCAACGGT
TTTGACGAGG GCATCAATCT CGGCCCGCTG ATCGACGAGT CCGCCCTTGC CAAGGTCGAG
GAGCATGTCG CCGATGCGCT GTCCAAGGGC GGTCGTGTCG TTGCCGGCGG CCACCGCCAC
CCACTCGGCG GACGCTTCTA CGAAGCGACC GTCCTGGCCG ACGTTACCCC TGCCATGGCT
GTCGCCAAGG AAGAGACCTT CGGGCCGGTG GCGCCGCTCT TCCGCTTCAA GGACGAAGCC
GATGTAATCG CCCAGGCCAA CGACACCGAG TTCGGTCTTG CCTCCTATTT CTACGCCAAG
GATCTCGCCC GGGTCTTCCG GGTCGCCGAG GCGCTGGAAT ACGGCATGGT TGGCGTCAAT
ACCGGGCTGA TCTCGACGGC CGAAGCCCCC TTCGGCGGTG TCAAACTCTC CGGCCTCGGC
CGCGAAGGCT CGAAATACGG CATCGAGGAA TTCACCGAAA TCAAATATGT CTGCCTCGGC
GGCATCGCCT GA
 
Protein sequence
MELKDATLFR QAALVGGDWI EAGDNGIAVD NPATGEIIGR VPNLGAAETK AAIAAAEIAQ 
KEWAARTAKE RSVILRRWFQ LMMDNQDDLG RILTAEQGKP LAEAKGEIAY GASFIEWFAE
EARRVYGDIV PGHQKDKRIL VMKQPIGVVA AITPWNFPNA MITRKAGPAF AAGCAMVLKP
ASQTPFSAIA IAILAERAGF PKGLFSVLTG SARAIGGEMT ASSVVRKLTF TGSTEVGAEL
YRQSAPTIKK LGLELGGNAP FIVFDDADLD AAVEGALIAK FRNNGQTCVC ANRLYVQEGV
YDAFAEKLSK AVGALKTGNG FDEGINLGPL IDESALAKVE EHVADALSKG GRVVAGGHRH
PLGGRFYEAT VLADVTPAMA VAKEETFGPV APLFRFKDEA DVIAQANDTE FGLASYFYAK
DLARVFRVAE ALEYGMVGVN TGLISTAEAP FGGVKLSGLG REGSKYGIEE FTEIKYVCLG
GIA