Gene Information |
Locus tag | Rleg2_4443 |
Symbol | |
ID | 6977537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 76375 |
End bp | 77826 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393621 |
Product | succinic semialdehyde dehydrogenase |
Protein accession | YP_002278439 |
Protein GI | 209546521 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
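The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here for illustration and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(76375, 77826)
print(length, protein_length_aa(length))  # 1452 483
```

Under these assumptions the result matches the Gene Length (1452 bp) and Protein Length (483 aa) fields.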
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.094172 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTGA AAGACGCAAC ATTGTTCCGG CAGGCCGCAT TGGTCGGCGG CGATTGGATC GAGGCGGGGG ACAATGGGAT CGCGGTTGAT AATCCCGCGA CCGGCGAGAT CATCGGCCGC GTTCCCAATC TCGGCGCAGC CGAGACCAAG GCGGCGATTG CCGCAGCCGA GATTGCGCAG AAGGAATGGG CCGCCCGCAC CGCCAAGGAA CGGTCGGTCA TCCTGCGCCG CTGGTTCCAG CTGATGATGG ACAATCAGGA CGATCTCGGC CGCATCCTGA CGGCAGAACA GGGCAAGCCG CTTGCCGAGG CCAAAGGCGA GATCGCCTAT GGCGCAAGCT TCATCGAATG GTTTGCCGAG GAGGCGCGGC GCGTCTATGG CGACATCGTT CCCGGCCATC AGAAGGATAA GCGCATCCTG GTGATGAAGC AGCCGATCGG CGTCGTTGCC GCCATCACCC CGTGGAATTT CCCCAATGCG ATGATCACCC GCAAGGCTGG ACCCGCCTTT GCCGCCGGCT GCGCCATGGT GCTGAAGCCG GCCTCGCAGA CGCCGTTTTC GGCGATCGCG ATCGCCATCC TCGCCGAGCG GGCCGGTTTC CCCAAGGGCC TGTTCAGCGT TCTCACCGGT TCGGCCCGCG CAATCGGCGG CGAGATGACC GCAAGCTCCG TCGTGCGCAA GCTGACCTTT ACCGGCTCGA CCGAAGTCGG CGCCGAGCTC TACCGGCAGA GTGCCCCGAC CATCAAGAAG CTCGGGCTGG AACTCGGCGG CAATGCACCC TTCATCGTCT TCGACGACGC CGATCTCGAT GCGGCCGTGG AAGGCGCGCT GATCGCCAAA TTCCGCAACA ATGGCCAGAC CTGCGTCTGC GCCAACCGCC TCTATGTGCA GGAGGGCGTC TATGACGCTT TTGCCGAGAA GCTGTCGAAG GCCGTCGGCG CGTTGAAGAC CGGCAACGGT TTTGACGAGG GCATCAATCT CGGCCCGCTG ATCGACGAGT CCGCCCTTGC CAAGGTCGAG GAGCATGTCG CCGATGCGCT GTCCAAGGGC GGTCGTGTCG TTGCCGGCGG CCACCGCCAC CCACTCGGCG GACGCTTCTA CGAAGCGACC GTCCTGGCCG ACGTTACCCC TGCCATGGCT GTCGCCAAGG AAGAGACCTT CGGGCCGGTG GCGCCGCTCT TCCGCTTCAA GGACGAAGCC GATGTAATCG CCCAGGCCAA CGACACCGAG TTCGGTCTTG CCTCCTATTT CTACGCCAAG GATCTCGCCC GGGTCTTCCG GGTCGCCGAG GCGCTGGAAT ACGGCATGGT TGGCGTCAAT ACCGGGCTGA TCTCGACGGC CGAAGCCCCC TTCGGCGGTG TCAAACTCTC CGGCCTCGGC CGCGAAGGCT CGAAATACGG CATCGAGGAA TTCACCGAAA TCAAATATGT CTGCCTCGGC GGCATCGCCT GA
|
Protein sequence | MELKDATLFR QAALVGGDWI EAGDNGIAVD NPATGEIIGR VPNLGAAETK AAIAAAEIAQ KEWAARTAKE RSVILRRWFQ LMMDNQDDLG RILTAEQGKP LAEAKGEIAY GASFIEWFAE EARRVYGDIV PGHQKDKRIL VMKQPIGVVA AITPWNFPNA MITRKAGPAF AAGCAMVLKP ASQTPFSAIA IAILAERAGF PKGLFSVLTG SARAIGGEMT ASSVVRKLTF TGSTEVGAEL YRQSAPTIKK LGLELGGNAP FIVFDDADLD AAVEGALIAK FRNNGQTCVC ANRLYVQEGV YDAFAEKLSK AVGALKTGNG FDEGINLGPL IDESALAKVE EHVADALSKG GRVVAGGHRH PLGGRFYEAT VLADVTPAMA VAKEETFGPV APLFRFKDEA DVIAQANDTE FGLASYFYAK DLARVFRVAE ALEYGMVGVN TGLISTAEAP FGGVKLSGLG REGSKYGIEE FTEIKYVCLG GIA
|
| |
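The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-content calculation and a stop-codon check. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop codons listed are the standard ones used by translation table 11. The sample below is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in stops


# First two blocks of the Gene sequence field, used only as a small sample.
sample = "ATGGAATTGA AAGACGCAAC"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))  # 20 0.4
```

Applying the same functions to the complete Gene sequence field should give a value comparable to the GC content field, although the record does not state how that figure was rounded.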