Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0386 |
Symbol | |
ID | 6979101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 398602 |
End bp | 400098 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643395099 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_002279911 |
Protein GI | 209547994 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.433719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0679471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGAGA TCGGTCATTT CATCGGCGGC AAGCAGGTTG CCGGCACCAG CGGCCGCGTG AGCAATGTCT ATAATCCGGC AACAGGCGAA GTGCAGGCGA CGGTCGCGCT CGCAAGCGTC GAGGAACTGC GCGCCGCTGT CGAAAACGCC AAGGCTGCGC AGCCGAAATG GGCTGCCACC AATCCGCAGC GCCGCGCTCG CGTCTTCTTC AAATTCGTCG AACTCCTGAA CAAGCACATG GATGAACTTG CCGAAATGCT CTCCAAGGAG CACGGCAAGA CGATCGAAGA CGCCAAGGGT GACGTCATCC GCGGCCTCGA AGTCTGCGAA TTCGTCTGCG GCATTCCGCA TCTCGCCAAG GGCGAGTTCA CCGAGGGCGC AGGCCCTGCG ATCGATATGT ATTCGATCCG CCAGCCGGTC GGCATCGGCG CCGGCATTAC CCCCTTCAAT TTCCCCGGCA TGATCCCGAT GTGGATGTTT GCTCCGGCGA TCGCCTGCGG CAATGCCTTC ATCCTAAAGC CTTCCGAGCG TGATCCTTCC CTGCCGATCC GCCTTGCCGA ACTGATGATC GAGGCCGGTC TTCCGGCCGG CATCCTCAAT GTTGTCAACG GCGACAAGGG TGCCGTCGAC GCGATCCTTA CCGATCCCGA TATCGGCGCT GTTTCCTTCG TCGGCTCGAC GCCGATCGCC CGTTACGTCT ACGGTACGGC GGCGATGAAC GGCAAGCGTG CCCAGTGCTT CGGCGGCGCC AAGAACCACA TGATCATCAT GCCGGATGCC GATATGGATC AGGCCGTCAA CGCGCTAATG GGCGCAGGCT ACGGTTCGGC TGGCGAACGC TGCATGGCGA TCTCGGTTGC GGTTCCCGTT GGCGAGGACA CCGCCAACCG CCTCGTCGAA AAGCTGATCC CGAAGATCGA ATCGCTGCGC ATCGGCCCCT ACACAGACGA CCAGGCCGAC ATGGGCCCGC TCGTCACCAA GGACGCCTAT ACCCGCGTTC GCGGCCTGAT CGACCGCGGC GTCGAGGAGG GCGCCAAGCT CCTCGTCGAC GGCCGCGACT TCAAGCTCCA GGGTTATGAA GACGGCTATT TCGTCGGCGG CTGCCTGTTC GATCACGTCA CGCCTGAGAT GGATATCTAC AAGACCGAGA TCTTCGGACC CGTCCTTTCC GTCGTTCGCG CCCAGAATTA CGAAGAGGCG CTGTCGCTGC CGATGAAGCA CGAATACGGC AACGGGGTTG CGATCTACAC CCGTGACGGC GATGCCGCCC GCGATTTCGC CTCGCGCATC AATATCGGCA TGATCGGCAT CAACGTTCCG ATCCCGGTTC CGCTCGCCTA CCACTCCTTC GGCGGCTGGA AAGCTTCGAG CTTCGGCGAC CTCAACCAGC ACGGCACGGA TTCGATCAAG TTCTGGACGA AGACCAAGAC CGTCACCGCT CGCTGGCCCT CCGGCATCAA GAGCGGCGCT GAATTTGTCA TGCCGACGAT GAAGTGA
|
Protein sequence | MREIGHFIGG KQVAGTSGRV SNVYNPATGE VQATVALASV EELRAAVENA KAAQPKWAAT NPQRRARVFF KFVELLNKHM DELAEMLSKE HGKTIEDAKG DVIRGLEVCE FVCGIPHLAK GEFTEGAGPA IDMYSIRQPV GIGAGITPFN FPGMIPMWMF APAIACGNAF ILKPSERDPS LPIRLAELMI EAGLPAGILN VVNGDKGAVD AILTDPDIGA VSFVGSTPIA RYVYGTAAMN GKRAQCFGGA KNHMIIMPDA DMDQAVNALM GAGYGSAGER CMAISVAVPV GEDTANRLVE KLIPKIESLR IGPYTDDQAD MGPLVTKDAY TRVRGLIDRG VEEGAKLLVD GRDFKLQGYE DGYFVGGCLF DHVTPEMDIY KTEIFGPVLS VVRAQNYEEA LSLPMKHEYG NGVAIYTRDG DAARDFASRI NIGMIGINVP IPVPLAYHSF GGWKASSFGD LNQHGTDSIK FWTKTKTVTA RWPSGIKSGA EFVMPTMK
|
| |