Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0419 |
Symbol | |
ID | 8011621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 435646 |
End bp | 437142 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823015 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_002974269 |
Protein GI | 241203173 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.214339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00208567 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGAGA TCGGTCATTT CATCGGCGGC AAACATGTTG CCGGCACCAG CGGCCGCGTG AGCAATGTCT ACAATCCGGC GACGGGCGAA GTGCAGGCGA CGGTCGCACT CGCAAGCGTC GAGGAACTGC GCGCCGCCGT CGAAAACGCC AAGGCTGCGC AGCCGAAATG GGCTGCCACC AATCCGCAGC GCCGCGCCCG CGTCTTCTTC AAATTCGTCG AACTCCTGAA CAAGCACATG GACGAGCTTG CCGAAATCCT CTCCAAGGAA CACGGCAAGA CGATCGAGGA TGCCAAGGGC GACGTCATCC GCGGCCTCGA AGTCTGCGAA TTCGTCTGCG GCATCCCGCA TCTCGCCAAG GGCGAATTCA CCGAGGGCGC AGGCCCGGCG ATCGACATGT ATTCGATCCG CCAGCCGGTC GGCATCGGCG CCGGCATCAC GCCTTTCAAC TTCCCCGGCA TGATCCCGAT GTGGATGTTT GCGCCGGCGA TCGCCTGCGG CAACGCCTTC ATCCTGAAGC CCTCCGAGCG TGATCCCTCC CTGCCGATCC GTCTCGGTGA ACTGATGATC GAGGCCGGCC TGCCCGCCGG CATCCTCAAC GTCGTCAATG GCGACAAGGG TGCTGTCGAC GCGATCCTCA CCGATCCCGA TATCGGCGCC GTCTCCTTCG TCGGCTCGAC GCCGATCGCC CGCTACGTCT ACGGCACCGC GGCGATGAAC GGCAAGCGCG CCCAGTGCTT CGGCGGCGCC AAGAACCACA TGATCATCAT GCCGGATGCG GACCTGGATC AGGCCGTCAA CGCGCTGATG GGCGCAGGCT ACGGTTCGGC CGGCGAGCGC TGCATGGCGA TCTCGGTTGC CGTTCCGGTC GGCGAGGAGA CTGCCAACCG CCTCGTCGAG AAGCTGACGC CGAAGATCGA ATCCCTGCGT ATCGGCCCCT ATACCGACGA CAAGGCCGAC ATGGGCCCGC TCGTCACCAA GGAAGCCTAT ACCCGTGTTC GCGGCCTGAT CGACCGCGGC ATCGAGGAAG GCGCCAAGCT CGTCGTCGAC GGCCGCGATT TCAAACTCCA GGGCTATGAA GACGGCTATT TCGTCGGCGG CTGCCTGTTC GATCACGTCA CGCCGGAGAT GGATATCTAC AAGACAGAGA TCTTCGGACC TGTCCTCTCC GTCGTTCGCG CCAACAACTA TGAGGAAGCG CTGTCGTTGC CGATGAAGCA CGAATACGGC AACGGCGTTG CGATCTACAC CCGCGACGGC GATGCCGCCC GCGATTTTGC CTCGCGCATC AATATCGGCA TGATCGGCAT CAACGTTCCG ATCCCGGTTC CGCTCGCCTA CCACTCCTTC GGCGGCTGGA AGGCCTCGAG CTTCGGCGAC CTCAACCAGC ACGGCACGGA TTCGATCAAG TTCTGGACGA AGACCAAGAC CGTCACTGCT CGTTGGCCCT CCGGCATCAA AAGCGGCGCG GAATTCGTCA TGCCGACGAT GAAGTGA
|
Protein sequence | MREIGHFIGG KHVAGTSGRV SNVYNPATGE VQATVALASV EELRAAVENA KAAQPKWAAT NPQRRARVFF KFVELLNKHM DELAEILSKE HGKTIEDAKG DVIRGLEVCE FVCGIPHLAK GEFTEGAGPA IDMYSIRQPV GIGAGITPFN FPGMIPMWMF APAIACGNAF ILKPSERDPS LPIRLGELMI EAGLPAGILN VVNGDKGAVD AILTDPDIGA VSFVGSTPIA RYVYGTAAMN GKRAQCFGGA KNHMIIMPDA DLDQAVNALM GAGYGSAGER CMAISVAVPV GEETANRLVE KLTPKIESLR IGPYTDDKAD MGPLVTKEAY TRVRGLIDRG IEEGAKLVVD GRDFKLQGYE DGYFVGGCLF DHVTPEMDIY KTEIFGPVLS VVRANNYEEA LSLPMKHEYG NGVAIYTRDG DAARDFASRI NIGMIGINVP IPVPLAYHSF GGWKASSFGD LNQHGTDSIK FWTKTKTVTA RWPSGIKSGA EFVMPTMK
|
| |