Gene Rleg_0419 details


Gene Information

Locus tag: Rleg_0419
Symbol:
ID: 8011621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 435646
End bp: 437142
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 63%
IMG OID: 644823015
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_002974269
Protein GI: 241203173
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
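
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 435646
end_bp = 437142

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1497
print(protein_length)  # 498
```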


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.214339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00208567
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTGAGA TCGGTCATTT CATCGGCGGC AAACATGTTG CCGGCACCAG CGGCCGCGTG 
AGCAATGTCT ACAATCCGGC GACGGGCGAA GTGCAGGCGA CGGTCGCACT CGCAAGCGTC
GAGGAACTGC GCGCCGCCGT CGAAAACGCC AAGGCTGCGC AGCCGAAATG GGCTGCCACC
AATCCGCAGC GCCGCGCCCG CGTCTTCTTC AAATTCGTCG AACTCCTGAA CAAGCACATG
GACGAGCTTG CCGAAATCCT CTCCAAGGAA CACGGCAAGA CGATCGAGGA TGCCAAGGGC
GACGTCATCC GCGGCCTCGA AGTCTGCGAA TTCGTCTGCG GCATCCCGCA TCTCGCCAAG
GGCGAATTCA CCGAGGGCGC AGGCCCGGCG ATCGACATGT ATTCGATCCG CCAGCCGGTC
GGCATCGGCG CCGGCATCAC GCCTTTCAAC TTCCCCGGCA TGATCCCGAT GTGGATGTTT
GCGCCGGCGA TCGCCTGCGG CAACGCCTTC ATCCTGAAGC CCTCCGAGCG TGATCCCTCC
CTGCCGATCC GTCTCGGTGA ACTGATGATC GAGGCCGGCC TGCCCGCCGG CATCCTCAAC
GTCGTCAATG GCGACAAGGG TGCTGTCGAC GCGATCCTCA CCGATCCCGA TATCGGCGCC
GTCTCCTTCG TCGGCTCGAC GCCGATCGCC CGCTACGTCT ACGGCACCGC GGCGATGAAC
GGCAAGCGCG CCCAGTGCTT CGGCGGCGCC AAGAACCACA TGATCATCAT GCCGGATGCG
GACCTGGATC AGGCCGTCAA CGCGCTGATG GGCGCAGGCT ACGGTTCGGC CGGCGAGCGC
TGCATGGCGA TCTCGGTTGC CGTTCCGGTC GGCGAGGAGA CTGCCAACCG CCTCGTCGAG
AAGCTGACGC CGAAGATCGA ATCCCTGCGT ATCGGCCCCT ATACCGACGA CAAGGCCGAC
ATGGGCCCGC TCGTCACCAA GGAAGCCTAT ACCCGTGTTC GCGGCCTGAT CGACCGCGGC
ATCGAGGAAG GCGCCAAGCT CGTCGTCGAC GGCCGCGATT TCAAACTCCA GGGCTATGAA
GACGGCTATT TCGTCGGCGG CTGCCTGTTC GATCACGTCA CGCCGGAGAT GGATATCTAC
AAGACAGAGA TCTTCGGACC TGTCCTCTCC GTCGTTCGCG CCAACAACTA TGAGGAAGCG
CTGTCGTTGC CGATGAAGCA CGAATACGGC AACGGCGTTG CGATCTACAC CCGCGACGGC
GATGCCGCCC GCGATTTTGC CTCGCGCATC AATATCGGCA TGATCGGCAT CAACGTTCCG
ATCCCGGTTC CGCTCGCCTA CCACTCCTTC GGCGGCTGGA AGGCCTCGAG CTTCGGCGAC
CTCAACCAGC ACGGCACGGA TTCGATCAAG TTCTGGACGA AGACCAAGAC CGTCACTGCT
CGTTGGCCCT CCGGCATCAA AAGCGGCGCG GAATTCGTCA TGCCGACGAT GAAGTGA
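
A GC content figure such as the one in the Gene Information block can be recomputed from a sequence formatted like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases, with the spaces and line breaks of the listing ignored. The function name `gc_percent` is hypothetical, and the example uses only the first row of the gene sequence, so its result is not the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Whitespace from the 10-base block layout is dropped by this filter.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGCGTGAGA TCGGTCATTT CATCGGCGGC AAACATGTTG CCGGCACCAG CGGCCGCGTG"
print(round(gc_percent(first_row)))
```

Passing the complete gene sequence instead of `first_row` gives the figure to compare against the GC content field.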
 
Protein sequence
MREIGHFIGG KHVAGTSGRV SNVYNPATGE VQATVALASV EELRAAVENA KAAQPKWAAT 
NPQRRARVFF KFVELLNKHM DELAEILSKE HGKTIEDAKG DVIRGLEVCE FVCGIPHLAK
GEFTEGAGPA IDMYSIRQPV GIGAGITPFN FPGMIPMWMF APAIACGNAF ILKPSERDPS
LPIRLGELMI EAGLPAGILN VVNGDKGAVD AILTDPDIGA VSFVGSTPIA RYVYGTAAMN
GKRAQCFGGA KNHMIIMPDA DLDQAVNALM GAGYGSAGER CMAISVAVPV GEETANRLVE
KLTPKIESLR IGPYTDDKAD MGPLVTKEAY TRVRGLIDRG IEEGAKLVVD GRDFKLQGYE
DGYFVGGCLF DHVTPEMDIY KTEIFGPVLS VVRANNYEEA LSLPMKHEYG NGVAIYTRDG
DAARDFASRI NIGMIGINVP IPVPLAYHSF GGWKASSFGD LNQHGTDSIK FWTKTKTVTA
RWPSGIKSGA EFVMPTMK
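
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. It does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order paired with their one-letter amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, dropping a trailing stop."""
    dna = "".join(seq.split()).upper()  # remove spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First row of the gene sequence from the listing above.
first_row = "ATGCGTGAGA TCGGTCATTT CATCGGCGGC AAACATGTTG CCGGCACCAG CGGCCGCGTG"
print(translate(first_row))  # MREIGHFIGGKHVAGTSGRV
```

The printed residues match the first two 10-residue blocks of the protein sequence above; the full gene sequence can be passed to `translate` in the same way.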