Gene Smed_0324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0324 
Symbol 
ID5321157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp352885 
End bp354381 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content63% 
IMG OID640789259 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001326017 
Protein GI150395550 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.948085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.128274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGAAC TCGGTCATTT CATCGATGGA AAGCGCGTCG CCGGCAAGAG CGGGCGCGTA 
AGCAAGATCT TCAACCCGGC GACTGGCGAG GTGCAAGGCA CCGTGGCGCT GGCGAGCGAC
GCGGATCTTG CTGCCGCCGT CGAGAGCGCC AAGGCCGCCC AGCCCAAATG GGCCGCCACC
AATCCGCAGC GCCGCGCCCG CGTTTTCATG AAGTTCGTCC AGCTCCTGAA CGACAACATG
GACGAGCTCG CCGAGATGCT TTCCCGCGAG CATGGCAAGA CCATCGACGA TGCCAAGGGC
GACATCGTGC GCGGCCTTGA AGTCTGCGAA TTCGTCATCG GCATTCCGCA TCTGCAGAAG
AGCGAATTCA CGGAAGGTGC CGGTCCCGGC ATCGACATGT ATTCGATGCG CCAGCCCGTC
GGCGTCGGCG CGGGCATCAC GCCGTTCAAC TTCCCCGGCA TGATCCCGAT GTGGATGTTC
GCTCCGGCAA TCGCCTGCGG CAACGCCTTC ATCCTGAAGC CTTCCGAGCG TGATCCCTCC
GTGCCGATCC GGCTTGCCGA ACTGATGATC GAAGCGGGCC TGCCTGCCGG CATTCTCAAC
GTCGTCAACG GCGACAAGGG CGCGGTCGAT GCGATCCTGA CGCATCCTGA CATCGCCGCA
GTTTCCTTCG TCGGCTCGAC CCCCATCGCC CGCTACGTCT ACGGTACGGC TGCGATGAAC
GGCAAGCGTG CGCAATGCTT CGGCGGCGCG AAGAACCACA TGATCATCAT GCCGGATGCC
GACCTCGACC AGGCCGCCAA TGCGCTGATC GGCGCCGGCT ACGGTTCCGC CGGCGAGCGC
TGCATGGCGA TCTCGGTGGC CGTTCCGGTC GGCGAGGAAA CCGCAAACCG GCTGATCGAC
AAGCTTGTGC CTATGGTCGA AAGCCTGCGC ATCGGCCCCT ATACCGACGA TAAGGCCGAT
ATGGGGCCCG TCGTCACCAA GGAGGCGGAG CAGCGGATCC GCGGCCTGAT CGAGAGCGGC
ATCGAGCAGG GTGCGAAGCT CGTCGTCGAC GGTCGCGATT TCAAGCTGCA GGGCTATGAG
AACGGCCACT TCGTCGGCGG CTGCCTCTTC GATCACGTCA CGCCCGATAT GGACATCTAC
AAGACGGAAA TCTTCGGACC CGTCCTGTCT GTCGTGCGCG CAACGAATTA CGAAGAGGCC
CTGTCTCTGC CGATGAAACA CGAATACGGC AACGGCGTTG CCATCTATAC CCGCGACGGT
GACGCTGCCC GCGACTTCGC CTCGCGCATC AACATCGGCA TGGTGGGCGT CAATGTTCCG
ATCCCGGTTC CGCTCGCCTA CCATTCCTTC GGCGGCTGGA AATCTTCGTC CTTCGGCGAC
CTCAACCAGC ATGGCCCGGA CTCGATCAAG TTCTGGACCC GCACCAAGAC CATCACCTCC
CGTTGGCCGT CGGGCATCAA GGACGGTGCC GAGTTCTCGA TCCCGACGAT GCGGTAA
 
Protein sequence
MYELGHFIDG KRVAGKSGRV SKIFNPATGE VQGTVALASD ADLAAAVESA KAAQPKWAAT 
NPQRRARVFM KFVQLLNDNM DELAEMLSRE HGKTIDDAKG DIVRGLEVCE FVIGIPHLQK
SEFTEGAGPG IDMYSMRQPV GVGAGITPFN FPGMIPMWMF APAIACGNAF ILKPSERDPS
VPIRLAELMI EAGLPAGILN VVNGDKGAVD AILTHPDIAA VSFVGSTPIA RYVYGTAAMN
GKRAQCFGGA KNHMIIMPDA DLDQAANALI GAGYGSAGER CMAISVAVPV GEETANRLID
KLVPMVESLR IGPYTDDKAD MGPVVTKEAE QRIRGLIESG IEQGAKLVVD GRDFKLQGYE
NGHFVGGCLF DHVTPDMDIY KTEIFGPVLS VVRATNYEEA LSLPMKHEYG NGVAIYTRDG
DAARDFASRI NIGMVGVNVP IPVPLAYHSF GGWKSSSFGD LNQHGPDSIK FWTRTKTITS
RWPSGIKDGA EFSIPTMR