Gene Smed_3159 details


Gene Information

Locus tag: Smed_3159
Symbol:
ID: 5324038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3318346
End bp: 3319878
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 65%
IMG OID: 640792107
Product: aldehyde dehydrogenase
Protein accession: YP_001328818
Protein GI: 150398351
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
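
The gene length and protein length listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3318346, 3319878)
print(length)                  # 1533
print(protein_length(length))  # 510
```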


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.510599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATTG CAGCCAAGAA GGTCGACGTT GCAAAGGAGG CTGCGGCTCT CCTTGAGAAG 
ATGGGCGTCG CCAAAGAACT CTACGCAGGC GGGGACATGC CGTCTTTCAG CCCTGTCACC
GGCGAGAAGA TCGCCAGCCT CAAGACCGTG ACGGCCAGCG AGGCCGCCGG GAAGATCGAG
CGGGCCGACG AGGCCTTCCG CTCCTGGAGG CTCGTACCGG CGCCCAAGCG CGGCGAACTC
GTCCGCCTGC TTGGCGAGGA GCTGCGCGCC TTCAAGGCAG ATCTCGGACG CCTCGTCTCG
ATCGAAGCCG GAAAGATCCC CTCCGAAGGC CTCGGCGAAG TGCAGGAAAT GATCGACATC
TGCGATTTCG CCGTCGGCCT TTCCCGTCAG CTCTACGGTC TGACGATCGC GACCGAGCGT
CCCGGCCACC GGATGATGGA AACCTGGCAT CCGCTCGGCG TCGTCGGCAT CATCTCGGCG
TTCAACTTTC CCGTCGCGGT ATGGTCTTGG AATGCGGCGC TCGCGCTCGT TTGCGGCGAT
GCCGTCGTCT GGAAGCCGTC GGAAAAGACA CCGCTTACCG CGCTTGCATG CCAGGCGATC
CTGGAACGCG CCATTGCCCG TTTCGGCGAC GCCCCGGAAG GCCTGTCGCA GGTCCTGATC
GGTGACCGTG CGATTGGCGA GGTACTCGTC GACCATCCGA AGGTGCCTCT CGTCTCGGCG
ACCGGCTCGA CCCGCATGGG CCGCGAGGTC GGTCCGCGGC TTGCCAAGCG CTTCGCCCGT
GCGATCCTGG AACTCGGCGG CAACAATGCG GGCATCGTCT GCCCCTCCGC CGATCTCGAC
ATGGCGCTTC GCGCCATCGC CTTCGGCGCA ATGGGCACCG CCGGTCAACG CTGCACGACG
CTGCGCCGCC TCTTCGTCCA TGAGAGCGTC TATGATCAGC TCGTGCCGCG GCTGAAGAAA
GCCTATCAGT CGGTTTCCGT CGGCAATCCG CTGGAACCGG CCGCACTGGT CGGGCCGCTC
GTCGACAAGG CAGCCTTTGA CGGCATGCAG AAGGCGATCT CGGAGGCGCA GAGCCATGGC
GGATCCGTCA CCGGCGGCGA ACGCGTCGAA CTCGGCTACG ACAATGGCTT CTACGTCAAG
CCCGCTCTGG TCGAAATGCC GCAGCAGGAG GGACCGGTTC TCGAAGAGAC CTTCGCGCCG
ATCCTCTACG TCATGAAGTA CAGCGACTTC GACGCGGTGC TCGCCGAACA CAATGCAGTT
GCCGCCGGAC TTTCGTCCTC GATCTTCACC CGGGACATGC AGGAAGCGGA GCGCTTCCTC
GCAGCCGATG GCTCCGACTG CGGCATCGCC AACGTCAATA TCGGCACCTC CGGGGCCGAG
ATCGGTGGGG CGTTCGGTGG CGAGAAGGAG ACCGGCGGCG GCCGCGAATC CGGTTCGGAC
GCCTGGAAGG CCTATATGCG ACGCGCCACA AATACGGTGA ACTATTCCAA GGCTCTGCCG
CTGGCGCAGG GCGTCTCTTT CGACATCGAA TAA
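
The gene sequence is printed in space-separated blocks of 10 bases. As a minimal sketch, the helpers below strip that layout and compute the GC fraction of a sequence, which is the kind of calculation behind a GC content figure such as the one reported in the gene information. The function names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGAACATTG CAGCCAAGAA GGTCGACGTT GCAAAGGAGG CTGCGGCTCT CCTTGAGAAG"
seq = clean_sequence(first_line)
print(len(seq))
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give the value for the whole gene.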
 
Protein sequence
MNIAAKKVDV AKEAAALLEK MGVAKELYAG GDMPSFSPVT GEKIASLKTV TASEAAGKIE 
RADEAFRSWR LVPAPKRGEL VRLLGEELRA FKADLGRLVS IEAGKIPSEG LGEVQEMIDI
CDFAVGLSRQ LYGLTIATER PGHRMMETWH PLGVVGIISA FNFPVAVWSW NAALALVCGD
AVVWKPSEKT PLTALACQAI LERAIARFGD APEGLSQVLI GDRAIGEVLV DHPKVPLVSA
TGSTRMGREV GPRLAKRFAR AILELGGNNA GIVCPSADLD MALRAIAFGA MGTAGQRCTT
LRRLFVHESV YDQLVPRLKK AYQSVSVGNP LEPAALVGPL VDKAAFDGMQ KAISEAQSHG
GSVTGGERVE LGYDNGFYVK PALVEMPQQE GPVLEETFAP ILYVMKYSDF DAVLAEHNAV
AAGLSSSIFT RDMQEAERFL AADGSDCGIA NVNIGTSGAE IGGAFGGEKE TGGGRESGSD
AWKAYMRRAT NTVNYSKALP LAQGVSFDIE