Gene Smed_5259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5259 
Symbol 
ID5319561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp218832 
End bp220295 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content64% 
IMG OID640777036 
Productaldehyde dehydrogenase 
Protein accessionYP_001313968 
Protein GI150377373 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTCC AAACCAGGAT TTCCGGCTAT GCCGACCCGG CCCTTCATGC CGAGGGCCTT 
TATATCGGCG GCAAGTGGCA GAAGGGCAGC GGCATAGCAG TTACAGATCC CTCGACTGGC
AACAGGCTGA CCGAAGTCGC TGACGCCTCG GTCGAGGACG CGATCCGCGC CGTCGCCGCC
GCAGAGACTG CGGCGGCGGG GTGGCGCGAC ACCCCCGCCC GTCAGCGTTC AGAAATCCTG
CGGCGCTGGT TCCAGCTCAT GACCCAGCAT GCCGAAGACC TCGCGACGCT CATCGCGCTC
GAAAACGGCA AGGCCTTGTC CGACGCCCGC GGAGAAGTGA CCTATGCGGC TGAGTTCTTC
CGCTGGTACG CGGAAGAGGC GACGCGCATT CCCGGTGAAT ACAGACACAC GCCTTCCGGC
TCGCACCATA TCCTCGTCGA CCATGAGCCG ATCGGTATCG CCGTCCTGAT TACGCCCTGG
AATTTCCCGG CTGCCATGGC GACGCGCAAG ATCGGCCCGG CGCTCGCTGC CGGCTGCACC
GTCATTCTGA AGCCGGCGTC CGAAACACCG CTCACCGCCT ATGCCATGGC GCGACTCGGG
GAAGAGGCTG GGGTGCCGGC AGGCGTCGTA AACGTGCTGA CCACGAGCAG GCCGGCCGAG
GTAACGAATG CCATGCTCGC CGATCCGCGC GTACGCAAGC TCTCCTTCAC CGGCTCGACC
GGAGTCGGCC GCACGCTCCT TGCCGAAGCC GCCAAATCGG TCGTCAGCTG CTCGATGGAA
CTCGGCGGCA ATGCGCCCTT CGTTGTCTTC GACGACGCCG ATCTGGAAGC CGCGATCGAC
GGCGCCATGG TCGCAAAGAT GCGCAATGCC GGTGAGGCCT GCACCGCCGC CAACCGCTTC
TACGTCCAGT CTGGCATCCA CGACGCCTTC GTCGCATGTC TGACGGCCCG AATGGCCGCG
CTCAAGATCG GCCCGGGGTA TGAGCCGTCC ACGCAATGCG GGCCGATGAT CACGCAGAAC
GCGGTGCGCA AGATCGACAG ACTCGTCTCG GATGCGGTTG CCGGAGGCGC CCGTGCGACG
ACCGGGGGCA AACCGCTTTC TGAAAATGGC TACTTCTATC CGCCGACCGT TCTCGAAAAC
GTGCCGGTCG ATGCCGCGAT CGCGCGGGAA GAGATCTTCG GCCCTGTGGC GCCTATCTAC
AAATTCGATA GCGAGGAGGA GGTCATCCGC CTTGCGAACG ACACCGAGTA CGGGCTCGCC
GCCTACATCT ACAGTGGAGA CCTGAAACGC GCCATGAAGG TGGGCAAGCG CCTGGAAGCC
GGCATGCTCG GCATCAACCG GGGCCTGATG TCTGATCCCG CCGCCCCGTT TGGTGGCGTC
AAGCAGAGCG GCCTCGGGCG CGAAGGCGGG GTAACAGGCA TTCTGGAATT CATGGAGGCG
AAGTATTACG CAGTGGACTA CTGA
 
Protein sequence
MTFQTRISGY ADPALHAEGL YIGGKWQKGS GIAVTDPSTG NRLTEVADAS VEDAIRAVAA 
AETAAAGWRD TPARQRSEIL RRWFQLMTQH AEDLATLIAL ENGKALSDAR GEVTYAAEFF
RWYAEEATRI PGEYRHTPSG SHHILVDHEP IGIAVLITPW NFPAAMATRK IGPALAAGCT
VILKPASETP LTAYAMARLG EEAGVPAGVV NVLTTSRPAE VTNAMLADPR VRKLSFTGST
GVGRTLLAEA AKSVVSCSME LGGNAPFVVF DDADLEAAID GAMVAKMRNA GEACTAANRF
YVQSGIHDAF VACLTARMAA LKIGPGYEPS TQCGPMITQN AVRKIDRLVS DAVAGGARAT
TGGKPLSENG YFYPPTVLEN VPVDAAIARE EIFGPVAPIY KFDSEEEVIR LANDTEYGLA
AYIYSGDLKR AMKVGKRLEA GMLGINRGLM SDPAAPFGGV KQSGLGREGG VTGILEFMEA
KYYAVDY