Gene Smed_2414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2414 
Symbol 
ID5323275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2490588 
End bp2492021 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content62% 
IMG OID640791352 
Productaldehyde dehydrogenase 
Protein accessionYP_001328081 
Protein GI150397614 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.183209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACA AGCGGCAATT CTATATCGGT GGTGAATGGG TCGAGCCGGC CACGCAGAAC 
GATCTTTTCG TACTCAACCC GGCGACCGAA AAGCCGATCG CGGTCATCTC GCTCGGCACC
GCCGTCGATA TCGACCGCGC CGTCGCGGCT GCAAAGAAAG CCTTCGCGAG CTATAGCCGG
ACCAGCGTCG AGGAGCGGCT GGCACTGCTC GAAAAGCTGC TCGCGATTTA CAAGCGCCGC
TACGACGAAA TGGCCGACAC GATCACCGCC GAACTCGGTG CTCCCAAGAC GATGAGCAGA
GAGCAGCAGG CCGAGGTGGG TGTCGGTCAC CTGCAGGGTT TCATCGATGC GCTGAAGCGC
TTGAAGCTTA GGGAGAAGCT GCCGAACGGA GACACGCTCC TGCGCGAGCC GATCGGCGTC
TGCGGTCTTA TCACGCCTTG GAACTGGCCT GTGAATCAGA TCGCGCTCAA GGTCGTGCCG
GCACTTGCGA CAGGCTGCAC CTGCATCCTG AAGCCAAGCG AGTTCACGCC GCTGAACGCG
ATGCTCTACG CCGAGATGAT CGAGGAGGCG GGTTTCCCGG CGGGCGTTTT CAACCTCGTC
AATGGTGACG GTATCCATGC GGGAGCGGCA CTCTCCAAGC ACAGGGACAT CGACATGATG
TCCTTCACCG GATCGACGCG CGCCGGTATC GCGGTCAGCA AGGATGCCGC CGACACGGTC
AAACGCGTGA CTCTCGAACT CGGCGGCAAG TCGCCGAACA TCGTCTTCGC CGATGCGGAT
ATCGAGGAAC GGGTGACTGC AAGCATTCTC GAGTGCTTCA ACAATTCCGG CCAGTCCTGC
GACGCGCCGA CACGCATGCT CGTCGAGCGC AGTGTCTATG ACGAAGTCGT GGAGATCGCC
CGGCGTGTCG GCATGGAGGC GAGGGTCGGC GATCCGACGA AGGAAGGCGC TCACATTGGC
CCGCTGGTCA GCCATATTCA GTATGAGCGC GTGCAGACGC TGATCGAAGC CGGTGTTGCT
GAAGGCGCGA CGCTCCTCGC CGGCGGCCCG GGCAAACCCG AAGGTTTCGA GTCCGGCTAT
TTCGTCCGGC CGACAATCTT CGCCAATGTC GATAATTCCA TGCGGATCGC GCGCGAGGAA
GTGTTCGGCC CGGTCCTTTC GATCATGCCA TTCGATACCG AAGAGGAGGC GGTCGCGGTC
GCCAACGACA CCAATTACGG TCTCGCCGCC TATGTCCAGA CGCGCGACCG CGAAAGGGCG
GAGCGGGTGG CATCCCGCCT GCGCGCCGGA ATGGTGCACA TCAACGGCGG TCCGCACCGT
TACGGAAGTC CCTTCGGCGG CTACAAGCAG TCGGGCAACG GCCGCGAGGG CGGGATGTTC
GGGCTGGAGG ATTTCCTGGA GGTGAAAACG GTTCACCTGC CCGACGCAGC CTGA
 
Protein sequence
MLDKRQFYIG GEWVEPATQN DLFVLNPATE KPIAVISLGT AVDIDRAVAA AKKAFASYSR 
TSVEERLALL EKLLAIYKRR YDEMADTITA ELGAPKTMSR EQQAEVGVGH LQGFIDALKR
LKLREKLPNG DTLLREPIGV CGLITPWNWP VNQIALKVVP ALATGCTCIL KPSEFTPLNA
MLYAEMIEEA GFPAGVFNLV NGDGIHAGAA LSKHRDIDMM SFTGSTRAGI AVSKDAADTV
KRVTLELGGK SPNIVFADAD IEERVTASIL ECFNNSGQSC DAPTRMLVER SVYDEVVEIA
RRVGMEARVG DPTKEGAHIG PLVSHIQYER VQTLIEAGVA EGATLLAGGP GKPEGFESGY
FVRPTIFANV DNSMRIAREE VFGPVLSIMP FDTEEEAVAV ANDTNYGLAA YVQTRDRERA
ERVASRLRAG MVHINGGPHR YGSPFGGYKQ SGNGREGGMF GLEDFLEVKT VHLPDAA