Gene Smed_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3171 
Symbol 
ID5324050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3333287 
End bp3334795 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content64% 
IMG OID640792119 
Productaldehyde dehydrogenase 
Protein accessionYP_001328830 
Protein GI150398363 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTAC TCGTAAGGCC GAAGGCGCTC GGCGAATACA AGGTCCGTGA ATTCCGAATG 
CTGATCGACG GCCAGTGGGT CGATAGCGCC GAGGGCCGGA CGATCGAGCG CGTGGCACCC
GGGCACGGCG TCGTCGTCAG CCGCTATCAG GCCGCTACGA AGGTCGATGC CGAGCGGGCG
ATCGCCGCGG CGCGGCGCGC CTTCGACCAG GGCCCATGGC CGCGCATGAC CGCCGCGGAG
AGATCGCTGA TCCTGCTCAA GGTCGCCGAC ATGGTCGCGG CGCGCGCGGA TGAACTCGCC
TTTCTCGACG CGGTCGAATC CGGCAAGCCG ATCAGCCAGG CAAAAGGCGA ACTGGCCGGT
GCCGCCGACA TCTGGCGCTA TGCTGCGGCA CTTGCCCGCG ATCTCTCCGG CGAAAGCTAC
AACACGCTCG GGGAGGGGAC GCTCGGCGTG GTGCTGCGCG AGCCGATCGG TGTCGTTTCA
ATCATCACGC CCTGGAATTT CCCTTTCCTG ATCGTAAGCC AGAAGCTCCC CTTTGCACTC
GCCGCCGGCT GCACGACCGT GGTGAAACCG TCGGAGCTCA CCTCCGCCTC GACGCTGGTG
CTCGGCGAGA TCCTGGAAGC TGCAGGCGTG CCCGCAGGCG CCGTGAACAT CATCGTCGGC
ACGGGACCGG AGGCCGGCGC CCCGCTCACG ACGCACCCGC ATGTCGACAT GGTCTCCTTC
ACGGGCTCCA CCGGCATTGG CAGGCTGACC ATGGCGAATG CGGCGCAGAC CCTGAAAAAA
GTCTCGCTGG AACTCGGCGG CAAGAACCCG CAGATCGTCT TTCCGGACGC CAATCTGGAT
GAATTCATCG ATGCGGCCGT CTTTGGTGCC TATTTCAATG CCGGCGAATG CTGCAATGCC
GGCTCACGGC TGATCCTTCA CCGCGACATT GCCGAGGAGG TGACGGCGCG CATCGCCGGC
CTTTCGGCAA AGGTGAAGGT GGGCGATCCG CTCGATCCTG AAACGCAGGT CGGCGCGATC
ATCACCCCGC AGCATCTTCA GAAGATCGCC GGCTATGTCT CGTCGGCGTC GAACGAAGGT
GCGAGGGTGA CCCATGGTGG CATGGAGCTC GATCTCGGCA TGGGGCAGTT CATGGCTCCG
ACCATCCTCT CTGCGGTCAG ACCCGAAATG GCAGTGGCGC GGGAGGAAGT CTTCGGCCCG
GTTCTCTCTG TGCTGACTTT CGAAACGACG GAGGAGGCGA TCCGCATCGC CAATTCGATC
GACTACGGAC TTTCGGCCGG TGTCTGGAGC CGTGATTTCG ACACCTGCCT TACGATCGGG
CGGCGCGTGC GCGCCGGGAC CATCTGGATG AATACTTTCA TGGACGGCGC ATCGGAGCTG
CCCTTCGGCG GCTACCGGCA ATCGGGGCTC GGCCGCGAAC TCGGACGCCA TGCGGTCGAG
GATTATACGG AAACGAAGAC GCTCAACATG CATATCGGCC AGAGAACCAA CTGGTGGATG
CCGCGCTGA
 
Protein sequence
MTVLVRPKAL GEYKVREFRM LIDGQWVDSA EGRTIERVAP GHGVVVSRYQ AATKVDAERA 
IAAARRAFDQ GPWPRMTAAE RSLILLKVAD MVAARADELA FLDAVESGKP ISQAKGELAG
AADIWRYAAA LARDLSGESY NTLGEGTLGV VLREPIGVVS IITPWNFPFL IVSQKLPFAL
AAGCTTVVKP SELTSASTLV LGEILEAAGV PAGAVNIIVG TGPEAGAPLT THPHVDMVSF
TGSTGIGRLT MANAAQTLKK VSLELGGKNP QIVFPDANLD EFIDAAVFGA YFNAGECCNA
GSRLILHRDI AEEVTARIAG LSAKVKVGDP LDPETQVGAI ITPQHLQKIA GYVSSASNEG
ARVTHGGMEL DLGMGQFMAP TILSAVRPEM AVAREEVFGP VLSVLTFETT EEAIRIANSI
DYGLSAGVWS RDFDTCLTIG RRVRAGTIWM NTFMDGASEL PFGGYRQSGL GRELGRHAVE
DYTETKTLNM HIGQRTNWWM PR