Gene Smed_4477 details

Gene Information

Locus tag: Smed_4477
Symbol:
ID: 5318342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 959217
End bp: 960713
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 63%
IMG OID: 640776278
Product: aldehyde dehydrogenase
Protein accession: YP_001313210
Protein GI: 150376614
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.749064
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGCG CGTGCCGTAA CCCGGCCGTC GCGCTTGCGG AAGAAGACAA GGAAGCGGAA 
GCCATGACAC TTCACCAGAA CCTGATCGCC GGCGAATGGG TCGGCGAAGA CGGCGTTGCG
AACGTCAACC CGTCGAACAC CAATGACGTC GTCGGAGATT ATGCCCGCGC CAGCGCCGAA
GACGCGAAAG CCGCGATTGC GGCGGCCAAG GCTGCCTTTC CCACCTGGTC GCGCTCCGGC
ATCCTCGAGC GCCATGCGAT CCTGAAGAAA ACCGCCGACG AAATCCTCGC CCGCAAGGAC
GAGCTGGGAC GGCTGCTGTC GCGTGAGGAG GGCAAGACCC TCGCCGAGGG GATCGGCGAA
ACGGTCCGGG CCGGTCAGAT CTTCGAATTC TTCGCCGGCG AAACTTTGCG CCTCGCTGGC
GAGGTCGTCC CATCGGTCAG GCCGGGCATC GGCGTCGAGA TCACCCGCGA GCCGGTCGGC
GTCGTTGGCA TCATCACGCC CTGGAACTTT CCCATCGCCA TTCCCGCCTG GAAGGTCGCT
CCGGCGCTCT GCTACGGCAA TACCGTCGTC TTCAAGCCGG CGGAACTGGT ACCGGGCTGT
TCATGGGCGA TCGTCGATAT CCTCCATCGT GCGGGCCTGC CAAAGGGCGT ACTGAACCTC
GTCATGGGCA AGGGTTCGGT CGTAGGCCAG GCAATGCTCG ACAGCCCGGA CGTTCAGGCG
ATAACCTTCA CTGGCTCGAC CGCAACCGGA AAACGGGTCG CAGTCGCCTC GGTCGAACAT
AACCGCAAAT ACCAGTTGGA GATGGGAGGT AAGAACCCGT TCGTCGTTCT CGACGACGCC
GATCTTTCCG TTGCCGTCGA AGCGGCAGTC AATTCCGCCT TTTTCTCGAC CGGTCAGCGT
TGCACCGCCT CCTCGCGGAT CATCGTCACC GAGGGCATCC ATGACCGGTT CGTCGCCGCC
ATGGGCGAGC GGATCAAGGG TCTCGTCGTC GACGACGCGC TGAAGGCCGG CACCCATATC
GGACCGGTGG TCGATCAGAG CCAGCTCAAT CAGGACACCG ACTACATCGC CATCGGCAAG
AAGGAGGGCG CGAAGCTCGC CTTCGGCGGT GAACTGGTCT CGCGTGACAC GCCCGGCTTC
TATCTGCAGC CGGCGCTGTT CACCGAGGCG ACGAACGATA TGCGTATCTC CCGCGAGGAA
ATCTTCGGAC CTGTCGCGGC CGTCATCCGC GTCAGGGATT ACGATGAAGC GCTGGCCGTC
GCCAATGACA CGCCCTTCGG TCTGTCTTCG GGTATCGCCA CCACCAGCTT GAAACACGCG
ACGCACTTCA AGCGCAATGC CGAGGCCGGC ATGGTGATGG TCAACCTGCC CACGGCGGGT
GTCGACTTCC ACGTGCCGTT CGGCGGCCGC AAGGCTTCCT CCTACGGTCC TCGCGAGCAG
GGCAAATACG CCGCTGAATT CTACACCAAT GTCAAAACCG CCTACACGCT GGCTTGA
 
Protein sequence
MAGACRNPAV ALAEEDKEAE AMTLHQNLIA GEWVGEDGVA NVNPSNTNDV VGDYARASAE 
DAKAAIAAAK AAFPTWSRSG ILERHAILKK TADEILARKD ELGRLLSREE GKTLAEGIGE
TVRAGQIFEF FAGETLRLAG EVVPSVRPGI GVEITREPVG VVGIITPWNF PIAIPAWKVA
PALCYGNTVV FKPAELVPGC SWAIVDILHR AGLPKGVLNL VMGKGSVVGQ AMLDSPDVQA
ITFTGSTATG KRVAVASVEH NRKYQLEMGG KNPFVVLDDA DLSVAVEAAV NSAFFSTGQR
CTASSRIIVT EGIHDRFVAA MGERIKGLVV DDALKAGTHI GPVVDQSQLN QDTDYIAIGK
KEGAKLAFGG ELVSRDTPGF YLQPALFTEA TNDMRISREE IFGPVAAVIR VRDYDEALAV
ANDTPFGLSS GIATTSLKHA THFKRNAEAG MVMVNLPTAG VDFHVPFGGR KASSYGPREQ
GKYAAEFYTN VKTAYTLA
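
The coordinates and lengths reported in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record: the variable names and the `strip_blocks` helper are hypothetical, and it assumes that the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 959217, 960713
gene_length_bp = 1497
protein_length_aa = 498

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)


def strip_blocks(text):
    """Remove the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split())


# First and last lines of the gene sequence, copied from the Sequence section.
first_line = strip_blocks(
    "ATGGCAGGCG CGTGCCGTAA CCCGGCCGTC GCGCTTGCGG AAGAAGACAA GGAAGCGGAA"
)
last_line = strip_blocks(
    "GGCAAATACG CCGCTGAATT CTACACCAAT GTCAAAACCG CCTACACGCT GGCTTGA"
)

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop equals protein length:", codons - 1 == protein_length_aa)
print("starts with ATG:", first_line.startswith("ATG"))
print("ends with TGA:", last_line.endswith("TGA"))
```

The same `strip_blocks` helper could be applied to the full gene or protein sequence before passing it to other tools, since most of them expect an unbroken string of residues.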