Gene Smed_2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2129 
Symbol 
ID5322989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2194319 
End bp2195704 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content63% 
IMG OID640791067 
Productaldehyde dehydrogenase 
Protein accessionYP_001327797 
Protein GI150397330 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.237347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA TCAGATGCAT TTCGCCGGTC AATGGCGAAG TTTACGCCGA GCGCCCGGCA 
ATGACGTTGG AGATGGCGAG GCAGGCGGTG GCGCATGCGC GCCAGGCGCA GAAAGGCTGG
GCGCGGCGAC CGCTCGAAGA GCGTGTGAAA CTGGTACTTG CCGGCGTGGC GCGGCTCAAT
GAGATGGTTG ACGAGGTGGT GCCGGAACTC GCCTGGCAGA TGGGCCGGCC GGTGCGCTAT
GGTGGCGAGT TCAAGGGCTT CAACGAGCGC TCCAATTATG TCGCCTCGAT TGCCGCCGAC
GCGCTGACAT CTATTGTCGT GGAGGAGAGC GACCGTTTCG AGCGCCGCAT CGAACGCGAG
CCACATGGGG TCGTCTTCGT CATCGCGCCG TGGAACTATC CCTATATGAC GGCGATCAAC
ACGATCGCGC CGGCGCTGAT GGCGGGCAAT GCAGTGATCC TCAAACACGC CAGCCAGACG
ATCCTCGTCG GGGAGCGCAT GGTGCGCGCC TTCATCGAGG CAGGCGTTCC GGCAGATGTG
TTCCAGAACC TGTTCCTGGA TCACGACACG ACGGCGGCGC TGATCGCCGC CAAGAGCTTC
GATTTCATCA ACTTCACCGG CTCGGTCGAG GGCGGCCGCT CGATCGAACG GGCGGCGGCA
GGCACCTTCA CCGGTCTCGG ACTTGAACTC GGCGGCAAGG ACCCGGGTTA TGTGATGGAG
GATGCCGATC TCGATGCTGC CGTCGACACG CTGATGGACG GAGCGACCTA CAATTCCGGC
CAGTGCTGCT GCGGCATCGA GCGCATCTAT GTGCACGAGT CGCTCTATGA CGCTTTCGTC
GAGAAGTCGG TCGCCTGGGT CTCCAACTAC AAGCTCGGCG ACCCGCTCGA TCCCGAAACG
ACACTCGGCC CCATGGCCAA CAAGCGGTTT GCCGCGAACG TGCGCCACCA GATCGCCGAC
GCCGTGTCGA AGGGCGCCAA GGCGCTGATC GATCCCAAGC TTTTCCCTGA GGACGATGGC
GGCGCCTACC TCGCGCCACA GATTCTTGTC GACGTCGATC ACTCCATGGA ATTCATGCGG
GAGGAGACAT TCGGGCCTGC TGTCGGCATC ATGAGGGTGA AGAGCGACGC TCAGGCAATC
GAACTTATGA ACGACAGCCG ATACGGCTTG ACTGCGTCGC TCTGGACGAA GGATGCGGAG
CGTGCGGGGC GCATCGGGCG GGAGCTCGAG ACAGGCACGG TCTTCATGAA CCGCGCCGAC
TATCTGGATC CGGCGCTTTG CTGGACCGGC GTCAAGGAGA CGGGGCGCGG CGGCTCGCTC
TCGGTCCTCG GCTTCCACAA TCTGACCCGC CCGAAATCCT ATCATCTGAA GAAAGCGACC
GCATGA
 
Protein sequence
MTMIRCISPV NGEVYAERPA MTLEMARQAV AHARQAQKGW ARRPLEERVK LVLAGVARLN 
EMVDEVVPEL AWQMGRPVRY GGEFKGFNER SNYVASIAAD ALTSIVVEES DRFERRIERE
PHGVVFVIAP WNYPYMTAIN TIAPALMAGN AVILKHASQT ILVGERMVRA FIEAGVPADV
FQNLFLDHDT TAALIAAKSF DFINFTGSVE GGRSIERAAA GTFTGLGLEL GGKDPGYVME
DADLDAAVDT LMDGATYNSG QCCCGIERIY VHESLYDAFV EKSVAWVSNY KLGDPLDPET
TLGPMANKRF AANVRHQIAD AVSKGAKALI DPKLFPEDDG GAYLAPQILV DVDHSMEFMR
EETFGPAVGI MRVKSDAQAI ELMNDSRYGL TASLWTKDAE RAGRIGRELE TGTVFMNRAD
YLDPALCWTG VKETGRGGSL SVLGFHNLTR PKSYHLKKAT A