Gene Smed_6251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6251 
Symbol 
ID5320553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1170983 
End bp1172446 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content63% 
IMG OID640777851 
Productaldehyde dehydrogenase 
Protein accessionYP_001314783 
Protein GI150378188 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.392289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT TCCAGTGTTA CATCAACGGT GAATTCGCAG ACGGCGAAGC CCGCTTCGAA 
AGCATCGATC CCACCACCGG CCGTGCCTGG GCAGAGATGC CAGAGGCACG GGAAGCGGAC
GTCAATCGTG CCGTCGAGGC TGCACGTATC GCCCTTCACG ACCAGTCATG GTCCACACTG
ACGGCCACGC AGAGAGGCAA GCTCCTCTAC AAGCTTGCCG ATCTCGTCGC TGAGAATGCC
GGAAGACTTG CCGAGCTGGA GACCCGCGAC ACGGGCAAGA TCATCCGCGA GACCTCGTCG
CAGATCGCCT ATGTCGCCGA CTACTATCGC TACTACGCCG GGATCGCCGA CAAGATTGAG
GGCTCCTATC TGCCGATCGA CAAGCCCGAC ATGGATGTCT GGCTGCGCCG CGAGCCGATC
GGCGTCGTGG CCATGGTCGT GCCGTGGAAC AGCCAGCTTT TCCTTTCGGC CGTAAAGATC
GGTCCGGCAC TGGCGGCCGG CTGCACCATG GTGGTGAAGG CCTCGGAGGA CGGGCCGGCG
CCGCTTCTCG AATTTGCCCG GCTGGTGCAT GCGGCGGGTT TTCCCGCCGG TGTCGTCAAC
ATCGTCACCG GCTTCGGCCC ATCATGCGGC GCGGCGCTCA GCCGCCATCC GCAGGTCGAT
CACATAGCCT TCACTGGCGG GCCGGAGACG GCGCGCCATA TAGTCCGCAA TTCAGCGGAA
AATCTCGCCT CGACCTCGCT CGAGCTCGGC GGCAAATCGC CCTTTATCGT CTTTGCCGAC
GCAGATCTTG AAAGTGCGGC CAATGCCCAG ATCGCCGGGA TCTTTGCCGC GACCGGGCAG
AGCTGCGTGG CCGGCTCACG GCTGATCGTC GAAAAAAGTG TCAAGGACCG CTTCTTGCAG
ATCCTAAAGG CCAAGGCAGA GACAATTCGC ATCGGCAGCC CGCTCGAGAT GTCGACGGAG
GTGGGACCGC TCGCGACGGA GCGCCAGTGC AACCACGTCA AGGCCCTTAT CGCACGCTCG
CTGGCTGCTG GCGCGAAGCT GGTGACCGGA GGCACAGCGC CGGAGGGCAC CGGGTTCTAT
TATCGCCCGA CCATTCTCGA TTGCGACGGC AGCGCATCGC CGTCCCTCGA GAACGAATTC
TTCGGTCCTG TGCTCTCGGT TCTGTCTTTC GAGACAGAAG CTGAAGCTCT CCATCTCGCC
AACGGCTCCC GCTTCGGCCT TGCAGCCGGG GTCTTTACGC AGAATCTCAC CCGAGCGCAC
CGCCTCATGA AGGGAATTCG CGCGGGAATC GTCTGGGTCA ATACCTATCG GGCGGTCTCC
CCGGTCGCGC CCTTCGGCGG CTTCGGGCTC TCGGGTCACG GGCGTGAGGG CGGCCTGGAG
GCGGCGCTCG ACTATACCCG GAGCAAGACC GTTTGGCTCA GGACGTCGGA CGATCCAATT
CCTGATCCCT TCGTGATGCG GTGA
 
Protein sequence
MQRFQCYING EFADGEARFE SIDPTTGRAW AEMPEAREAD VNRAVEAARI ALHDQSWSTL 
TATQRGKLLY KLADLVAENA GRLAELETRD TGKIIRETSS QIAYVADYYR YYAGIADKIE
GSYLPIDKPD MDVWLRREPI GVVAMVVPWN SQLFLSAVKI GPALAAGCTM VVKASEDGPA
PLLEFARLVH AAGFPAGVVN IVTGFGPSCG AALSRHPQVD HIAFTGGPET ARHIVRNSAE
NLASTSLELG GKSPFIVFAD ADLESAANAQ IAGIFAATGQ SCVAGSRLIV EKSVKDRFLQ
ILKAKAETIR IGSPLEMSTE VGPLATERQC NHVKALIARS LAAGAKLVTG GTAPEGTGFY
YRPTILDCDG SASPSLENEF FGPVLSVLSF ETEAEALHLA NGSRFGLAAG VFTQNLTRAH
RLMKGIRAGI VWVNTYRAVS PVAPFGGFGL SGHGREGGLE AALDYTRSKT VWLRTSDDPI
PDPFVMR