Gene Smed_0563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0563 
Symbol 
ID5321399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp610662 
End bp612125 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content64% 
IMG OID640789499 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_001326254 
Protein GI150395787 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.886348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.578799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCCC AACCGAAAGC CTCGCACTTC ATCGACGGCG AATATGTCGA GGACGCCGCC 
GGCACGGTGA TCGAAAGCAT CTATCCGGCG ACCGGCGAAG TGATCGCCCG CCTCTATGCT
GCAACGCCCG CAATCGTCGA GAAGGCGATC GCCGCGGCCA AGCGGGCGCA GCCGGAATGG
GCGGCGATGA GCCCGACGGC TCGCGGCCGC ATCCTGAAGC GGGCCGCCGA GATCATGCGC
GAGCGCAACC GGGAGCTCTC CGAACTAGAG ACGCTCGACA CGGGCAAGCC CATCCAGGAA
ACCATCGTCG CCGACCCGAC CTCGGGCGCC GACAGCTTCG AGTTCTTCGG GGGTATCGCC
CCGGCCGCCC TCAATGGCGA TTATATCCCG CTCGGCGGCG ATTTCGCCTA TACGAAGCGG
GTACCTCTCG GCGTCTGTGT CGGCATCGGC GCCTGGAACT ATCCGCAGCA GATCGCCTGT
TGGAAGGGTG CGCCTGCCCT CGTCGCCGGC AATTCGATGG TGTTCAAGCC GTCGGAGAAC
ACGCCTCTCG GCGCGCTGAA GATCGCCGAA ATCCTGATCG AAGCGGGTCT GCCGAAGGGC
CTGTACAACG TCGTTCAGGG CGACCGATCG ACTGGACCCC TCCTCGTCAA TCATCCTGAC
GTCGCCAAGG TATCGCTGAC CGGCTCGGTG CCGACCGGCC GCAAGGTCTA TGAGGCGGCC
GCGGCCGGAC TTCGCCACGT CACGATGGAG CTCGGGGGCA AGTCGCCGCT GATCGTCTTC
GACGATGCCG ATCTCGAAAG CGCGATCGGC GGCGCCATGC TCGGCAATTT CTATTCGACC
GGTCAGGTCT GCTCCAACGG CACGCGCGTC TTCGTTCAGA AGAAGATAAA GCAATCCTTC
CTGGCGCGGC TGAAGGAGCG TACCGATGCG ATCGTCATCG GGGACCCGAT GGACGAGGCG
ACGCAGCTCG GGCCGATGGT ATCCACGGCG CAGCGTGACA AGGTCTTCTC TTATATCGAG
AAAGGCAAAT CGGAGGGCGC GCGGCTCGTG ACCGGCGGCG GCATCCCCAA CAATGTGAGC
GCCGAAGGCA CCTATATCCA GCCGACGGTC TTTGCCGATG TCACCGACGA GATGACCATC
GCGCGCGAAG AAATCTTCGG CCCGGTCATG TGCGTGCTTG ATTTCGATGA CGAGGCGGAG
GTCGTCGCAC GCGCCAACGC CACCGAATTC GGCCTCTCCG CCGGCGTCTT CACCGCCGAC
CTCACCCGCG CCCATCGCGT CGTCGACCGG CTGGAGGCCG GAACGCTCTG GATCAACACC
TATAATCTCT GCCCGGTCGA GATTCCGTTC GGCGGATCCA AGCAGTCCGG CTTCGGGCGC
GAGAATTCAG CCGAAGCGCT CAAACACTAT ACCGAGCTCA AGACCGTCTA TGTCGGCATG
GGACCGGTCG AGGCGCCGTA TTGA
 
Protein sequence
MRAQPKASHF IDGEYVEDAA GTVIESIYPA TGEVIARLYA ATPAIVEKAI AAAKRAQPEW 
AAMSPTARGR ILKRAAEIMR ERNRELSELE TLDTGKPIQE TIVADPTSGA DSFEFFGGIA
PAALNGDYIP LGGDFAYTKR VPLGVCVGIG AWNYPQQIAC WKGAPALVAG NSMVFKPSEN
TPLGALKIAE ILIEAGLPKG LYNVVQGDRS TGPLLVNHPD VAKVSLTGSV PTGRKVYEAA
AAGLRHVTME LGGKSPLIVF DDADLESAIG GAMLGNFYST GQVCSNGTRV FVQKKIKQSF
LARLKERTDA IVIGDPMDEA TQLGPMVSTA QRDKVFSYIE KGKSEGARLV TGGGIPNNVS
AEGTYIQPTV FADVTDEMTI AREEIFGPVM CVLDFDDEAE VVARANATEF GLSAGVFTAD
LTRAHRVVDR LEAGTLWINT YNLCPVEIPF GGSKQSGFGR ENSAEALKHY TELKTVYVGM
GPVEAPY