Gene Smed_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3164 
Symbol 
ID5324043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3324751 
End bp3326406 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content64% 
IMG OID640792112 
Productcholine dehydrogenase 
Protein accessionYP_001328823 
Protein GI150398356 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.637341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATG ACTATATCAT TACCGGCGGA GGTCCTGCGG GCTGCGTTCT CGCAAACCGC 
CTGAGCGAGG ACCCGGCCGT CAAGGTGCTG CTTCTCGAGG CAGGCGGTGG CGATTGGAAC
CCGCTTTTCC ACATGCCGGC CGGTTTCGCC AAGATGACGA AGGGAGTCGC AAGCTGGGGC
TGGCACACAG TGCCGCAGAA GCACATGAAG GGCAGGGTGC TGCGCTATAC GCAGGCCAAG
GTGATCGGCG GCGGTTCCTC GATCAATGCG CAGCTTTATA CGCGCGGCAA CGCGGCGGAT
TACGACCTCT GGGCTGGCGA GGATGGCTGC ACCGGCTGGG ATTATCGCAG CGTGCTGCCC
TATTTCAAGC GCGCGGAAGA CAACCAGCGT TTCGCCGACG ATTATCACGC CTATGGCGGG
CCGCTCGGCG TCTCCATGCC GGTTTCGACG CTGCCGATCT GCGACGCCTA TATCCGCGCG
GGGCAGGAGC TCGGCATTCC CTACAATCAC GATTTCAACG GCAAGCAGCA GGCGGGTGTC
GGCTTTTATC AACTGACCCA GCGTGATCGC CGCCGTTCCT CCGCTTCGCT GGCCTACCTT
TCTCCGGTCA GGGACCGGAA AAACCTCATT GTGCGCACCG GCGCTCGCGT AGCCCGCATC
GTCCTCGAAG GAAAACGCGC GGTCGGTGTA GAGGTGGTAA CGGGGAAAGG CAGCGAAATC
ATCCGTGCGA ACCGGGAAGT GCTGGTCACC TCCGGCGCGA TCGGCTCGCC CAAGCTGCTC
CTTCAGTCCG GCATCGGCCC GGCCGATCAT CTGCGCTCCG TCGGCGTCGA GGTGCGGCAC
GATCTCCCCG GCGTTGGCGG GAACCTCCAG GATCACCTCG ATCTCTTCGT CATTGCCGAA
TGCACCGGCG ATCACACCTA TGACGGCGTC GCCCGGCTGC ACCGCACTTT CTGGGCCGGC
CTGCAATATG TGCTCTTCCG CTCCGGCCCG GTGGCCTCGT CGCTCTTCGA GACCGGCGGC
TTCTGGTATG CCGATCCGAA TGCCCGCTCG CCGGACATCC AGTTTCATCT CGGTCTCGGT
TCGGGCATCG AGGCCGGCGT CGCCCGGCTC AAGAACGCCG GCGTCACGCT CAACTCCGCC
TATCTGCATC CGCGTTCGCG CGGCACCGTG CGGCTCTCCT CCGCCGATCC GGCGGCCGCA
CCGCTGATCG ACCCGAACTA TTGGGAGGAC CCGCACGATC GCAAAATGTC GCTGGAAGGC
CTGAAAATCG CGCGCGAGAT CATGCAGCAG GCGGCATTGA AGCCCTTTGT CTTGGCTGAA
CGCTTGCCGG GAGACGAAAT CCGGACCGAG GAACAGCTCT TCGACTATGG CTGTGCCAAT
GCCAAGACCG ACCACCACCC TGTCGGGACC TGCAGGATGG GCACCGATGC TTCGGCGGTC
GTCGATCTGG AGCTCAAAGT TCGCGGCATC GACGGACTGC GTGTCTGCGA CAGTTCGGTC
ATGCCGCGGG TACCTTCCTG CAATACCAAT GGCCCGACGA TTATGATGGG CGAGAAGGGG
GCCGACATTA TCCGCAGCCT GCCGCCGCTG CCGCCTGCCG TCTTCCAGCA CGAGCGCAAC
GATATGCGGC CGCGGGCGCG GACGGAGGTT CGGTGA
 
Protein sequence
MSYDYIITGG GPAGCVLANR LSEDPAVKVL LLEAGGGDWN PLFHMPAGFA KMTKGVASWG 
WHTVPQKHMK GRVLRYTQAK VIGGGSSINA QLYTRGNAAD YDLWAGEDGC TGWDYRSVLP
YFKRAEDNQR FADDYHAYGG PLGVSMPVST LPICDAYIRA GQELGIPYNH DFNGKQQAGV
GFYQLTQRDR RRSSASLAYL SPVRDRKNLI VRTGARVARI VLEGKRAVGV EVVTGKGSEI
IRANREVLVT SGAIGSPKLL LQSGIGPADH LRSVGVEVRH DLPGVGGNLQ DHLDLFVIAE
CTGDHTYDGV ARLHRTFWAG LQYVLFRSGP VASSLFETGG FWYADPNARS PDIQFHLGLG
SGIEAGVARL KNAGVTLNSA YLHPRSRGTV RLSSADPAAA PLIDPNYWED PHDRKMSLEG
LKIAREIMQQ AALKPFVLAE RLPGDEIRTE EQLFDYGCAN AKTDHHPVGT CRMGTDASAV
VDLELKVRGI DGLRVCDSSV MPRVPSCNTN GPTIMMGEKG ADIIRSLPPL PPAVFQHERN
DMRPRARTEV R