Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3164 |
Symbol | |
ID | 5324043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3324751 |
End bp | 3326406 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640792112 |
Product | choline dehydrogenase |
Protein accession | YP_001328823 |
Protein GI | 150398356 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.637341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTATG ACTATATCAT TACCGGCGGA GGTCCTGCGG GCTGCGTTCT CGCAAACCGC CTGAGCGAGG ACCCGGCCGT CAAGGTGCTG CTTCTCGAGG CAGGCGGTGG CGATTGGAAC CCGCTTTTCC ACATGCCGGC CGGTTTCGCC AAGATGACGA AGGGAGTCGC AAGCTGGGGC TGGCACACAG TGCCGCAGAA GCACATGAAG GGCAGGGTGC TGCGCTATAC GCAGGCCAAG GTGATCGGCG GCGGTTCCTC GATCAATGCG CAGCTTTATA CGCGCGGCAA CGCGGCGGAT TACGACCTCT GGGCTGGCGA GGATGGCTGC ACCGGCTGGG ATTATCGCAG CGTGCTGCCC TATTTCAAGC GCGCGGAAGA CAACCAGCGT TTCGCCGACG ATTATCACGC CTATGGCGGG CCGCTCGGCG TCTCCATGCC GGTTTCGACG CTGCCGATCT GCGACGCCTA TATCCGCGCG GGGCAGGAGC TCGGCATTCC CTACAATCAC GATTTCAACG GCAAGCAGCA GGCGGGTGTC GGCTTTTATC AACTGACCCA GCGTGATCGC CGCCGTTCCT CCGCTTCGCT GGCCTACCTT TCTCCGGTCA GGGACCGGAA AAACCTCATT GTGCGCACCG GCGCTCGCGT AGCCCGCATC GTCCTCGAAG GAAAACGCGC GGTCGGTGTA GAGGTGGTAA CGGGGAAAGG CAGCGAAATC ATCCGTGCGA ACCGGGAAGT GCTGGTCACC TCCGGCGCGA TCGGCTCGCC CAAGCTGCTC CTTCAGTCCG GCATCGGCCC GGCCGATCAT CTGCGCTCCG TCGGCGTCGA GGTGCGGCAC GATCTCCCCG GCGTTGGCGG GAACCTCCAG GATCACCTCG ATCTCTTCGT CATTGCCGAA TGCACCGGCG ATCACACCTA TGACGGCGTC GCCCGGCTGC ACCGCACTTT CTGGGCCGGC CTGCAATATG TGCTCTTCCG CTCCGGCCCG GTGGCCTCGT CGCTCTTCGA GACCGGCGGC TTCTGGTATG CCGATCCGAA TGCCCGCTCG CCGGACATCC AGTTTCATCT CGGTCTCGGT TCGGGCATCG AGGCCGGCGT CGCCCGGCTC AAGAACGCCG GCGTCACGCT CAACTCCGCC TATCTGCATC CGCGTTCGCG CGGCACCGTG CGGCTCTCCT CCGCCGATCC GGCGGCCGCA CCGCTGATCG ACCCGAACTA TTGGGAGGAC CCGCACGATC GCAAAATGTC GCTGGAAGGC CTGAAAATCG CGCGCGAGAT CATGCAGCAG GCGGCATTGA AGCCCTTTGT CTTGGCTGAA CGCTTGCCGG GAGACGAAAT CCGGACCGAG GAACAGCTCT TCGACTATGG CTGTGCCAAT GCCAAGACCG ACCACCACCC TGTCGGGACC TGCAGGATGG GCACCGATGC TTCGGCGGTC GTCGATCTGG AGCTCAAAGT TCGCGGCATC GACGGACTGC GTGTCTGCGA CAGTTCGGTC ATGCCGCGGG TACCTTCCTG CAATACCAAT GGCCCGACGA TTATGATGGG CGAGAAGGGG GCCGACATTA TCCGCAGCCT GCCGCCGCTG CCGCCTGCCG TCTTCCAGCA CGAGCGCAAC GATATGCGGC CGCGGGCGCG GACGGAGGTT CGGTGA
|
Protein sequence | MSYDYIITGG GPAGCVLANR LSEDPAVKVL LLEAGGGDWN PLFHMPAGFA KMTKGVASWG WHTVPQKHMK GRVLRYTQAK VIGGGSSINA QLYTRGNAAD YDLWAGEDGC TGWDYRSVLP YFKRAEDNQR FADDYHAYGG PLGVSMPVST LPICDAYIRA GQELGIPYNH DFNGKQQAGV GFYQLTQRDR RRSSASLAYL SPVRDRKNLI VRTGARVARI VLEGKRAVGV EVVTGKGSEI IRANREVLVT SGAIGSPKLL LQSGIGPADH LRSVGVEVRH DLPGVGGNLQ DHLDLFVIAE CTGDHTYDGV ARLHRTFWAG LQYVLFRSGP VASSLFETGG FWYADPNARS PDIQFHLGLG SGIEAGVARL KNAGVTLNSA YLHPRSRGTV RLSSADPAAA PLIDPNYWED PHDRKMSLEG LKIAREIMQQ AALKPFVLAE RLPGDEIRTE EQLFDYGCAN AKTDHHPVGT CRMGTDASAV VDLELKVRGI DGLRVCDSSV MPRVPSCNTN GPTIMMGEKG ADIIRSLPPL PPAVFQHERN DMRPRARTEV R
|
| |