Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5162 |
Symbol | |
ID | 8007058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 566463 |
End bp | 568088 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644822072 |
Product | Choline dehydrogenase |
Protein accession | YP_002973332 |
Protein GI | 241113497 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0769143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATTC GTGTAAATGA TGCAGCCATT GATGCGGGAA GCTACGACAT CATCATCGTC GGCGCGGGAT CGGCGGGATG CGTTCTGGCC AACCGGCTAT CGGCCGATCC GAAAACCCGC GTCCTGCTGC TCGAAGCCGG CGGCAGCGAC CGGTACCATT GGGTGCATGT ACCGATCGGC TATCTCTACT GCATGGGCAA TCCGCGCACC GATTGGATGA TGAGGACGGC GGCGGAGGCC GGACTGAACG GTCGGTCACT GCCCTATCCG CGTGGAAAAG TGCTGGGCGG CTGTTCGTCG ATCAACGGCA TGATCTACAT GCGCGGCCAG GCGGCCGATT ACGACGGCTG GCGGCAGGCG GGTAATTCCG GCTGGGGCTG GGACGACGTG CTGCCCTATT TCCTGAAATC CGAGGACAAC TATCGTGGCC AGTCGCCAAT GCACGGGGCA GGCGGCGAAT GGCGGGTGGA AAAGCAGCGG CTGTCCTGGC CGATCCTCGA TGCCTTTCGC GATGCCGCCG AAGAACTCGG CATCCCGAAG ACCGACGATT TTAACGATGG CGACAATGAA GGCTCGGGCT ATTTCGAGGT CAACCAGCGC GGCGGCCTGC GCTGGAACAC GACCAAGGCC TTCTTGCGCC CGGCGATGAA GCGGCCCAAT CTGCGTGTGC TGACCGGCGC CGAAACCGAG CGGCTGGAAT TCGAGGGCAG AATGGTGACC GGCGTGCGGT TCCGGCTGAA CGGACGGAGT CATCTGGCTC GCGCCGGTCG CGAGGTCATT CTGTCTGCCG GTGCGATCAA TTCGCCGAAA ATCCTTGAGC TTTCCGGGAT CGGTCGACCG GATGTGTTGT CGGCTGCTGG GCTGGACGTC GTCCACGAAC TTCCAGGTGT CGGCGAAAAC CTGCAGGATC ATCTGCAAAT CCGCACGGTC TTCCGCATCG AGGGCGCAAA GACGCTGAAC CAGCTCTATC ACAACCTGTT CACCCGCGCG GGCATGGGGC TTGAATATAT GCTGCGCCGG TCCGGGCCTC TGTCGATGGC GCCGAGCCAG CTCGGTATCT TTGCCAAGAG CGATCCGGCT GTTGCGACCG CCGATCTCGA ATATCACGTG CAGCCCTTGA GCACCGACCG GCTCGGCGAG CCGCTGCACA AATATCCCGC CGTCACCGTC TCCGTCTGCA ATCTGCGGCC GGAGAGCCGG GGGAGCGTGC ATGTTAGCGG CCCGAACCTT TCGGTGGCGC CGGAAATACG CCCGAATTAT CTTTCGACCG TCGGCGACCG GATGGTTGCG ACGAAATCGA TCCGGCACGC CCGCCGGCTC ATGGAGGCCG GTGCCATCGC CAAGTACCGG CCGCAGGAGA TGTTGCCGGG CACGGAATAC CGGACCGACG AGGACCTGAT CCGTCGTGTC GGCGATATCG CCACGACGAT CTTCCATCCG GTCGGCACCT GCAAGATGGG CAGCGACACG ATGGCGGTTG TTGATTCGCA ATTGCGGGTG CATGGGCTGG CGAAACTGCG GGTGGTCGAT GCCTCGATCA TGCCGACAAT CGTGTCGGGC AATACCAACT CGCCGGTGAT CATGATTGCC GAGAAGGCCG CGGAAAGCAT TCTATCAGGG GTGTGA
|
Protein sequence | MNIRVNDAAI DAGSYDIIIV GAGSAGCVLA NRLSADPKTR VLLLEAGGSD RYHWVHVPIG YLYCMGNPRT DWMMRTAAEA GLNGRSLPYP RGKVLGGCSS INGMIYMRGQ AADYDGWRQA GNSGWGWDDV LPYFLKSEDN YRGQSPMHGA GGEWRVEKQR LSWPILDAFR DAAEELGIPK TDDFNDGDNE GSGYFEVNQR GGLRWNTTKA FLRPAMKRPN LRVLTGAETE RLEFEGRMVT GVRFRLNGRS HLARAGREVI LSAGAINSPK ILELSGIGRP DVLSAAGLDV VHELPGVGEN LQDHLQIRTV FRIEGAKTLN QLYHNLFTRA GMGLEYMLRR SGPLSMAPSQ LGIFAKSDPA VATADLEYHV QPLSTDRLGE PLHKYPAVTV SVCNLRPESR GSVHVSGPNL SVAPEIRPNY LSTVGDRMVA TKSIRHARRL MEAGAIAKYR PQEMLPGTEY RTDEDLIRRV GDIATTIFHP VGTCKMGSDT MAVVDSQLRV HGLAKLRVVD ASIMPTIVSG NTNSPVIMIA EKAAESILSG V
|
| |