Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4996 |
Symbol | |
ID | 6978090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 640672 |
End bp | 642366 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643394142 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_002278960 |
Protein GI | 209547042 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.833385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTCG ATCATCTGAT CCTTGGGGGA GGCTCGGCGG GCTGCGTGCT GGCGGCTCGG CTATCGGCCG ATCCGAACCG GAGGGTCGTT CTGGTCGAGG CGGGACGCAA CATTGCGCCC GACGACATCC CAGGCGATAT CCGCAGTCGT TATCCCGGCC GGGCCTATCT CGACACGCGC AATATCTGGT CGCAACTGAC GGCGCTGATG GGTTATGCCC GCTCCAACGT GGCACCGCGT TCCTCGCGCA GATACGAGCA GGCGCGCCTA CTGGGTGGCG GCTCGGCGAT CAATGCGCTT ATGGCCAATC GCGGCGCGCC GGCCGATTAT GCCGAATGGG AGGCTCTGGG CGCGGACGGC TGGGGCTGGG ACGAATGTCT GCCCTATTTC CGCAAGATCG AGTCCGACCG CGATTTCAAT GGCCCGCTGC ATGGACAGGA TGGACCGCTG ACGATCCGCC GCATTTCGGA CGAGAAGATT TCACCCTTCG TGGATCGCAC AATGAAGGCG CTCGACCGGC GCGGTCATCC GATCCGGCAG GATCAGAATG GTGTCTGGGA AGACGGCGCC TTCCGGGCCG CCATCGCCGT CAGCGACCGC GGTGAACGGC TGCCGACCTC GCTCGCCTAT CTGACATCCG AGGTCCGCAG ACGGCCCAAT CTGCGTATCG TGACCGAGAG CGTTGCCATT CGGATCCTGT TCGATGGTCG CCGCGCCACG GGGGCGACCC TTTCGGGTGC TGCGGGAGAG ACGATTCACG CGGCGGAAGT GATCGTTTCT GCCGGCGCCA TCCATACGCC GGCGCTGCTG CTGCGATCCG GTATCGGTCC GGCCGGCGAT CTCAATTCTA CCGGGGTGAC GATTGTGGCC GGGTGCGAAG GCGTCGGCCG CAACCTGATG GAACATCCCT CGATCGCAGT CGCAGCCTAT CTGCCGCCGC ATATGCGGGT GCGCGATCCG GCCGAGCACC ACGAACAGGC GATCTGGCGT TTCTCATCCG GCCTCGTTGG CACGCCGCAG GGCGACATGC ATGGCTCCAT CCTGTCGCGC TCGGGTTGGC ATTCGGTCGG CATGCGGCTC GGTAGCCTGT TCTTCTGGGT CAACAAATCC TATTCGCGCG GCGTCGTGCG GCTCGCCTCG GCCATCCCGC AGGCCGAGCC GGACGTCGAT TTCCGCATGT TGAGCGACGA GCGAGATCTC AACCGGCTCA AGCTCGCACT GCGAATGGGA GCAGAGGCTT TGTTGGACCC GTTCCTGGAT GGCCATCGCG GCACGGTCTT TCCATCGAGC TATTCGCCGC GCGTCGCGAA AGTTGCCGTA CCGGGCGCAT GGAACGCCCT GCAACGCGGC ATCCTGTCCG GCATGCTCGA CGTGGCCGGA CCGCTGCGCG CAGCGCTGAT ACATTCGGCG ATCACTCTGG GCACGACGAT GGACGGACTT TTGGCGGACG ATGTGGCGCT GACGGAATTC GTCCGCCGCC ATGTCGGCGG CACCTGGCAT GCATCCGGCA CCTGCCGGAT GGGTTCGATA GACGACCCGA TGGCGGTCAC GTCGCCGACC GGGCGCGTCC ATGGCGTCGA GGGATTGCGC GTCTGTGATG CCTCTTTGAT GCCATCCATT CCCTGCGCCA ATACGAACAT ACCCACGATC ATGATTGCCG AGCGTATCGC CGATTTCATC TTGGGCGGCC GCTGA
|
Protein sequence | MIVDHLILGG GSAGCVLAAR LSADPNRRVV LVEAGRNIAP DDIPGDIRSR YPGRAYLDTR NIWSQLTALM GYARSNVAPR SSRRYEQARL LGGGSAINAL MANRGAPADY AEWEALGADG WGWDECLPYF RKIESDRDFN GPLHGQDGPL TIRRISDEKI SPFVDRTMKA LDRRGHPIRQ DQNGVWEDGA FRAAIAVSDR GERLPTSLAY LTSEVRRRPN LRIVTESVAI RILFDGRRAT GATLSGAAGE TIHAAEVIVS AGAIHTPALL LRSGIGPAGD LNSTGVTIVA GCEGVGRNLM EHPSIAVAAY LPPHMRVRDP AEHHEQAIWR FSSGLVGTPQ GDMHGSILSR SGWHSVGMRL GSLFFWVNKS YSRGVVRLAS AIPQAEPDVD FRMLSDERDL NRLKLALRMG AEALLDPFLD GHRGTVFPSS YSPRVAKVAV PGAWNALQRG ILSGMLDVAG PLRAALIHSA ITLGTTMDGL LADDVALTEF VRRHVGGTWH ASGTCRMGSI DDPMAVTSPT GRVHGVEGLR VCDASLMPSI PCANTNIPTI MIAERIADFI LGGR
|
| |