Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4456 |
Symbol | |
ID | 6977550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 88131 |
End bp | 89726 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643393634 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_002278452 |
Protein GI | 209546534 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.691807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0210476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACTT ACGATTACAT CATCGTCGGA GGCGGATCCT CGGGTTGCGT GTTGGCGAAC CGGCTGTCGG AAAATGCGGC GCACAAGGTG CTGCTGATCG AGTCCGGCCG CCGCGACGCC GATCCGTGGA TCCATATACC CGCGACCTTC TTCAAGGTGC TGGGCAAGGG GATCGATATC CATCCCTATG CATCGAACCC CGACAAAGGG CTGAACGGGC GGCCGAGCAT CGTGCCGCAG GGCAATGTGC TTGGCGGCGG CAGCTCGGTG AATGCGATGA TCTATATTCG CGGCCATCGC AACGACTACG ACACCTGGTC GCAGATGGGC TGCCAGGGCT GGTCCTATGA CGACGTGCTT CCGGCCTTCC GCTCGCTGGA GCACAACGAG CGGCTGAACG GCCAATTCCA CGGTAGGAAG GGCGGCCTGC ACGTCTCCGA CCCCCGCCAT CGGCATCCGC TGAGCGAGGC CTTCGTCCAG GCGGCGACCG AAATCGGCAT TCCCCAGAAC GATGATTTCA ACGGCGCCGA TCAGGCCGGT GTCGGCTTTT ATCAGAGCAC CACCCATGGC GGCCGGCGCT GGAGTTCGGC CCAGGCTTTC CTGCGCGAAG CGGAGAAACG GCCGAACCTG ACGGTGCTCA CCGAGCGCAA GGTCGCCCGC ATCCTATTCG AAGGACAGAA GGCCGTCGGC GTCGAGCTTC TCGACGGCAC GACATTCAAG GCGTCGCGCG AAATCGCCCT CACAGCAGGC GCGATCGCGA CGCCGAAGAT CCTGCAACAC TCCGGTATCG GCGACGGCGC GCATCTTTCC TCGCTCGGCA TCAAGGTCGT CGCCGATCTG CCGGGCGTCG GCGCGAACTA CCAGGACCAT CTGGAAGTGC CGGTGCAGGG CGAGACGCGC GAGCCGATTT CGATCCTCGG CCACGATACG GGGCTGCGGG CCGTCGGCCA TATGCTGCGC TATCTCACCT CCCGCCGCGG ATTGCTGGCA TCTAACGTCG TCGAATGCGG CGGCTTCGTC GATACGGCGG GCACGGGGCA GCCGGACGTT CAGTTCCATG TCCTGCCGGT GCTGATCGGC TTTGTCGATC GCGAACCGGA GCCCGGCCAC GGCCTCAGCA TCGGGCCGTG TTACCTGCGG CCGCGGTCGC GCGGCTGGAT CAGGCTGAAG AGCGCCGATC CCAGCGAGCA GACGGATTTC AACGCCAATC TGCTCTCCGA TCCCGCCGAT ATCGAAACCC TGGTGCGCGG CGTCGAGACG GCGATCCGCA TCCTCGATGC CCCGGCGCTT GCCAAGCTGG TCAAGCGCCG CGTGCTGCCG AAGCCCGGCG TCGAAAAGGA TCCGGAGGCT CTGCGCGATT ACATCCGTCA ATCAGCCAAA ACAGTGTTCC ATCCGGCGGG AACGGCGCGC ATGGGCCGCG CTGACGATCG CATGGCGGTC GTCGGACCGG ACCTCAAGGT GCGCGGCGTC GAGGGGCTGC GCGTCTGCGA CGCTTCGGTA ATGCCGACGC TGGTGTCGGG CAATACCAAC GCGCCGACGA TGATGATTGC GGCAAAGGCC GGTGCGTTCA TGACAGCGAA GGCGATTTCC GGCTGA
|
Protein sequence | MTTYDYIIVG GGSSGCVLAN RLSENAAHKV LLIESGRRDA DPWIHIPATF FKVLGKGIDI HPYASNPDKG LNGRPSIVPQ GNVLGGGSSV NAMIYIRGHR NDYDTWSQMG CQGWSYDDVL PAFRSLEHNE RLNGQFHGRK GGLHVSDPRH RHPLSEAFVQ AATEIGIPQN DDFNGADQAG VGFYQSTTHG GRRWSSAQAF LREAEKRPNL TVLTERKVAR ILFEGQKAVG VELLDGTTFK ASREIALTAG AIATPKILQH SGIGDGAHLS SLGIKVVADL PGVGANYQDH LEVPVQGETR EPISILGHDT GLRAVGHMLR YLTSRRGLLA SNVVECGGFV DTAGTGQPDV QFHVLPVLIG FVDREPEPGH GLSIGPCYLR PRSRGWIRLK SADPSEQTDF NANLLSDPAD IETLVRGVET AIRILDAPAL AKLVKRRVLP KPGVEKDPEA LRDYIRQSAK TVFHPAGTAR MGRADDRMAV VGPDLKVRGV EGLRVCDASV MPTLVSGNTN APTMMIAAKA GAFMTAKAIS G
|
| |