Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0889 |
Symbol | |
ID | 8015494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 879322 |
End bp | 880971 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823474 |
Product | choline dehydrogenase |
Protein accession | YP_002974725 |
Protein GI | 241203629 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCAG ATTTCGTCAT CATCGGCTCG GGCTCCGCCG GCTCCGCCCT CGCCTACCGC TTGTCGGAAG ACGGCAAGAA CAGTGTCATC GTCATCGAGG CCGGTGGCAG CGATTTCGGG CCGTTCATCC AGATGCCGGC AGCCCTTGCC TGGCCGATGA GCATGAAGAG ATATAATTGG GGTTATCTGT CCGAGCCCGA GGCGAACCTC AACAACCGGC GCATCACCGC GCCGCGCGGC AAGGTGATCG GCGGCTCCTC GTCGATCAAC GGCATGGTCT ATGTGCGCGG CCATGCCGAG GACTTCAACC GCTGGGAGGA GCTCGGCGCC AGCGGCTGGG CCTATGCCGA TGTACTTCCC TATTTCAAGC GGATGGAACA TTCGCATGGC GGCGAAGAGG GCTGGCGCGG CACCGATGGG CCGCTGCATG TCCAGCGCGG CGGCTTCACC AATCCGCTCT TCCGCGCCTT CGTCGAGGCC GGCAAACAGG CGGGCTTCGA GACGACGGAG GATTACAACG GCAGCAAGCA GGAAGGCTTC GGTCTGATGG AGCAGACCAT CTTCGGCGGC CGCCGCTGGT CTGCCGCCAA CGCCTATCTG AGACCGGCGC TGAAGCGTGA CAATGTCAGG ATCGTCTATG GCTTTGCGCA GAAGATCGTG ATCGAGGACG GGCGGGCGAC CGGCGTCGAG ATTGAACGCA ACGGCAGGAT CGAGGTGCTG AAGGCGAACC GCGAGGTGAT CGTCTCGGCC TCCTCTTTCA ATTCGCCGAA GCTCTTGATG CTGTCGGGCA TCGGTCCCGG CCAACATCTG CAGGACATGG GCATTACGGT GAAGGCCGAC CGGCCGGGCG TCGGCGCCAA CTTGCAGGAC CATATGGAAT TCTACTTCCA GCAGGTGAGC ACCAAGCCGG TGTCGCTCTA TTCCTGGCTG CCGTGGTTCT GGCAGGGGGT GGCGGGCGCC CAATGGCTGC TCTCGCGCGG CGGGCTCGGC GCCTCCAACC AGTTCGAGGC CTGCGCCTTC CTGCGCTCGG CGCCGGGGCT GAAGCAGCCC GACATCCAGT ATCATTTCCT GCCGGTGGCG ATCAGCTATG ACGGCAAGGC GGCCGCGAAA AGCCACGGCT TCCAGGTTCA TGTCGGCTAT AACCTGTCGA AATCGCGCGG CAGCGTGAGC TTGCGCTCCG CCGATCCCAA GGCCGACCCG GTGCTGCGCT TCAACTATAT GAGCCATGCC GAGGATTGGG AGAAATTCCG CCACTGCGTG CGCCTCACCC GCGAAATCTT CGGGCAGACG GCCTTCAACG ACTATCGCGG CCCGGAGATC CAGCCGGGCG AAAGCGTGCA AAGCGACGAA GAGATCGACG CCTTCCTGCG CGAACATCTG GAAAGCGCCT ATCACCCCTG CGGCACCTGC CGGATGGGCG CCAAGGACGA TCCGATGGCG GTGGTCGATC CGCAAACGCG GGTGATCGGC ATCGATGGCC TGCGCGTCGC CGACAGCTCG ATCTTCCCGC ACGTCACTTA TGGCAATTTG AACGGCCCCT CGATCATGAC CGGAGAGAAG GCCGCCGACC ATATCCTCGG CAAACAACCG CTGGCGCGTT CGAACCAGGA ACCCTGGGTC AACCCGCGCG CGGCCGTCAG CGATCGATAA
|
Protein sequence | MQADFVIIGS GSAGSALAYR LSEDGKNSVI VIEAGGSDFG PFIQMPAALA WPMSMKRYNW GYLSEPEANL NNRRITAPRG KVIGGSSSIN GMVYVRGHAE DFNRWEELGA SGWAYADVLP YFKRMEHSHG GEEGWRGTDG PLHVQRGGFT NPLFRAFVEA GKQAGFETTE DYNGSKQEGF GLMEQTIFGG RRWSAANAYL RPALKRDNVR IVYGFAQKIV IEDGRATGVE IERNGRIEVL KANREVIVSA SSFNSPKLLM LSGIGPGQHL QDMGITVKAD RPGVGANLQD HMEFYFQQVS TKPVSLYSWL PWFWQGVAGA QWLLSRGGLG ASNQFEACAF LRSAPGLKQP DIQYHFLPVA ISYDGKAAAK SHGFQVHVGY NLSKSRGSVS LRSADPKADP VLRFNYMSHA EDWEKFRHCV RLTREIFGQT AFNDYRGPEI QPGESVQSDE EIDAFLREHL ESAYHPCGTC RMGAKDDPMA VVDPQTRVIG IDGLRVADSS IFPHVTYGNL NGPSIMTGEK AADHILGKQP LARSNQEPWV NPRAAVSDR
|
| |