Gene Information |
Locus tag | Rleg2_1383 |
Symbol | |
ID | 6980111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1404724 |
End bp | 1406379 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396104 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_002280903 |
Protein GI | 209548986 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
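The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Minimal sketch: cross-check the Start bp / End bp / Gene Length /
# Protein Length fields of the record. Assumes 1-based inclusive
# coordinates and one terminal stop codon (both are assumptions).

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1404724, 1406379)
print(length)                           # 1656
print(expected_protein_length(length))  # 551
```

Both values agree with the Gene Length (1656 bp) and Protein Length (551 aa) rows of the record.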
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00471975 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTCG ATTATATCAT CACAGGCGCC GGTCCCGCGG GCTGCGTTCT TGCAAGCCGG CTGAGCGAAG ATCCCGATAT CCGCGTCCTC CTGCTCGAAG CAGGCGGCGG CGACTGGAAT CCCCTCTTTC ACATGCCGGC GGGCTTCGCC AAGATGACCA AGGGCGTGGC CAGCTGGGGA TGGCAAACCG TTCCCCAGAA GCACATGAAA GACCGGGTGC TTCGCTATAC CCAGGCGAAG GTCATCGGCG GCGGCTCTTC GATCAACGCC CAGCTCTATA CGCGTGGAAA TGCGGCCGAT TACGACCTCT GGGCCAGTGA AGACGGCTGC GAGGGCTGGG ACTATCGCTC GATCCTGCCC TATTTCAAAC GCGCCGAGGA CAATCAGCGC TTCGCCGACG ACTACCACGC CTATGGCGGT CCGCTGGGGG TCTCGATGCC GGCAGCACCC CTGCCGATCT GCGACGCCTA TATCCGTGCA GGTCAGGAGC TCGGCATTCC CTATAATCAC GACTTCAACG GCCGCCAGCA GGCGGGTGTT GGATTTTATC AGCTGACGCA GCGCAATCGC CGCCGGTCCT CGGCATCGCT TGCCTATCTC TCGCCGATCA AGGAGCGGAA GAACCTGACG GTCAGGACAG GCGCGCGTGT TACGCGGATC ATTGTCGAAG GAGGCCGTGC GACAGGCGTC GAGATCGCCA CCGCGGGCGG CTCGGAGATC GTGCGCGCCG AGCGTGAGGT CCTGGTGTCG TCAGGCGCGA TCGGATCGCC GAAGCTTCTT CTGCAGTCCG GCATCGGGCC GGCCGATCAT CTGAAATCGG TGGGCGTCAA GGTGAACCAC GATCTGCCCG GTGTCGGCGG CAACCTGCAG GATCACCTGG ACCTTTTCGT CATCGCCGAA TGCACCGGCG ATCACACCTA TGACGGCGTT GCAAAGCTTC ACCGCACACT CTGGGCCGGG GTTCAATATG TCCTGTTCCG CACCGGTCCT GTGGCGTCGT CGCTCTTCGA GACCGGCGGC TTCTGGTATG CCGATCCCGA GGCTCGGTCT CCCGATATCC AGTTTCATCT CGGCTTGGGA TCGGGCATCG AAGCCGGCGT CGAGCGGCTC AAAAATGCGG GCGTGACGCT GAATTCCGCC TATCTGCATC CGCGCTCCCT CGGGACGGTG CGTCTGTCAT CCGCCGACCC GGCTGCCGCG CCCCTGATCG ATCCGAACTA CTGGTCCGAT CCGCACGACA GGCAGATGTC GCTGGAGGGG CTCAAGATTG CCCGCGAGAT CATGCAGCAG GCGGCGCTGA AGCCCTTTGT CATGGCCGAA AGGCTTCCCG GGCCGAAGGT CATGACCGAC GAGCAGCTGT TCGACTATGG CTGCGCCAAT GCAAAGACCG ACCATCATCC TGTGGGCACC TGCAAGATGG GAACGGGGCC CGACGCCGTT GTCGGGCTGG ATCTCAAGGT TCATGGCCTC GAAGGCCTCC GCGTCTGCGA TTCCTCGGTC ATGCCGCGCG TGCCGTCCTG CAATACCAAC GCGCCGACGA TCATGGTCGG CGAAAAGGGA TCGGACCTGA TCAGGGGCTT GCCTGCGCTG CCGTCGGCAA TCTTTGCCTA TGAACGCAAC GATGTCAGGC CTCGGGCCCG CGCCGAAATC CGCTAA
Protein sequence | MGFDYIITGA GPAGCVLASR LSEDPDIRVL LLEAGGGDWN PLFHMPAGFA KMTKGVASWG WQTVPQKHMK DRVLRYTQAK VIGGGSSINA QLYTRGNAAD YDLWASEDGC EGWDYRSILP YFKRAEDNQR FADDYHAYGG PLGVSMPAAP LPICDAYIRA GQELGIPYNH DFNGRQQAGV GFYQLTQRNR RRSSASLAYL SPIKERKNLT VRTGARVTRI IVEGGRATGV EIATAGGSEI VRAEREVLVS SGAIGSPKLL LQSGIGPADH LKSVGVKVNH DLPGVGGNLQ DHLDLFVIAE CTGDHTYDGV AKLHRTLWAG VQYVLFRTGP VASSLFETGG FWYADPEARS PDIQFHLGLG SGIEAGVERL KNAGVTLNSA YLHPRSLGTV RLSSADPAAA PLIDPNYWSD PHDRQMSLEG LKIAREIMQQ AALKPFVMAE RLPGPKVMTD EQLFDYGCAN AKTDHHPVGT CKMGTGPDAV VGLDLKVHGL EGLRVCDSSV MPRVPSCNTN APTIMVGEKG SDLIRGLPAL PSAIFAYERN DVRPRARAEI R
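The relationship between the two sequence rows can be illustrated with a short translation sketch. It assumes the space-separated 10-character blocks are purely a display convention, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Minimal sketch: translate a gene sequence as displayed in the record
# and compute its GC fraction. Not a replacement for a full library.

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence above.
print(translate("ATGGGCTTCG ATTATATCAT CACAGGCGCC"))  # MGFDYIITGA
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row, and `gc_fraction` on the same input can be compared against the GC content field.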