Gene Information |
Locus tag | Rleg_0444 |
Symbol | |
ID | 8011644 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 460089 |
End bp | 461999 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644823038 |
Product | Transketolase central region |
Protein accession | YP_002974292 |
Protein GI | 241203196 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3958] Transketolase, C-terminal subunit; [COG3959] Transketolase, N-terminal subunit |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
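Note: the coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes a single 3-nt stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 460089
end_bp = 461999

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon (3 nt) that is not translated.
protein_length = (gene_length - 3) // 3

print(gene_length, protein_length)  # 1911 636
```

Under these assumptions the result agrees with the Gene Length (1911 bp) and Protein Length (636 aa) fields.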
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGATC TCGTAGGACA AGCGGTAGAA GGCACCCTTT ATTACCTGCC ATATAAGGAA TTCCAGCGGG TACGTGCGAT CAACGCGCCT CGCGAGCAGA GAGCAGCTCT ATTTTCTGAC ATGTGTCGCC TGAACGCGCT CTACATGATC GGTCGGGCCG GCTCCGGTCA TATCGGCAGC AGTTTCAGCA GTCTCGATAT CGTCAGCTGG CTTCTCCTCG AAGGAATGAC GGGCGATGAT GTCTATTTCA GCTCCAAGGG GCACGATGCG CCAGGCTATT ACGCGGCACT GATTGGCGCC GGTAAGCTCG ATTTCGAGCT TACCCACAAA CTTCGCCAGA TTGATGGTCT GCCCGGCCAT CCCGACGTCG GCACCCCGGG CATGGTGACA AATACGGGCT CGCTCGGCAT GGGCGTCTCC AAGGCCAAGG GCATGGTGAT TGCGAACAGG CTGAACAACC GCCCGGGGCG CATCTTCGTG ATGACGGGCG ACGGCGAGCT TCAGGAAGGC CAGTTCTGGG AATCCCTGGT CTCGGCAGCC AATTCTGGGC TGCAGGAGAT TACGGTCATC ATCGATCACA ACAAGCTGCA GTCGGACACT TTCGTGAAGA ACGTGTCTGA CCTTGGTGAC CTTGAGGCAA AGCTCAGGGC GTTTGGATGG TGTGTCGCTC GCTGTGATGG AAACGATATC GCTGCTTTTG CCGCCACCCT GGCCGCCATC CAAGACGATC CGCGTCCGAA GGTTATCGTT GCCGATACCG TCAAGGGCAA GGGCGTCTCC TTCATGGAGC ACACGTCACT GGATTCCGAT GTGGCCATGT ACCGCTTCCA CAGCGGCGCG CCTGATGCGA GTAATTATCG CGCCGCCGCG CAGGAAATCA TGGATCGTCT GCAAGCCAAT CTGAGCAGCG CCGGTATCAC CGAGCTGGAG TTTGAAACGC TCGAGCGGCC TGCATCCGCG GCCCCGTCGG AAAAGGCTCA GCGCCTGGTT GCCGCCTACA GCAAGGCTCT GATCGCACAG GCTAAGAAGC ATTCGAACCT CGTTGCACTT GACGCAGACC TGGTGCTGGA TACCGGCCTC ATCCCATTCC GCGAGCAGTT TCCCGATCGA TTTGTCGAGT GCGGTATCGC AGAACAGGAT ATGGTTTCTA CAGCCGGCGG CATGGCGCTG AACGGCCTGC TGCCGATCGT CCACTCATTC GCGTGCTTCC TGTCGACGCG GCCAAATGAG CAGATCTACA ACAACGCCAC GGAAAAGACG AAGATTATAT ATGTCGGCTC CCTTGCAGGC GTCGTTCCCG GCGGCCCGGG CCACTCGCAT CAATCCGTTC GAGACATCTC TGCGCTGGCA GCCATGCCTG GAATGACGCT GGTCGAGCCG TCTTGCGATG CGGAAGTTGG TCTGCTCCTG GATTGGTGCG TGAACGAAGC TCCTGGCAGC AGCTACATTC GGCTGATTTC ACTACCTTGG GAAATCCCGT ATTCACTTCC GGCCGACTAC CGGCCAACAA ACGGCCGAGG ACTCACTTTG GCTGAAGGAG AGGATGTCGC CATTATCGCT TACGGACCGG TGCTCCTGAG CAATGCGATC GCCGCCTCAA AAGTGCTCGC TGAGAAGCAC GGTATCAGTG CCAAGGTCAT CAACCTACCT TGGCTAAATC ACGTGGATGC GGAGTGGCTG CAATCGACGG TTTCAGCTTG CAAGGCAATT GTTTGCCTCG ACAACCACTA TGTCATCGGT GGTCAAGGCG ATACGATCGC GCGTGCGCTT GCCGAAGCCG GAACCGGAAT TCCCGTCAAG 
CACATCGGGA TCACAGGGGT TCCTCCTAGC GGAACCAACG TCCAAGTCCT CGGCGCAGTT GGCCTCGATG CCTCGGCGAT CGCCGAGACG GTCGCGTCGG TAATCGGTTG A
|
Protein sequence | MGDLVGQAVE GTLYYLPYKE FQRVRAINAP REQRAALFSD MCRLNALYMI GRAGSGHIGS SFSSLDIVSW LLLEGMTGDD VYFSSKGHDA PGYYAALIGA GKLDFELTHK LRQIDGLPGH PDVGTPGMVT NTGSLGMGVS KAKGMVIANR LNNRPGRIFV MTGDGELQEG QFWESLVSAA NSGLQEITVI IDHNKLQSDT FVKNVSDLGD LEAKLRAFGW CVARCDGNDI AAFAATLAAI QDDPRPKVIV ADTVKGKGVS FMEHTSLDSD VAMYRFHSGA PDASNYRAAA QEIMDRLQAN LSSAGITELE FETLERPASA APSEKAQRLV AAYSKALIAQ AKKHSNLVAL DADLVLDTGL IPFREQFPDR FVECGIAEQD MVSTAGGMAL NGLLPIVHSF ACFLSTRPNE QIYNNATEKT KIIYVGSLAG VVPGGPGHSH QSVRDISALA AMPGMTLVEP SCDAEVGLLL DWCVNEAPGS SYIRLISLPW EIPYSLPADY RPTNGRGLTL AEGEDVAIIA YGPVLLSNAI AASKVLAEKH GISAKVINLP WLNHVDAEWL QSTVSACKAI VCLDNHYVIG GQGDTIARAL AEAGTGIPVK HIGITGVPPS GTNVQVLGAV GLDASAIAET VASVIG
|
| |