Gene Rleg_0444 details


Gene Information

Locus tag: Rleg_0444
Symbol:
ID: 8011644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 460089
End bp: 461999
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 58%
IMG OID: 644823038
Product: Transketolase central region
Protein accession: YP_002974292
Protein GI: 241203196
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3958] Transketolase, C-terminal subunit
        [COG3959] Transketolase, N-terminal subunit
TIGRFAM ID: [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase
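
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Rleg_0444.
length = gene_length_bp(460089, 461999)
print(length)                     # 1911
print(protein_length_aa(length))  # 636
```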


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGATC TCGTAGGACA AGCGGTAGAA GGCACCCTTT ATTACCTGCC ATATAAGGAA 
TTCCAGCGGG TACGTGCGAT CAACGCGCCT CGCGAGCAGA GAGCAGCTCT ATTTTCTGAC
ATGTGTCGCC TGAACGCGCT CTACATGATC GGTCGGGCCG GCTCCGGTCA TATCGGCAGC
AGTTTCAGCA GTCTCGATAT CGTCAGCTGG CTTCTCCTCG AAGGAATGAC GGGCGATGAT
GTCTATTTCA GCTCCAAGGG GCACGATGCG CCAGGCTATT ACGCGGCACT GATTGGCGCC
GGTAAGCTCG ATTTCGAGCT TACCCACAAA CTTCGCCAGA TTGATGGTCT GCCCGGCCAT
CCCGACGTCG GCACCCCGGG CATGGTGACA AATACGGGCT CGCTCGGCAT GGGCGTCTCC
AAGGCCAAGG GCATGGTGAT TGCGAACAGG CTGAACAACC GCCCGGGGCG CATCTTCGTG
ATGACGGGCG ACGGCGAGCT TCAGGAAGGC CAGTTCTGGG AATCCCTGGT CTCGGCAGCC
AATTCTGGGC TGCAGGAGAT TACGGTCATC ATCGATCACA ACAAGCTGCA GTCGGACACT
TTCGTGAAGA ACGTGTCTGA CCTTGGTGAC CTTGAGGCAA AGCTCAGGGC GTTTGGATGG
TGTGTCGCTC GCTGTGATGG AAACGATATC GCTGCTTTTG CCGCCACCCT GGCCGCCATC
CAAGACGATC CGCGTCCGAA GGTTATCGTT GCCGATACCG TCAAGGGCAA GGGCGTCTCC
TTCATGGAGC ACACGTCACT GGATTCCGAT GTGGCCATGT ACCGCTTCCA CAGCGGCGCG
CCTGATGCGA GTAATTATCG CGCCGCCGCG CAGGAAATCA TGGATCGTCT GCAAGCCAAT
CTGAGCAGCG CCGGTATCAC CGAGCTGGAG TTTGAAACGC TCGAGCGGCC TGCATCCGCG
GCCCCGTCGG AAAAGGCTCA GCGCCTGGTT GCCGCCTACA GCAAGGCTCT GATCGCACAG
GCTAAGAAGC ATTCGAACCT CGTTGCACTT GACGCAGACC TGGTGCTGGA TACCGGCCTC
ATCCCATTCC GCGAGCAGTT TCCCGATCGA TTTGTCGAGT GCGGTATCGC AGAACAGGAT
ATGGTTTCTA CAGCCGGCGG CATGGCGCTG AACGGCCTGC TGCCGATCGT CCACTCATTC
GCGTGCTTCC TGTCGACGCG GCCAAATGAG CAGATCTACA ACAACGCCAC GGAAAAGACG
AAGATTATAT ATGTCGGCTC CCTTGCAGGC GTCGTTCCCG GCGGCCCGGG CCACTCGCAT
CAATCCGTTC GAGACATCTC TGCGCTGGCA GCCATGCCTG GAATGACGCT GGTCGAGCCG
TCTTGCGATG CGGAAGTTGG TCTGCTCCTG GATTGGTGCG TGAACGAAGC TCCTGGCAGC
AGCTACATTC GGCTGATTTC ACTACCTTGG GAAATCCCGT ATTCACTTCC GGCCGACTAC
CGGCCAACAA ACGGCCGAGG ACTCACTTTG GCTGAAGGAG AGGATGTCGC CATTATCGCT
TACGGACCGG TGCTCCTGAG CAATGCGATC GCCGCCTCAA AAGTGCTCGC TGAGAAGCAC
GGTATCAGTG CCAAGGTCAT CAACCTACCT TGGCTAAATC ACGTGGATGC GGAGTGGCTG
CAATCGACGG TTTCAGCTTG CAAGGCAATT GTTTGCCTCG ACAACCACTA TGTCATCGGT
GGTCAAGGCG ATACGATCGC GCGTGCGCTT GCCGAAGCCG GAACCGGAAT TCCCGTCAAG
CACATCGGGA TCACAGGGGT TCCTCCTAGC GGAACCAACG TCCAAGTCCT CGGCGCAGTT
GGCCTCGATG CCTCGGCGAT CGCCGAGACG GTCGCGTCGG TAATCGGTTG A
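
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing used here. The helper name `gc_content` is illustrative, and the record's figure is presumably rounded to a whole percent.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence: 5 G + 2 C out of 10.
print(gc_content("GTGGGCGATC"))  # 70.0
```

Passing the full 1911 bp sequence instead of the ten-base prefix gives the gene-wide value to compare against the record.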
 
Protein sequence
MGDLVGQAVE GTLYYLPYKE FQRVRAINAP REQRAALFSD MCRLNALYMI GRAGSGHIGS 
SFSSLDIVSW LLLEGMTGDD VYFSSKGHDA PGYYAALIGA GKLDFELTHK LRQIDGLPGH
PDVGTPGMVT NTGSLGMGVS KAKGMVIANR LNNRPGRIFV MTGDGELQEG QFWESLVSAA
NSGLQEITVI IDHNKLQSDT FVKNVSDLGD LEAKLRAFGW CVARCDGNDI AAFAATLAAI
QDDPRPKVIV ADTVKGKGVS FMEHTSLDSD VAMYRFHSGA PDASNYRAAA QEIMDRLQAN
LSSAGITELE FETLERPASA APSEKAQRLV AAYSKALIAQ AKKHSNLVAL DADLVLDTGL
IPFREQFPDR FVECGIAEQD MVSTAGGMAL NGLLPIVHSF ACFLSTRPNE QIYNNATEKT
KIIYVGSLAG VVPGGPGHSH QSVRDISALA AMPGMTLVEP SCDAEVGLLL DWCVNEAPGS
SYIRLISLPW EIPYSLPADY RPTNGRGLTL AEGEDVAIIA YGPVLLSNAI AASKVLAEKH
GISAKVINLP WLNHVDAEWL QSTVSACKAI VCLDNHYVIG GQGDTIARAL AEAGTGIPVK
HIGITGVPPS GTNVQVLGAV GLDASAIAET VASVIG
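
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below uses only the standard library. Table 11 assigns the same amino acids to codons as the standard code but allows alternative start codons. Here the first codon is read as methionine when it is one of `ATG`, `GTG` or `TTG`, which is a simplifying assumption rather than the complete list of table 11 initiators. The function name `translate_cds` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a reduced set of start codons accepted as initiator Met.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ASSUMED_START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence of Rleg_0444.
first_60 = "GTGGGCGATC TCGTAGGACA AGCGGTAGAA GGCACCCTTT ATTACCTGCC ATATAAGGAA"
print(translate_cds(first_60))  # MGDLVGQAVEGTLYYLPYKE
```

The printed residues match the first 20 residues of the protein sequence above. Note that the gene begins with `GTG`, which would be read as valine in an internal position.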