Gene Rleg_0209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0209 
Symbol 
ID8015412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp220085 
End bp221131 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content60% 
IMG OID644822802 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002974059 
Protein GI241202963 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.120111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.224068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTATC TTGGTAAGAT TCGTGCGCGC TCTGTACAAG TTTCTGCTCG TGATGCAAAT 
TGTCCTTCCG GAATACATCG CGCTCCCAAG AGAGTTCTTG TCACCGGCGG TGCAGGTTTC
CTCGGATCGC ATCTCTGCGA GACGCTTTTG GCCGCCGGAC ACCAGGTGAT CTGCCTCGAC
AACTTTTCCA CCGGCATGCG GCGCAATATC GTCCATCTGA AGCGAGTCGA TCGCTTCAAT
GTCGTCGCCC ACGATATCGT CCACCCGCTC GATCTGGAAG TCGACGAGAT CTATAACCTC
GCCTGCCCGG CATCGCCCCC GCATTATCAG GCCGATCCGA TCCATACGAC AAAGACCTGC
GTGCTGGGCT CCCTCAACCT TCTGGAGCTG GCCGCGCGCA CCGGCGCACG TATCCTTCAG
GCATCCACCT CCGAAGTCTA CGGCGACCCG AACGTCCACC CGCAGGTCGA AAGCTACTGG
GGCAACGTCA ATTCGTTCGG GCCGCGCTCC TGCTATGACG AGGGCAAGCG CTGCGCCGAG
ACGCTGTTCT TCGACTTCCA CAACACGCAC GGCGTCGAGA TCAAGATCAT CCGCATCTTC
AACACCTACG GCCCGCGGAT GCGTCCGGAC GACGGCCGCG TCGTCTCGAA TTTCATCGTC
CAGGCCCTGA CGGGGCAAGA CATCACGATA TATGGCGACG GTTCGCAGAC CCGCTCGTTC
TGTTTCGTCG ATGATCTCAT CGGCGGCATG GTCCGCATGA TGGCCTCACC GTCGTCGCTG
ACGGGGCCTG TCAATCTCGG CAATCCGGGC GAATTCACGA TCCGGGAACT GGCCGAGCAG
GTGATCGGAT TGACCGGTTC CCGGTCGCAA ATCATCCATC GCGCTCTGCC GGTTGACGAT
CCCCGTCAGC GTCGCCCCGA TATTTCGCTT GCCATGCAGG AACTCGACTG GCGGCCGAAG
ATCGACTTGT CGAGCGGCCT GCGTCAGACG ATCGACTATT TCGATGGCGT TCTCACCCGT
CCGGCACGCG AGCTGGAGGC GGTCTGA
 
Protein sequence
MNYLGKIRAR SVQVSARDAN CPSGIHRAPK RVLVTGGAGF LGSHLCETLL AAGHQVICLD 
NFSTGMRRNI VHLKRVDRFN VVAHDIVHPL DLEVDEIYNL ACPASPPHYQ ADPIHTTKTC
VLGSLNLLEL AARTGARILQ ASTSEVYGDP NVHPQVESYW GNVNSFGPRS CYDEGKRCAE
TLFFDFHNTH GVEIKIIRIF NTYGPRMRPD DGRVVSNFIV QALTGQDITI YGDGSQTRSF
CFVDDLIGGM VRMMASPSSL TGPVNLGNPG EFTIRELAEQ VIGLTGSRSQ IIHRALPVDD
PRQRRPDISL AMQELDWRPK IDLSSGLRQT IDYFDGVLTR PARELEAV