Gene Rleg_6238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6238 
Symbol 
ID8016029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp299016 
End bp300086 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content57% 
IMG OID644827543 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002978743 
Protein GI241258859 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.435131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.526104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGG TGATCTACAG CCTTACCAGA AAGAGGGTCT ATGTCGCGGG CCACCGCGGC 
ATGGTGGGCT CTGCGATCGT GCGGCGTCTC GCTTCCGAGG GCTGCGAAAT TTTGACGTCC
ACCCGCGCCG AGGTCGACCT CAGACGGCAG GACCAGGTGG AGGCCTGGAT GAGTAAGCAT
CGTCCCGATG CTGTCTTCCT AGCTGCTGCG AGGGTCGGCG GTATTCTCGC GAACGCTACC
TATCCGGCCG ACTTCCTTTA CGACAACTTG ATTCTCCAAG CGAATGTCAT CCACGCAGCC
CATAGAACTG ACGTCGAAAA ACTGATGTTT CTGGGCTCGT CCTGCATCTA TCCGAAATTC
GCCGACCAGC CGATCGTTGA GGACTCACTT CTGACCGGAT CGCTTGAACC CACCAATGAA
TGGTATGCGA TCGCCAAAAT TGCCGGATTA AAGCTCTGCC AAGCCTATCG CAAACAGCAC
GGTAGAGATT TCATCTCGGC CATGCCGACC AATCTTTACG GTCCAGGCGA CAATTTTGAC
CTCGGGTCAA GCCATGTCAT GCCGGCGCTC ATACGCAAGA CACATGAGGC CAAGGTCAGC
GAGCAGCAAG AGATATGCGT CTGGGGTACG GGCACGCCGC GGCGCGAATT CCTGCATGTT
GACGATTGCG CCGACGCCTG CCTCCATCTC ATGAAAACCT ATTCCGCCGA AAGTCATGTG
AACGTAGGTT GTGGCGAAGA CATTACCATT CTCGAATTGG CATACCTCGT CTCCAAGATC
GTTGGTTTCG AAGGCAAGAT CACCCGCGAC CTCACCAAGC CAGATGGCAC GCCACGTAAA
CTCCTGAGCG TCGACAAGCT CCGCAGTCTC GGCTGGTCTC CTAAGATAGG TCTGAAAGAG
GGCATCGCAG ATGCCTACCG CTCCTTCCTT GATGGCCATC ATCTCGAACG CAGCGACAGA
GCTGTGTCCA GCGACTTGAT CGGTCAAAGC GACATCAGTT TCGAGAAAGC GAAGAGTTCG
GCGCCGCACG CGCCCACGCT CTCGACCGTT GCGCATCATC CCTCGCCATA G
 
Protein sequence
MPEVIYSLTR KRVYVAGHRG MVGSAIVRRL ASEGCEILTS TRAEVDLRRQ DQVEAWMSKH 
RPDAVFLAAA RVGGILANAT YPADFLYDNL ILQANVIHAA HRTDVEKLMF LGSSCIYPKF
ADQPIVEDSL LTGSLEPTNE WYAIAKIAGL KLCQAYRKQH GRDFISAMPT NLYGPGDNFD
LGSSHVMPAL IRKTHEAKVS EQQEICVWGT GTPRREFLHV DDCADACLHL MKTYSAESHV
NVGCGEDITI LELAYLVSKI VGFEGKITRD LTKPDGTPRK LLSVDKLRSL GWSPKIGLKE
GIADAYRSFL DGHHLERSDR AVSSDLIGQS DISFEKAKSS APHAPTLSTV AHHPSP