Gene Rleg_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0042 
Symbol 
ID8011289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp37209 
End bp38207 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content59% 
IMG OID644822632 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002973892 
Protein GI241202796 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.527428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.235169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCA CGTCAGATAA TTCTAATGGC GCCACCGTAC TGGTGACCGG CATTGGCGGA 
TTTCTCGCAG GCCACATTGC CTTGCAGTTG CTCAAGCAGG GGTATCGGGT CAGAGGAAGC
CTGCGCAGCA TCGGTACAAG CGCTGCGACG GTCGGTCAGC TTGGAGCGCA CACCGACGGG
CAACTGCAAA ATCTCGGTTT GGTGCAGGCC GATCTTGACA GCGATAGCGG TTGGGCTGCG
GCTGTCGAAG GATGCGACTA TGTCATTCAC ACCGCATCGC CGTTCCCTCC GGGATATCCC
GAAAATGAGA ATGCACTGAT CCAGACAGCC CGCGATGGTG CGTTGCGCGT GCTTCGCGAG
GCGCATCGGG CACGGGTCAA ACGTGTTGTT CTGACATCCT CCATAGCTGC CACCAACCAT
GGCGACGGGC GGGCGCCCTT TACCGAAGAG AATTGGACCG ACCCGGAAAG CCCGCGGGCG
ACGCCCTATT ACAAATCTAA GACGCTCGAT CTGGCCGTGA TCAATCCAAG CGTCATCCTC
GGGCCGTTGC TCGGGCCGAA TTTCGGGACG TCTGTTGGAT TGATCCACCA TTTGATGACG
GGACGATTCA ACGGTATCCC GCGCTTTGGC TTCTCCGTCG TGGATGTGCG TGATACCGCC
GATGCCCACA TTCGAGCGAT GACCGATCCT GCTGCCGGCG GCCAACGGTT CATCATCGGT
GGACGGTTTT TCTGGCTCAA GGACCTTGTG GCCATTCTTG CCCATTCCTT TCCCGACCAT
GCCTCCCGCC TGCCGTCCGG CGAAGTCTCT GACGAGATCG TCAGGGTCAT GGCGCAATCC
GACCCCGATG CACGAACCAT TGTTCATGAG CTCAATCGCG ACCTCAGTGT CAGTGCGGCA
AAAGCCCACC GCGTCCTCGG GTGGCGCTCA CGTCCAGAAG AGCAATGCAT CCGCGCCAGC
GCCCAAAGCC TCATCGACTT GGGATTGGTG CCGGCCTAG
 
Protein sequence
MSITSDNSNG ATVLVTGIGG FLAGHIALQL LKQGYRVRGS LRSIGTSAAT VGQLGAHTDG 
QLQNLGLVQA DLDSDSGWAA AVEGCDYVIH TASPFPPGYP ENENALIQTA RDGALRVLRE
AHRARVKRVV LTSSIAATNH GDGRAPFTEE NWTDPESPRA TPYYKSKTLD LAVINPSVIL
GPLLGPNFGT SVGLIHHLMT GRFNGIPRFG FSVVDVRDTA DAHIRAMTDP AAGGQRFIIG
GRFFWLKDLV AILAHSFPDH ASRLPSGEVS DEIVRVMAQS DPDARTIVHE LNRDLSVSAA
KAHRVLGWRS RPEEQCIRAS AQSLIDLGLV PA