Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0042 |
Symbol | |
ID | 8011289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 37209 |
End bp | 38207 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644822632 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_002973892 |
Protein GI | 241202796 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.527428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.235169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATCA CGTCAGATAA TTCTAATGGC GCCACCGTAC TGGTGACCGG CATTGGCGGA TTTCTCGCAG GCCACATTGC CTTGCAGTTG CTCAAGCAGG GGTATCGGGT CAGAGGAAGC CTGCGCAGCA TCGGTACAAG CGCTGCGACG GTCGGTCAGC TTGGAGCGCA CACCGACGGG CAACTGCAAA ATCTCGGTTT GGTGCAGGCC GATCTTGACA GCGATAGCGG TTGGGCTGCG GCTGTCGAAG GATGCGACTA TGTCATTCAC ACCGCATCGC CGTTCCCTCC GGGATATCCC GAAAATGAGA ATGCACTGAT CCAGACAGCC CGCGATGGTG CGTTGCGCGT GCTTCGCGAG GCGCATCGGG CACGGGTCAA ACGTGTTGTT CTGACATCCT CCATAGCTGC CACCAACCAT GGCGACGGGC GGGCGCCCTT TACCGAAGAG AATTGGACCG ACCCGGAAAG CCCGCGGGCG ACGCCCTATT ACAAATCTAA GACGCTCGAT CTGGCCGTGA TCAATCCAAG CGTCATCCTC GGGCCGTTGC TCGGGCCGAA TTTCGGGACG TCTGTTGGAT TGATCCACCA TTTGATGACG GGACGATTCA ACGGTATCCC GCGCTTTGGC TTCTCCGTCG TGGATGTGCG TGATACCGCC GATGCCCACA TTCGAGCGAT GACCGATCCT GCTGCCGGCG GCCAACGGTT CATCATCGGT GGACGGTTTT TCTGGCTCAA GGACCTTGTG GCCATTCTTG CCCATTCCTT TCCCGACCAT GCCTCCCGCC TGCCGTCCGG CGAAGTCTCT GACGAGATCG TCAGGGTCAT GGCGCAATCC GACCCCGATG CACGAACCAT TGTTCATGAG CTCAATCGCG ACCTCAGTGT CAGTGCGGCA AAAGCCCACC GCGTCCTCGG GTGGCGCTCA CGTCCAGAAG AGCAATGCAT CCGCGCCAGC GCCCAAAGCC TCATCGACTT GGGATTGGTG CCGGCCTAG
|
Protein sequence | MSITSDNSNG ATVLVTGIGG FLAGHIALQL LKQGYRVRGS LRSIGTSAAT VGQLGAHTDG QLQNLGLVQA DLDSDSGWAA AVEGCDYVIH TASPFPPGYP ENENALIQTA RDGALRVLRE AHRARVKRVV LTSSIAATNH GDGRAPFTEE NWTDPESPRA TPYYKSKTLD LAVINPSVIL GPLLGPNFGT SVGLIHHLMT GRFNGIPRFG FSVVDVRDTA DAHIRAMTDP AAGGQRFIIG GRFFWLKDLV AILAHSFPDH ASRLPSGEVS DEIVRVMAQS DPDARTIVHE LNRDLSVSAA KAHRVLGWRS RPEEQCIRAS AQSLIDLGLV PA
|
| |