Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6238 |
Symbol | |
ID | 8016029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | + |
Start bp | 299016 |
End bp | 300086 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644827543 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_002978743 |
Protein GI | 241258859 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.435131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.526104 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGG TGATCTACAG CCTTACCAGA AAGAGGGTCT ATGTCGCGGG CCACCGCGGC ATGGTGGGCT CTGCGATCGT GCGGCGTCTC GCTTCCGAGG GCTGCGAAAT TTTGACGTCC ACCCGCGCCG AGGTCGACCT CAGACGGCAG GACCAGGTGG AGGCCTGGAT GAGTAAGCAT CGTCCCGATG CTGTCTTCCT AGCTGCTGCG AGGGTCGGCG GTATTCTCGC GAACGCTACC TATCCGGCCG ACTTCCTTTA CGACAACTTG ATTCTCCAAG CGAATGTCAT CCACGCAGCC CATAGAACTG ACGTCGAAAA ACTGATGTTT CTGGGCTCGT CCTGCATCTA TCCGAAATTC GCCGACCAGC CGATCGTTGA GGACTCACTT CTGACCGGAT CGCTTGAACC CACCAATGAA TGGTATGCGA TCGCCAAAAT TGCCGGATTA AAGCTCTGCC AAGCCTATCG CAAACAGCAC GGTAGAGATT TCATCTCGGC CATGCCGACC AATCTTTACG GTCCAGGCGA CAATTTTGAC CTCGGGTCAA GCCATGTCAT GCCGGCGCTC ATACGCAAGA CACATGAGGC CAAGGTCAGC GAGCAGCAAG AGATATGCGT CTGGGGTACG GGCACGCCGC GGCGCGAATT CCTGCATGTT GACGATTGCG CCGACGCCTG CCTCCATCTC ATGAAAACCT ATTCCGCCGA AAGTCATGTG AACGTAGGTT GTGGCGAAGA CATTACCATT CTCGAATTGG CATACCTCGT CTCCAAGATC GTTGGTTTCG AAGGCAAGAT CACCCGCGAC CTCACCAAGC CAGATGGCAC GCCACGTAAA CTCCTGAGCG TCGACAAGCT CCGCAGTCTC GGCTGGTCTC CTAAGATAGG TCTGAAAGAG GGCATCGCAG ATGCCTACCG CTCCTTCCTT GATGGCCATC ATCTCGAACG CAGCGACAGA GCTGTGTCCA GCGACTTGAT CGGTCAAAGC GACATCAGTT TCGAGAAAGC GAAGAGTTCG GCGCCGCACG CGCCCACGCT CTCGACCGTT GCGCATCATC CCTCGCCATA G
|
Protein sequence | MPEVIYSLTR KRVYVAGHRG MVGSAIVRRL ASEGCEILTS TRAEVDLRRQ DQVEAWMSKH RPDAVFLAAA RVGGILANAT YPADFLYDNL ILQANVIHAA HRTDVEKLMF LGSSCIYPKF ADQPIVEDSL LTGSLEPTNE WYAIAKIAGL KLCQAYRKQH GRDFISAMPT NLYGPGDNFD LGSSHVMPAL IRKTHEAKVS EQQEICVWGT GTPRREFLHV DDCADACLHL MKTYSAESHV NVGCGEDITI LELAYLVSKI VGFEGKITRD LTKPDGTPRK LLSVDKLRSL GWSPKIGLKE GIADAYRSFL DGHHLERSDR AVSSDLIGQS DISFEKAKSS APHAPTLSTV AHHPSP
|
| |