Gene Information |
Locus tag | Rleg2_3093 |
Symbol | |
ID | 6981838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3156874 |
End bp | 3158658 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643397803 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002282586 |
Protein GI | 209550669 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
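The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3156874
end_bp = 3158658

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so every three bases form one codon;
# the final codon is a stop and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1785 594
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.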
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0227397 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACA GCCATCTTCC GAAGCGGCGC CTGCGGTCGC AGGATTGGTT CGACAATCCC GACCATATCG ACATGGCAGC GCTCTATCTC GAGCGTTTCA TGAATTACGG CATCACGCCG GAAGAGCTGC GCTCCGGCAA GCCCATCATC GGGATTGCCC AGAGCGGCAG CGATCTTACG CCTTGCAACA GGGTGCATGT CGAGCTTGCC AAGCGCGTGC GCGACGGCAT TCGCGATGCC GGCGGCATTC CGATCGAATT TCCGACACAT CCGATCTTCG AGAACTGCAA GCGCCCAACG GCGGCACTCG ACCGCAATCT TGCCTATCTC GGCCTCGTTG AAATCCTCTA TGGCTATCCG CTCGACGGCG TCGTGCTGAC GACCGGCTGC GACAAGACCA CGCCGTCGGC GATCATGGCG GCGTCGACGG TCGATATTCC GGCGATCGTG CTCTCCGGCG GCCCGATGCT CGACGGCTGG CATGAGGGAG AGCTGGCGGG TTCCGGCACG GTGATCTGGC GGATGCGGCG GAAATATGCG GCAGGCGAGA TCGACCGCGA GGAATTCCTG CAGGCAGCGC TCGACTCAGC GCCCTCCGTC GGCCACTGCA ATACGATGGG CACGGCCTCG ACGATGAATG CGCTCGCCGA GGCGCTCGGT CTGTCGCTGA CCGGATGCGG CGCCATTCCA GCTCCCTACC GCGAACGCGG GCAGATGGCC TATCGCACCG GGCGGCGTGC CGTCGAGATC GTCTTCGAGG ACCTGAAGCC GTCGGATATC CTGACGCGCG CGGCCTTCCT CAATGCCATC CGCACCAATT CGGCGATCGG CGGTTCGACC AACGCGCAGC CGCATCTGGC GGCGATGGCC AAACACGCCG GCGTCGAGAT TCATCCCGAC GACTGGCAGG TGCACGGCTT CGATATCCCG CTGCTCGCCA ACGTCCAGCC GGCCGGCGCC TATCTCGGCG AGCGTTATCA TCGCGCCGGC GGCACGCCGG CGATCATGTG GGAATTGCTG CAGGCTGGAA AGCTGGATGG CGACTGTCAT ACGGTGACGG GCAGGAGGAT GGCCGAGAAC CTGCAGGGCA AGGAAGCCAG CGACCGCGAG GTCATTCGGC CGTTCGGCGA GCCGCTGAAG GAGCGGGCCG GATTTCTCGT TCTCAAGGGC AATCTCTTCG ATTTCGCCAT CATGAAGATG AGCGTAGTCT CGGAGGATTT TCGCCGGCGC TACCTTCAGG AGCCCGGGCG GGAGGGCGTC TTCGAAGGCA AGGCGGTCGT CTTCGACGGG TCGGAAGATT ATCACAAGCG CATCAACGAT CCCGACCTCG GCATCGACGA GAATACCATT CTGGTCATCC GCGGCGCCGG GCCGCTCGGC TGGCCGGGTT CGGCAGAGGT TGTCAACATG CAGCCGCCCG ACCATCTTCT GAAGCGCGGC ATCAGCAGCC TGCCGACGAT CGGTGACGGC CGCCAGTCGG GCACGGCCGA CAGCCCGTCG ATCCTCAACG CCTCGCCGGA GAGCGCGGCC GGCGGCGGCC TCGCCTGGCT TCGCAGCGGC GATATCATCC GCATCAACTT CAACCATGGG CGTTGCGACA TGCTGGTCGA CGAGACCGAG ATCGAACGGC GCAAGGGCGA CGGTATTCCG CCGGTGCCGC CGGATGCGAC GCCTTGGCAG CAGATCTACC GCCGCTCGGT CACGCAACTG TCGGACGGCG CGGTTCTGGA AGGAGCGGCG GAATTCCGCC GGATTGCCAA AAACCCGCCG CGCCACAACC ACTGA
|
Protein sequence | MTDSHLPKRR LRSQDWFDNP DHIDMAALYL ERFMNYGITP EELRSGKPII GIAQSGSDLT PCNRVHVELA KRVRDGIRDA GGIPIEFPTH PIFENCKRPT AALDRNLAYL GLVEILYGYP LDGVVLTTGC DKTTPSAIMA ASTVDIPAIV LSGGPMLDGW HEGELAGSGT VIWRMRRKYA AGEIDREEFL QAALDSAPSV GHCNTMGTAS TMNALAEALG LSLTGCGAIP APYRERGQMA YRTGRRAVEI VFEDLKPSDI LTRAAFLNAI RTNSAIGGST NAQPHLAAMA KHAGVEIHPD DWQVHGFDIP LLANVQPAGA YLGERYHRAG GTPAIMWELL QAGKLDGDCH TVTGRRMAEN LQGKEASDRE VIRPFGEPLK ERAGFLVLKG NLFDFAIMKM SVVSEDFRRR YLQEPGREGV FEGKAVVFDG SEDYHKRIND PDLGIDENTI LVIRGAGPLG WPGSAEVVNM QPPDHLLKRG ISSLPTIGDG RQSGTADSPS ILNASPESAA GGGLAWLRSG DIIRINFNHG RCDMLVDETE IERRKGDGIP PVPPDATPWQ QIYRRSVTQL SDGAVLEGAA EFRRIAKNPP RHNH
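The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG is an accepted start codon and an initiating codon is read as methionine. The gene sequence also appears to be given already in coding orientation, although the record lists the minus strand. Below is a minimal sketch of that translation step, assuming the standard codon assignments shared by tables 1 and 11. The helper names `translate_cds` and `gc_fraction` are hypothetical and are not part of any database tool.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the
# standard amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna):
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        aa = CODONS[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if pos == 0 and codon in STARTS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases, comparable to the 'GC content' field."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 15 bases of the gene sequence above.
print(translate_cds("GTGACCGACA GCCATCTTCC GAAGCGGCGC"))  # MTDSHLPKRR
print(translate_cds("CGCCACAACC ACTGA"))                  # RHNH
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the "Protein Length" and "GC content" fields to be checked in the same way.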