Gene Information |
Locus tag | Rleg_3350 |
Symbol | |
ID | 8014232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3356299 |
End bp | 3358083 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644825909 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002977136 |
Protein GI | 241206040 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
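The coordinate and length fields in this table agree with one another, which a little arithmetic confirms. The snippet below is a minimal sketch, not part of the record: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 3356299
end_bp = 3358083

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values are 1785 bp and 594 aa, matching the Gene Length and Protein Length fields.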
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.907345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00746579 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGACGGACA GCCATTCGCC GAAGCGGCGC CTGCGTTCGC AGGACTGGTT CGACAATCCC GATCATATCG ACATGGCAGC GCTCTATCTG GAGCGCTTCA TGAATTACGG CATCACGCCG GAAGAACTGC GCTCCGGCAA GCCGGTCATC GGGATTGCCC AGAGCGGCAG CGATCTCACG CCTTGCAACA GAGTGCATGT CGAGCTTGCC AAGCGGGTGC GCGACGGCAT CCGCGATGCC GGCGGCATTC CGATCGAGTT TCCGACGCAT CCGATCTTCG AGAATTGCAA GCGCCCGACG GCCGCACTCG ACCGCAATCT CGCCTATCTC GGCCTCGTCG AAATCCTCTA CGGCTATCCG CTCGACGGTG TCGTGCTGAC CACCGGCTGC GACAAGACCA CGCCTTCAGC GATCATGGCT GCTTCGACGG TCGATATTCC GGCGATCGTG CTCTCCGGAG GGCCGATGCT CGACGGTTGG CACGAGGGGG AGCTGGCGGG CTCCGGGACG GTGATCTGGC GGATGCGGCG GAAATATGCG GCAGGCGAGA TCGATCGGGA GGAATTTCTG CAGGCGGCGC TCGATTCTGC GCCTTCCGTC GGCCACTGCA ATACGATGGG CACCGCTTCG ACGATGAATG CGCTGGCCGA GGCGCTCGGC CTTTCGCTGA CCGGCTGTGG CGCCATTCCG GCCGCTTACC GCGAACGCGG CCAGATGGCC TACCGCACCG GGCGACGCGC CGTCGAAATC GTGTTCGAGG ATCTGAAGCC GTCGGATATC CTGACGCGCG AGGCTTTCCT GAATGCGATC CGCACCAATT CGGCGATCGG CGGCTCGACC AACGCGCAGC CGCATCTGGC CGCGATGGCG AAGCACGCCG GCGTCGAACT CTATCCCGAC GATTGGCAGG TACATGGTTT CGATATCCCG CTGCTGGCCA ATGTCCAGCC GGCGGGCGCC TATCTCGGAG AGCGCTTTCA TCGTGCCGGC GGTACGCCGG CGATCATGTG GGAGTTGCTG CAGGCCGGAA AGCTCGCCGG CAACTGTCGC ACGGTGACGG GCAGGACGAT CGCCGAGAAC CTAGAGGGCA AGGAAGCGCG CGACCGCGAG GTTATCAAGC CGTTCGCTGA GCCGCTGAAG GAGCGGGCGG GCTTCCTCGT TCTCAAAGGC AATCTCTTCG ATTTCGCGAT CATGAAGATG AGCGTGGTCT CGGAGGATTT CCGCCGGCGC TACCTTGAGG AACCCGGGCA CGAAGGCGTC TTCGAGGGCA GGGCGGTGGT TTTCGACGGT TCCGAGGACT ATCACAAGCG CATCAACGAT CCCGAACTCG GTATCGACGA AAACACCATC CTCGCCATCC GCGGCGCCGG GCCGATCGGC TGGCCGGGTT CGGCTGAGGT CGTCAACATG CAGCCGCCGG ATCATCTCCT GAAGCGCGGC ATCAGCAGCC TGCCGACGAT CGGCGACGGC CGCCAGTCGG GCACGGCGGA CAGTCCCTCG ATCCTCAACG CCTCGCCGGA GAGTGCAGCG GGAGGCGGCC TCGCCTGGCT TCGTACCGGC GATATCATCC GCATCGACTT CAACCACGGG CGCTGCGACA TGCTGGTCGA GGACGCCGAG ATCGAACGGC GCAAGGGCGA CGGCATCCCG CCAGTGCCGG CGGATGCGAC GCCGTGGCAG CAGATCTACC GCCGCTCGGT GACGCAATTG TCGGACGGCG CGGTGCTGGA GGGAGCGGCG GAATTCCGCC AGATCGCAAA AAACCCGCCG CGGCACAACC ACTGA
Protein sequence | MTDSHSPKRR LRSQDWFDNP DHIDMAALYL ERFMNYGITP EELRSGKPVI GIAQSGSDLT PCNRVHVELA KRVRDGIRDA GGIPIEFPTH PIFENCKRPT AALDRNLAYL GLVEILYGYP LDGVVLTTGC DKTTPSAIMA ASTVDIPAIV LSGGPMLDGW HEGELAGSGT VIWRMRRKYA AGEIDREEFL QAALDSAPSV GHCNTMGTAS TMNALAEALG LSLTGCGAIP AAYRERGQMA YRTGRRAVEI VFEDLKPSDI LTREAFLNAI RTNSAIGGST NAQPHLAAMA KHAGVELYPD DWQVHGFDIP LLANVQPAGA YLGERFHRAG GTPAIMWELL QAGKLAGNCR TVTGRTIAEN LEGKEARDRE VIKPFAEPLK ERAGFLVLKG NLFDFAIMKM SVVSEDFRRR YLEEPGHEGV FEGRAVVFDG SEDYHKRIND PELGIDENTI LAIRGAGPIG WPGSAEVVNM QPPDHLLKRG ISSLPTIGDG RQSGTADSPS ILNASPESAA GGGLAWLRTG DIIRIDFNHG RCDMLVEDAE IERRKGDGIP PVPADATPWQ QIYRRSVTQL SDGAVLEGAA EFRQIAKNPP RHNH
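The gene sequence starts with GTG, yet the protein sequence starts with M. This is expected under translation table 11: GTG is an accepted alternative start codon in bacteria, and an initiating codon is read as methionine even though GTG codes for valine elsewhere in the reading frame. The following is a minimal sketch of that rule. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers, and the set of start codons is a deliberately small subset chosen for illustration, not the full list for table 11.

```python
from itertools import product

# Standard amino-acid assignments in the usual compact layout
# (bases ordered T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    A recognized start codon in the first position is read as methionine.
    """
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the gene sequence above, spaces removed.
head = "GTGACGGACAGCCATTCGCCGAAGCGGCGC"
tail = "CGGCACAACCACTGA"

print(translate(head))  # compare with the first 10 residues of the protein
print(translate(tail))  # compare with the last 4 residues of the protein
```

With these inputs the sketch yields MTDSHSPKRR for the head and RHNH for the tail, which agree with the first ten and last four residues of the listed protein sequence.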