Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1459 |
Symbol | |
ID | 8012548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1446089 |
End bp | 1447927 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644824048 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002975290 |
Protein GI | 241204194 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.141377 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTTT ACCGTTCCAG AACCACGACC CATGGCCGCA ACATGGCGGG CGCCCGCGGC CTTTGGCGCG CCACGGGCAT GAAGGATTCG GATTTCGGCA AGCCGATCAT CGCGGTGGTG AATTCCTTCA CCCAGTTCGT GCCCGGCCAC GTGCACCTGA AGGACCTTGG CCAGCTCGTT GCCCGCGAGA TCGAGGCGGC CGGCGGTGTC GCCAAGGAAT TCAACACGAT CGCCGTCGAT GACGGCATCG CCATGGGCCA TGACGGCATG CTTTATTCGC TGCCCTCGCG TGAGCTCATC GCCGACAGCG TCGAATATAT GGTCAATGCT CATTGCGCCG ACGCCATGGT CTGCATCTCC AATTGCGACA AGATCACCCC CGGCATGCTG ATGGCGTCGC TGCGTCTCAA TATCCCGACC GTCTTCGTCT CGGGCGGTCC GATGGAAGCC GGCAAGGTCG TGCTGCACGG CAAGACGCAT GCGCTCGACC TGGTCGATGC CATGGTCGCC GCAGCCGATG ACAAGATCAG CGACGAGGAC GTCCAGACCA TCGAACGCTC GGCCTGTCCG ACCTGTGGTT CCTGCTCCGG CATGTTCACC GCCAATTCGA TGAACTGCCT GACGGAAGCC CTCGGCCTGT CGCTGCCCGG CAACGGCTCG ACGCTTGCCA CCCATCTCGA CCGCAAGCGC CTCTTCGTCG AGGCCGGTCA TCTGATTGTC GATCTCGCCC GCCGTTATTA CGAGCAGGAT GACGTCAAGG CGCTGCCGCG CACCATTGCC TCCAAGCAGG CCTTCGAGAA TGCCATGACG CTCGATATCG CCATGGGCGG TTCCACCAAT ACGGTCCTGC ACATTCTTGC CGCCGCCCAT GAAGGCGAGA TCGATTTCAA TATGGCCGAT ATCGACGCGC TGTCGCGCCG CGTGCCGTGC CTGTCGAAGG TCGCACCCGC CAAGAGTGAC GTGCATATGG AAGACGTCCA CCGCGCCGGC GGCATCATGT CGATCCTCGG CGAACTCGAC AAGGGTGGTC TCTTGAACCG CGATTGCCCG ACGGTCCATG CCGAGACGCT GGGCGATGCG ATCGATCGCT GGGATATCAC CCGCACGAAC AGCGAAACCG TGCGCAACTT CTATCGTGCC GCACCCGGCG GCATCCCGAC CCAGGTCGCC TTCAGCCAGG AAGCCCGTTG GGACGATCTC GACACCGATC GCGAGAACGG CGTCATCCGC TCGGTCGAGC ATCCCTTCTC CAAGGATGGC GGCCTTGCCG TGCTCAAGGG CAACCTTGCG ATTGACGGCT GCATCGTCAA GACCGCTGGC GTCGATGAAT CGATCCTGAA GTTCTCCGGC CCCGCCCGCG TCTTCGAAAG CCAGGATTCG TCGGTCAAGG CGATCCTTGC CAACGAGGTG AAGGCCGGCG ACGTCGTCGT CATCCGCTAC GAAGGTCCGA AGGGCGGCCC GGGCATGCAG GAAATGCTCT ATCCGACGAG CTATCTGAAG TCGAAGGGCC TCGGCAAGGC ATGCGCGCTC ATCACCGACG GCCGCTTCTC CGGCGGCACT TCCGGCCTCT CGATCGGCCA CGCCTCGCCG GAAGCGGCAA ATGGCGGCAC GATCGGCCTG GTGCGCGAAG GCGACATGAT CGACATCGAC ATCCCCAACC GCACGATCAG CCTGCGTGTC AGCGAGACTG AACTCGCCGC CCGCCGCGCC GAGCAGGACG CCAAGGGCTG GTACCCGGTC GAAGTCCGCA AGCGCAATGT CACGACCGCG CTAAAGGCCT ACGCGGCCTT CGCAACGAGT GCGGACCGCG GTGCCGTGCG CGATCTGAAC GCCCGCTGA
|
Protein sequence | MPVYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MASLRLNIPT VFVSGGPMEA GKVVLHGKTH ALDLVDAMVA AADDKISDED VQTIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHLDRKR LFVEAGHLIV DLARRYYEQD DVKALPRTIA SKQAFENAMT LDIAMGGSTN TVLHILAAAH EGEIDFNMAD IDALSRRVPC LSKVAPAKSD VHMEDVHRAG GIMSILGELD KGGLLNRDCP TVHAETLGDA IDRWDITRTN SETVRNFYRA APGGIPTQVA FSQEARWDDL DTDRENGVIR SVEHPFSKDG GLAVLKGNLA IDGCIVKTAG VDESILKFSG PARVFESQDS SVKAILANEV KAGDVVVIRY EGPKGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHASP EAANGGTIGL VREGDMIDID IPNRTISLRV SETELAARRA EQDAKGWYPV EVRKRNVTTA LKAYAAFATS ADRGAVRDLN AR
|
| |