Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2909 |
Symbol | |
ID | 6981653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2963606 |
End bp | 2965345 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397619 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002282403 |
Protein GI | 209550486 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA AAGCAGAATG GCCGCGCAAG CTGCGCTCGC AGGAATGGTA TGGCGGCACC AGCCGCGACG TAATCTACCA TCGCGGCTGG CTGAAGAACC AGGGTTATCC GCATGACCTG TTCGATGGCC GTCCGGTCAT CGGCATCCTG AATACCTGGT CTGATATGAC GCCGTGTAAC GGCCATCTGC GCGAACTCGC CGAGAAGGTG AAGGCGGGTG TCTGGGAGGC CGGCGGCTTC CCGCTCGAGG TGCCGGTGTT CTCGGCATCC GAAAACACTT TCCGCCCGAC CGCGATGATG TATCGCAACC TCGCCGCGTT GGCGGTGGAA GAGGCGATCC GCGGCCAGCC GATGGACGGC TGCGTGCTCT TGGTCGGCTG CGATAAGACC ACGCCGTCGC TGCTCATGGG GGCTGCCTCC TGCGACCTGC CGTCGATCGT CGTCACCGGC GGGCCGATGC TGAACGGCTA TTTCCGCGGT GAGCGTGTCG GTTCGGGCAC GCATCTGTGG AAGTTCTCCG AAATGGTGAA GGCCGGCGAG ATGACGCAGG CCGAGTTCCT CGAGGCTGAG GCGTCGATGA GCCGTTCGTC GGGCACCTGC AACACCATGG GCACCGCCTC CACCATGGCC TCCATGGCCG AGGCGCTCGG CATGGCACTA TCAGGCAATG CCGCGATCCC GGGCGTCGAT TCCCGCCGCA AGGTCATGGC GCAGCTGACC GGCCGCCGGA TCGTACAGAT GGTCAAGGAC GACCTGAAGC CCTCCGAGAT CATGACGAAA CAGGCTTTCG AAAACGCCAT CCGCACCAAT GCGGCGATCG GCGGATCGAC CAACGCCGTC ATCCACCTGC TTGCGATTGC CGGCCGCGTC GGCATCGATC TGTCGCTCGA CGACTGGGAC CGCTGCGGCC GCGACGTTCC CACAATCGTC AACCTGATGC CGTCGGGCAA GTACCTGATG GAAGAGTTCT TCTATGCCGG CGGCCTGCCG GTGGTGCTGA AGCGCCTCGG CGAGGCGGGC CTGCTGCATA AGGATGCGCT GACGGTTTCT GGCGAAACCG TCTGGGACGA GGTCAAGGAC GTCGTCAACT GGAATGAGGA CGTCATCCTG CCGGCCGAAA AGGCGCTGAC CTCTTCGGGC GGCATCGTCG TGCTGCGCGG CAATCTGGCG CCGAAGGGCG CGGTGCTGAA GCCTTCGGCG GCCTCGCCGC ATCTGTTGGT GCACAAGGGC AGGGCAGTCG TGTTCGAGGA TATCGACGAC TACAAGGCGA AGATCAACGA CGACAATCTC GACATCGACG AAAACTGCAT CATGGTCATG AAGAATTGCG GGCCGAAGGG TTATCCCGGG ATGGCCGAAG TCGGCAACAT GGGACTGCCG CCGAAGGTGC TGAAGAAGGG CATCCTCGAC ATGGTGCGCA TTTCCGACGC CCGCATGTCC GGAACGGCCT ACGGCACAGT TGTGCTGCAC ACCTCGCCGG AAGCGGCGGT CGGCGGGCCG CTCGCGGTCG TGAAAAACGG CGACATGATT GAGCTCGATG TGCCGAACCG TCGTCTGCAT CTCGACATTT CCGACGAGGA ATTGGCGCGG CGGCTGGCCG AATGGCAGCC GAACCACGAC CTGCCGACAT CGGGTTATGC CTTCCTGCAT CAGCAGCATG TCGAAGGGGC CGATACCGGC GCCGACCTCG ACTTCCTCAA GGGATGTCGC GGAAACGCGG TCGGCAAAGA CAGCCACTAA
|
Protein sequence | MKKKAEWPRK LRSQEWYGGT SRDVIYHRGW LKNQGYPHDL FDGRPVIGIL NTWSDMTPCN GHLRELAEKV KAGVWEAGGF PLEVPVFSAS ENTFRPTAMM YRNLAALAVE EAIRGQPMDG CVLLVGCDKT TPSLLMGAAS CDLPSIVVTG GPMLNGYFRG ERVGSGTHLW KFSEMVKAGE MTQAEFLEAE ASMSRSSGTC NTMGTASTMA SMAEALGMAL SGNAAIPGVD SRRKVMAQLT GRRIVQMVKD DLKPSEIMTK QAFENAIRTN AAIGGSTNAV IHLLAIAGRV GIDLSLDDWD RCGRDVPTIV NLMPSGKYLM EEFFYAGGLP VVLKRLGEAG LLHKDALTVS GETVWDEVKD VVNWNEDVIL PAEKALTSSG GIVVLRGNLA PKGAVLKPSA ASPHLLVHKG RAVVFEDIDD YKAKINDDNL DIDENCIMVM KNCGPKGYPG MAEVGNMGLP PKVLKKGILD MVRISDARMS GTAYGTVVLH TSPEAAVGGP LAVVKNGDMI ELDVPNRRLH LDISDEELAR RLAEWQPNHD LPTSGYAFLH QQHVEGADTG ADLDFLKGCR GNAVGKDSH
|
| |