Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5407 |
Symbol | |
ID | 6978501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1050023 |
End bp | 1051759 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394509 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002279327 |
Protein GI | 209547409 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.205419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00130415 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGACCGCGA GGAAAACTTA TGAGCAATTG CGGTCGGCCC GATGGATGCT GCCGGACGAT CAGCGCTCGT TCGGTCACCG GTCGCGGACC ATGCAGATGG GTTATGCGCC GGAGGATTGG CAGGGAAAGC CGATCATCGC AGTCATCAAC ACCTGGTCGG ACGCGCAGCC GTGTCACATG CATTTTCGCG AACGCGCGGA ATGGGTGAAG CGGGGAATTC TTCAGTCGGG CGGGTTTCCC ATGGAACTGC CTGCACTTTC CCTTTCCGAA AACTTCGTCA AGCCGACCAC CATGCTCTAT CGCAACATGC TGGCGATGGA GACCGAGGAG CTATTGCGCA GCCATCCTGT CGATGGCGCC GTTCTGATGG GCGGTTGCGA CAAGACCACG CCCGGCCTTA TCATGGGTGC TGTCAGCATG GGCATTCCCT TTGTTTATCT GCCAGCCGGC CCGATGCTTC GCGGCAATTA CGCCGGTAAG ACGCTCGGCT CCGGGACCGA CGGTTTCAAA TATTGGGACG AGCGGCGTGC CGGCACGATC ACCAAGGAGG AGTGGCAGGG CATCGAAGGC GGCATTGCCC GCAGCTACGG CCATTGCATG ACCATGGGAA CGGCATCGAC CATGACGGCG ATCGCCGAGG CTATGGGATT GACGCTGCCG GGCGCTTCGT CGATTCCGGC AGCCGACGCC AACCACCAAC GCATGTCGGC GGCTTGCGGC CGCCGCATCG TCGATATGGT GTGGGAGGAT CTGACGCCCG ACCAGATCAT CACGCCGGCG GCCGTCGACA ATGCCGTCAC CGTCGCCATG GCGACCGGCT GCTCGACCAA TGCGATCATT CACCTGATCG CCATGGCACG GCGCGCCGGC GTGCCGCTGG AGCTCGATGA CCTTGATCGC ATCGGTCGCA CGACGCCGGT TCTTGCCAAC ATCCGGCCTT CCGGGTCGAC CTATCTGATG GAGGATTTCT TCTATGCCGG CGGCCTGCGG GCGCTGATGA AGCAGCTCGG CGACAAGCTC GATCCAACTG CGATTACCGT CACGGGAAAA CCGCTGGTGG ATGGCCTCGA CGAGGTGAAG ATTTACAATG ACGACGTCAT CCGGCCACTG TCGAACCCGG TCTATCATGA AGGTTCGCTG GCAGTGCTCA AGGGAAACCT GTGTCCCGAT GGCGCGGTCA TCAAGCCGGC GGCCTGCGAC CCGAAATTCC ACCGCCATTG CGGCCCGGCG CTGGTCGCCG ACAGCTATGC GGAGATGAAG AAGATCATCG ACGATCCCGA TTATCCCTTG ACGCCGGAGA CAGTGCTGGT GCTGCGCAAT GCCGGCCCCC AGGGCGGGCC CGGCATGCCG GAATGGGGCA TGATCCCGAT GCCGAAGGCA CTGTTGAAAC TCGGCCTGCG CGACATGTTG CGCATCTCCG ATGCCCGCAT GTCCGGAACC AGTTTCGGCG CCTGCGTGCT GCACGCCGCG CCGGAATCCT ACATCGGCGG GCCGCTGGCA TTGCTGAAAA CGGGCGATAT GGTCGAGCTC GACATTCCGG CGCGCAGCCT CAATATGCTG GTTTCGGAAG AGGAGATCGC AGCCCGCCGT GCCGCCTGGG TGGCGCCGAC GCGACACTAC GAGCGCGGTT ACGGCTTTAT GTTCTCCAAG CATATCGAGC AAGCCGACAA AGGCTGCGAC TTCGACTTCC TGACGACGGA ATTCGGTGGC AAGACTCCGG AACCGGCTAT CAACTGA
|
Protein sequence | MTARKTYEQL RSARWMLPDD QRSFGHRSRT MQMGYAPEDW QGKPIIAVIN TWSDAQPCHM HFRERAEWVK RGILQSGGFP MELPALSLSE NFVKPTTMLY RNMLAMETEE LLRSHPVDGA VLMGGCDKTT PGLIMGAVSM GIPFVYLPAG PMLRGNYAGK TLGSGTDGFK YWDERRAGTI TKEEWQGIEG GIARSYGHCM TMGTASTMTA IAEAMGLTLP GASSIPAADA NHQRMSAACG RRIVDMVWED LTPDQIITPA AVDNAVTVAM ATGCSTNAII HLIAMARRAG VPLELDDLDR IGRTTPVLAN IRPSGSTYLM EDFFYAGGLR ALMKQLGDKL DPTAITVTGK PLVDGLDEVK IYNDDVIRPL SNPVYHEGSL AVLKGNLCPD GAVIKPAACD PKFHRHCGPA LVADSYAEMK KIIDDPDYPL TPETVLVLRN AGPQGGPGMP EWGMIPMPKA LLKLGLRDML RISDARMSGT SFGACVLHAA PESYIGGPLA LLKTGDMVEL DIPARSLNML VSEEEIAARR AAWVAPTRHY ERGYGFMFSK HIEQADKGCD FDFLTTEFGG KTPEPAIN
|
| |