Gene Rleg2_5407 details


Gene Information

Locus tag: Rleg2_5407
Symbol:
ID: 6978501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 1050023
End bp: 1051759
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 62%
IMG OID: 643394509
Product: dihydroxy-acid dehydratase
Protein accession: YP_002279327
Protein GI: 209547409
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
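
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Coordinates copied from the Gene Information table.
# Assumption: positions are 1-based and inclusive.
start_bp = 1050023
end_bp = 1051759

# Inclusive span length in base pairs.
gene_length = end_bp - start_bp + 1

# A complete CDS should divide evenly into codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1737 bp) and Protein Length (578 aa).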


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.205419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00130415
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACCGCGA GGAAAACTTA TGAGCAATTG CGGTCGGCCC GATGGATGCT GCCGGACGAT 
CAGCGCTCGT TCGGTCACCG GTCGCGGACC ATGCAGATGG GTTATGCGCC GGAGGATTGG
CAGGGAAAGC CGATCATCGC AGTCATCAAC ACCTGGTCGG ACGCGCAGCC GTGTCACATG
CATTTTCGCG AACGCGCGGA ATGGGTGAAG CGGGGAATTC TTCAGTCGGG CGGGTTTCCC
ATGGAACTGC CTGCACTTTC CCTTTCCGAA AACTTCGTCA AGCCGACCAC CATGCTCTAT
CGCAACATGC TGGCGATGGA GACCGAGGAG CTATTGCGCA GCCATCCTGT CGATGGCGCC
GTTCTGATGG GCGGTTGCGA CAAGACCACG CCCGGCCTTA TCATGGGTGC TGTCAGCATG
GGCATTCCCT TTGTTTATCT GCCAGCCGGC CCGATGCTTC GCGGCAATTA CGCCGGTAAG
ACGCTCGGCT CCGGGACCGA CGGTTTCAAA TATTGGGACG AGCGGCGTGC CGGCACGATC
ACCAAGGAGG AGTGGCAGGG CATCGAAGGC GGCATTGCCC GCAGCTACGG CCATTGCATG
ACCATGGGAA CGGCATCGAC CATGACGGCG ATCGCCGAGG CTATGGGATT GACGCTGCCG
GGCGCTTCGT CGATTCCGGC AGCCGACGCC AACCACCAAC GCATGTCGGC GGCTTGCGGC
CGCCGCATCG TCGATATGGT GTGGGAGGAT CTGACGCCCG ACCAGATCAT CACGCCGGCG
GCCGTCGACA ATGCCGTCAC CGTCGCCATG GCGACCGGCT GCTCGACCAA TGCGATCATT
CACCTGATCG CCATGGCACG GCGCGCCGGC GTGCCGCTGG AGCTCGATGA CCTTGATCGC
ATCGGTCGCA CGACGCCGGT TCTTGCCAAC ATCCGGCCTT CCGGGTCGAC CTATCTGATG
GAGGATTTCT TCTATGCCGG CGGCCTGCGG GCGCTGATGA AGCAGCTCGG CGACAAGCTC
GATCCAACTG CGATTACCGT CACGGGAAAA CCGCTGGTGG ATGGCCTCGA CGAGGTGAAG
ATTTACAATG ACGACGTCAT CCGGCCACTG TCGAACCCGG TCTATCATGA AGGTTCGCTG
GCAGTGCTCA AGGGAAACCT GTGTCCCGAT GGCGCGGTCA TCAAGCCGGC GGCCTGCGAC
CCGAAATTCC ACCGCCATTG CGGCCCGGCG CTGGTCGCCG ACAGCTATGC GGAGATGAAG
AAGATCATCG ACGATCCCGA TTATCCCTTG ACGCCGGAGA CAGTGCTGGT GCTGCGCAAT
GCCGGCCCCC AGGGCGGGCC CGGCATGCCG GAATGGGGCA TGATCCCGAT GCCGAAGGCA
CTGTTGAAAC TCGGCCTGCG CGACATGTTG CGCATCTCCG ATGCCCGCAT GTCCGGAACC
AGTTTCGGCG CCTGCGTGCT GCACGCCGCG CCGGAATCCT ACATCGGCGG GCCGCTGGCA
TTGCTGAAAA CGGGCGATAT GGTCGAGCTC GACATTCCGG CGCGCAGCCT CAATATGCTG
GTTTCGGAAG AGGAGATCGC AGCCCGCCGT GCCGCCTGGG TGGCGCCGAC GCGACACTAC
GAGCGCGGTT ACGGCTTTAT GTTCTCCAAG CATATCGAGC AAGCCGACAA AGGCTGCGAC
TTCGACTTCC TGACGACGGA ATTCGGTGGC AAGACTCCGG AACCGGCTAT CAACTGA
 
Protein sequence
MTARKTYEQL RSARWMLPDD QRSFGHRSRT MQMGYAPEDW QGKPIIAVIN TWSDAQPCHM 
HFRERAEWVK RGILQSGGFP MELPALSLSE NFVKPTTMLY RNMLAMETEE LLRSHPVDGA
VLMGGCDKTT PGLIMGAVSM GIPFVYLPAG PMLRGNYAGK TLGSGTDGFK YWDERRAGTI
TKEEWQGIEG GIARSYGHCM TMGTASTMTA IAEAMGLTLP GASSIPAADA NHQRMSAACG
RRIVDMVWED LTPDQIITPA AVDNAVTVAM ATGCSTNAII HLIAMARRAG VPLELDDLDR
IGRTTPVLAN IRPSGSTYLM EDFFYAGGLR ALMKQLGDKL DPTAITVTGK PLVDGLDEVK
IYNDDVIRPL SNPVYHEGSL AVLKGNLCPD GAVIKPAACD PKFHRHCGPA LVADSYAEMK
KIIDDPDYPL TPETVLVLRN AGPQGGPGMP EWGMIPMPKA LLKLGLRDML RISDARMSGT
SFGACVLHAA PESYIGGPLA LLKTGDMVEL DIPARSLNML VSEEEIAARR AAWVAPTRHY
ERGYGFMFSK HIEQADKGCD FDFLTTEFGG KTPEPAIN
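
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below is a minimal illustration of that relationship, not the tool used to produce this record. The function name `translate_cds` and the constant names are hypothetical, the start-codon set is the one listed for NCBI translation table 11, and only the first line of the gene sequence is translated to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")         # an initiation codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGACCGCGA GGAAAACTTA TGAGCAATTG CGGTCGGCCC GATGGATGCT GCCGGACGAT"
print(translate_cds(first_line))  # MTARKTYEQLRSARWMLPDD
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same function would allow the whole translation to be compared against the listed protein.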