Gene Rleg_5349 details


Gene Information

Locus tag: Rleg_5349
Symbol:
ID: 8007307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 758933
End bp: 760669
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 63%
IMG OID: 644822253
Product: dihydroxy-acid dehydratase
Protein accession: YP_002973513
Protein GI: 241113678
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
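
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch of that check. The variable names are illustrative and not part of the record, and the sketch assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 758933
end_bp = 760669

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # prints: 1737 bp
print(protein_length, "aa")   # prints: 578 aa
```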


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.7217
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCGA GGAAAACCTA CGAGCAATTG CGGTCCGCCC GATGGATGTT GCCGGATGAT 
CAACGCTCGT TCGGCCACCG GTCGCGGACC ATGCAGATGG GTTACGCGCC GGAGGATTGG
CAGGGAAAGC CGATCATTGC TGTGATCAAC ACCTGGTCCG ATGCCCAGCC GTGCCACATG
CATTTCCGTG AACGCGCCGA ATGGGTGAAG CGGGGGATTC TTCAGTCGGG TGGCTTCCCC
ATGGAGCTGC CCGCTCTTTC CCTCTCCGAA AACTTCGTCA AGCCGACCAC GATGCTCTAT
CGCAATATGC TGGCGATGGA GACTGAGGAA CTGCTGCGGA GCCATCCTGT CGATGGCGCC
GTTCTGATGG GTGGCTGCGA CAAGACCACA CCAGGCCTTG TCATGGGCGC AGTCAGCATG
GGCATTCCCT TCATTTACCT GCCGGCCGGC CCGATGCTTC GCGGCAACTA TGCCGGCAAG
ACGCTGGGCT CCGGCACTGA CGGCTTCAAA TACTGGGACG AGCGCCGCGC CGGCACGATC
ACCCAGGAGG AGTGGCAGGG CATCGAAGGC GGGATCGCCC GCAGTTACGG CCATTGCATG
ACCATGGGCA CGGCGTCGAC GATGACGGCG ATCGCCGAAG CTATGGGATT GACGCTGCCG
GGCGCATCCT CGATCCCGGC TGCCGACGCC AACCACCAGC GCATGTCTGC GGCCTGTGGC
CGCCGCATCG TCGATATGGT GTGGGAGGAT CTGACCCCGG ACCAGATCAT CACTCCGGCG
GCCGTCGACA ACGCAGTCAC GGTCGCCATG GCGACCGGCT GCTCCACCAA TGCCATCATT
CACCTGATCG CCATGGCGCG GCGCGCCGGC GTCCCGCTGG AGCTCGATGA TCTCGATCGC
ATCGGTCGCA CGACGCCGGT TCTCGCCAAC ATCCGGCCTT CCGGTTCGAC CTATCTGATG
GAGGATTTCT TTTATGCCGG CGGCTTGCGC GCCCTGATGA AGCAGCTCGG CGACAAGCTG
GATCCCACCG CGATCACCGT TATGGGAAAA CCCTTGGTGG ACGGTCTCGA CCAGGTGAAG
ATCTACAATG ACGACGTTAT CCGGCCATTG TCGAACCCGG TCTATCACGA AGGTTCGCTG
GCCGTGCTCA AGGGGAACCT GTGTCCCGAC GGCGCGGTCA TCAAACCGGC GGCCTGCGAC
CCAAAATTCC ATCGCCATCG CGGCCCGGCG CTGGTCGCCG ACAGCTATGC GGAGATGAAG
AAGATCATCG ATGATCCCGA CTATCCGCTG ACGCCGGACA CGGTTCTCGT GCTCCGCAAT
GCCGGCCCCC AGGGTGGACC GGGCATGCCG GAATGGGGCA TGATCCCGAT GCCGAAGGCG
CTGTTGAAGC TCGGCCTGCG CGACATGGTA CGCATCTCGG ACGCCCGCAT GTCCGGAACC
AGTTTCGGCG CCTGCGTGCT GCATGTCGCA CCGGAATCTT ATGTTGGCGG GCCTTTGGCG
CTGCTGAGAA CGGGGGACAT GGTCGAGCTT GATATTCCGG CACGCAGCCT CAATATGCTG
GTTGCCGAAG AAGAGATCAC AGCGCGGCGG GCCGCCTGGG TGGCGCCGAC GCGGCATTAC
GAGCGCGGTT ATGGCTTTAT GTTCTCCGGC CATATCGAGC AGGCCGACAA AGGCTGCGAC
TTTGATTTCC TGACCACGGA ATTCGGTGGC AAGACGCCGG AACCGGCAAT CAACTGA
 
Protein sequence
MTARKTYEQL RSARWMLPDD QRSFGHRSRT MQMGYAPEDW QGKPIIAVIN TWSDAQPCHM 
HFRERAEWVK RGILQSGGFP MELPALSLSE NFVKPTTMLY RNMLAMETEE LLRSHPVDGA
VLMGGCDKTT PGLVMGAVSM GIPFIYLPAG PMLRGNYAGK TLGSGTDGFK YWDERRAGTI
TQEEWQGIEG GIARSYGHCM TMGTASTMTA IAEAMGLTLP GASSIPAADA NHQRMSAACG
RRIVDMVWED LTPDQIITPA AVDNAVTVAM ATGCSTNAII HLIAMARRAG VPLELDDLDR
IGRTTPVLAN IRPSGSTYLM EDFFYAGGLR ALMKQLGDKL DPTAITVMGK PLVDGLDQVK
IYNDDVIRPL SNPVYHEGSL AVLKGNLCPD GAVIKPAACD PKFHRHRGPA LVADSYAEMK
KIIDDPDYPL TPDTVLVLRN AGPQGGPGMP EWGMIPMPKA LLKLGLRDMV RISDARMSGT
SFGACVLHVA PESYVGGPLA LLRTGDMVEL DIPARSLNML VAEEEITARR AAWVAPTRHY
ERGYGFMFSG HIEQADKGCD FDFLTTEFGG KTPEPAIN
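
The gene sequence begins with GTG while the protein begins with M, which is the expected behaviour of an alternative start codon under translation table 11. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a hedged illustration rather than the database's own method: the function name `translate` is hypothetical, the codon table is built from the conventional TCAG-ordered amino-acid string for the standard code (whose amino-acid assignments table 11 shares), and the code simply assumes that the first codon of the CDS is read as methionine.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same amino-acid assignments and differs in allowed starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is the initiator and is emitted as 'M',
    even when it is an alternative start such as GTG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGACCGCGA GGAAAACCTA CGAGCAATTG"
print(translate(first_blocks))  # prints: MTARKTYEQL
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function is one way to compare it against the complete protein sequence.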