Gene Rleg2_3093 details


Gene Information

Locus tag: Rleg2_3093
Symbol:
ID: 6981838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3156874
End bp: 3158658
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 65%
IMG OID: 643397803
Product: dihydroxy-acid dehydratase
Protein accession: YP_002282586
Protein GI: 209550669
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
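
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. The short Python check below is a sketch that assumes the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record above.
start_bp = 3156874
end_bp = 3158658

# Assumption: 1-based inclusive coordinates, so both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# One codon per residue; the final codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # matches the listed gene length
print(protein_length, "aa")   # matches the listed protein length
print(remainder)              # 0 means the gene is a whole number of codons
```

Under these assumptions the gene spans 595 codons with no leftover bases: 594 coding codons plus one stop.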


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0227397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACA GCCATCTTCC GAAGCGGCGC CTGCGGTCGC AGGATTGGTT CGACAATCCC 
GACCATATCG ACATGGCAGC GCTCTATCTC GAGCGTTTCA TGAATTACGG CATCACGCCG
GAAGAGCTGC GCTCCGGCAA GCCCATCATC GGGATTGCCC AGAGCGGCAG CGATCTTACG
CCTTGCAACA GGGTGCATGT CGAGCTTGCC AAGCGCGTGC GCGACGGCAT TCGCGATGCC
GGCGGCATTC CGATCGAATT TCCGACACAT CCGATCTTCG AGAACTGCAA GCGCCCAACG
GCGGCACTCG ACCGCAATCT TGCCTATCTC GGCCTCGTTG AAATCCTCTA TGGCTATCCG
CTCGACGGCG TCGTGCTGAC GACCGGCTGC GACAAGACCA CGCCGTCGGC GATCATGGCG
GCGTCGACGG TCGATATTCC GGCGATCGTG CTCTCCGGCG GCCCGATGCT CGACGGCTGG
CATGAGGGAG AGCTGGCGGG TTCCGGCACG GTGATCTGGC GGATGCGGCG GAAATATGCG
GCAGGCGAGA TCGACCGCGA GGAATTCCTG CAGGCAGCGC TCGACTCAGC GCCCTCCGTC
GGCCACTGCA ATACGATGGG CACGGCCTCG ACGATGAATG CGCTCGCCGA GGCGCTCGGT
CTGTCGCTGA CCGGATGCGG CGCCATTCCA GCTCCCTACC GCGAACGCGG GCAGATGGCC
TATCGCACCG GGCGGCGTGC CGTCGAGATC GTCTTCGAGG ACCTGAAGCC GTCGGATATC
CTGACGCGCG CGGCCTTCCT CAATGCCATC CGCACCAATT CGGCGATCGG CGGTTCGACC
AACGCGCAGC CGCATCTGGC GGCGATGGCC AAACACGCCG GCGTCGAGAT TCATCCCGAC
GACTGGCAGG TGCACGGCTT CGATATCCCG CTGCTCGCCA ACGTCCAGCC GGCCGGCGCC
TATCTCGGCG AGCGTTATCA TCGCGCCGGC GGCACGCCGG CGATCATGTG GGAATTGCTG
CAGGCTGGAA AGCTGGATGG CGACTGTCAT ACGGTGACGG GCAGGAGGAT GGCCGAGAAC
CTGCAGGGCA AGGAAGCCAG CGACCGCGAG GTCATTCGGC CGTTCGGCGA GCCGCTGAAG
GAGCGGGCCG GATTTCTCGT TCTCAAGGGC AATCTCTTCG ATTTCGCCAT CATGAAGATG
AGCGTAGTCT CGGAGGATTT TCGCCGGCGC TACCTTCAGG AGCCCGGGCG GGAGGGCGTC
TTCGAAGGCA AGGCGGTCGT CTTCGACGGG TCGGAAGATT ATCACAAGCG CATCAACGAT
CCCGACCTCG GCATCGACGA GAATACCATT CTGGTCATCC GCGGCGCCGG GCCGCTCGGC
TGGCCGGGTT CGGCAGAGGT TGTCAACATG CAGCCGCCCG ACCATCTTCT GAAGCGCGGC
ATCAGCAGCC TGCCGACGAT CGGTGACGGC CGCCAGTCGG GCACGGCCGA CAGCCCGTCG
ATCCTCAACG CCTCGCCGGA GAGCGCGGCC GGCGGCGGCC TCGCCTGGCT TCGCAGCGGC
GATATCATCC GCATCAACTT CAACCATGGG CGTTGCGACA TGCTGGTCGA CGAGACCGAG
ATCGAACGGC GCAAGGGCGA CGGTATTCCG CCGGTGCCGC CGGATGCGAC GCCTTGGCAG
CAGATCTACC GCCGCTCGGT CACGCAACTG TCGGACGGCG CGGTTCTGGA AGGAGCGGCG
GAATTCCGCC GGATTGCCAA AAACCCGCCG CGCCACAACC ACTGA
 
Protein sequence
MTDSHLPKRR LRSQDWFDNP DHIDMAALYL ERFMNYGITP EELRSGKPII GIAQSGSDLT 
PCNRVHVELA KRVRDGIRDA GGIPIEFPTH PIFENCKRPT AALDRNLAYL GLVEILYGYP
LDGVVLTTGC DKTTPSAIMA ASTVDIPAIV LSGGPMLDGW HEGELAGSGT VIWRMRRKYA
AGEIDREEFL QAALDSAPSV GHCNTMGTAS TMNALAEALG LSLTGCGAIP APYRERGQMA
YRTGRRAVEI VFEDLKPSDI LTRAAFLNAI RTNSAIGGST NAQPHLAAMA KHAGVEIHPD
DWQVHGFDIP LLANVQPAGA YLGERYHRAG GTPAIMWELL QAGKLDGDCH TVTGRRMAEN
LQGKEASDRE VIRPFGEPLK ERAGFLVLKG NLFDFAIMKM SVVSEDFRRR YLQEPGREGV
FEGKAVVFDG SEDYHKRIND PDLGIDENTI LVIRGAGPLG WPGSAEVVNM QPPDHLLKRG
ISSLPTIGDG RQSGTADSPS ILNASPESAA GGGLAWLRSG DIIRINFNHG RCDMLVDETE
IERRKGDGIP PVPPDATPWQ QIYRRSVTQL SDGAVLEGAA EFRRIAKNPP RHNH
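
The protein sequence can be related back to the gene sequence by translating it codon by codon. The Python sketch below is illustrative only: it builds the codon table from the standard amino-acid assignments (which translation table 11 shares with the standard code) and assumes that an ATG, GTG, or TTG codon in the first position of the CDS is read as methionine. The `translate` function and its `is_cds_start` parameter are hypothetical names introduced for this example. Only the opening 30 bases and the final row of the gene sequence are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: these codons initiate with methionine when they open a CDS.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=False):
    """Translate in-frame DNA, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence above.
gene_start = "GTGACCGACA GCCATCTTCC GAAGCGGCGC"
gene_end = "GAATTCCGCC GGATTGCCAA AAACCCGCCG CGCCACAACC ACTGA"

n_terminus = translate(gene_start, is_cds_start=True)
c_terminus = translate(gene_end)
print(n_terminus)  # MTDSHLPKRR
print(c_terminus)  # EFRRIAKNPPRHNH
```

The two results match the first ten and last fourteen residues of the protein sequence. Without the start-codon rule the leading GTG would be read as valine, which is why the first residue is handled separately.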