Gene Rleg_3350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3350 
Symbol 
ID8014232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3356299 
End bp3358083 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content65% 
IMG OID644825909 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002977136 
Protein GI241206040 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.907345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00746579 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGGACA GCCATTCGCC GAAGCGGCGC CTGCGTTCGC AGGACTGGTT CGACAATCCC 
GATCATATCG ACATGGCAGC GCTCTATCTG GAGCGCTTCA TGAATTACGG CATCACGCCG
GAAGAACTGC GCTCCGGCAA GCCGGTCATC GGGATTGCCC AGAGCGGCAG CGATCTCACG
CCTTGCAACA GAGTGCATGT CGAGCTTGCC AAGCGGGTGC GCGACGGCAT CCGCGATGCC
GGCGGCATTC CGATCGAGTT TCCGACGCAT CCGATCTTCG AGAATTGCAA GCGCCCGACG
GCCGCACTCG ACCGCAATCT CGCCTATCTC GGCCTCGTCG AAATCCTCTA CGGCTATCCG
CTCGACGGTG TCGTGCTGAC CACCGGCTGC GACAAGACCA CGCCTTCAGC GATCATGGCT
GCTTCGACGG TCGATATTCC GGCGATCGTG CTCTCCGGAG GGCCGATGCT CGACGGTTGG
CACGAGGGGG AGCTGGCGGG CTCCGGGACG GTGATCTGGC GGATGCGGCG GAAATATGCG
GCAGGCGAGA TCGATCGGGA GGAATTTCTG CAGGCGGCGC TCGATTCTGC GCCTTCCGTC
GGCCACTGCA ATACGATGGG CACCGCTTCG ACGATGAATG CGCTGGCCGA GGCGCTCGGC
CTTTCGCTGA CCGGCTGTGG CGCCATTCCG GCCGCTTACC GCGAACGCGG CCAGATGGCC
TACCGCACCG GGCGACGCGC CGTCGAAATC GTGTTCGAGG ATCTGAAGCC GTCGGATATC
CTGACGCGCG AGGCTTTCCT GAATGCGATC CGCACCAATT CGGCGATCGG CGGCTCGACC
AACGCGCAGC CGCATCTGGC CGCGATGGCG AAGCACGCCG GCGTCGAACT CTATCCCGAC
GATTGGCAGG TACATGGTTT CGATATCCCG CTGCTGGCCA ATGTCCAGCC GGCGGGCGCC
TATCTCGGAG AGCGCTTTCA TCGTGCCGGC GGTACGCCGG CGATCATGTG GGAGTTGCTG
CAGGCCGGAA AGCTCGCCGG CAACTGTCGC ACGGTGACGG GCAGGACGAT CGCCGAGAAC
CTAGAGGGCA AGGAAGCGCG CGACCGCGAG GTTATCAAGC CGTTCGCTGA GCCGCTGAAG
GAGCGGGCGG GCTTCCTCGT TCTCAAAGGC AATCTCTTCG ATTTCGCGAT CATGAAGATG
AGCGTGGTCT CGGAGGATTT CCGCCGGCGC TACCTTGAGG AACCCGGGCA CGAAGGCGTC
TTCGAGGGCA GGGCGGTGGT TTTCGACGGT TCCGAGGACT ATCACAAGCG CATCAACGAT
CCCGAACTCG GTATCGACGA AAACACCATC CTCGCCATCC GCGGCGCCGG GCCGATCGGC
TGGCCGGGTT CGGCTGAGGT CGTCAACATG CAGCCGCCGG ATCATCTCCT GAAGCGCGGC
ATCAGCAGCC TGCCGACGAT CGGCGACGGC CGCCAGTCGG GCACGGCGGA CAGTCCCTCG
ATCCTCAACG CCTCGCCGGA GAGTGCAGCG GGAGGCGGCC TCGCCTGGCT TCGTACCGGC
GATATCATCC GCATCGACTT CAACCACGGG CGCTGCGACA TGCTGGTCGA GGACGCCGAG
ATCGAACGGC GCAAGGGCGA CGGCATCCCG CCAGTGCCGG CGGATGCGAC GCCGTGGCAG
CAGATCTACC GCCGCTCGGT GACGCAATTG TCGGACGGCG CGGTGCTGGA GGGAGCGGCG
GAATTCCGCC AGATCGCAAA AAACCCGCCG CGGCACAACC ACTGA
 
Protein sequence
MTDSHSPKRR LRSQDWFDNP DHIDMAALYL ERFMNYGITP EELRSGKPVI GIAQSGSDLT 
PCNRVHVELA KRVRDGIRDA GGIPIEFPTH PIFENCKRPT AALDRNLAYL GLVEILYGYP
LDGVVLTTGC DKTTPSAIMA ASTVDIPAIV LSGGPMLDGW HEGELAGSGT VIWRMRRKYA
AGEIDREEFL QAALDSAPSV GHCNTMGTAS TMNALAEALG LSLTGCGAIP AAYRERGQMA
YRTGRRAVEI VFEDLKPSDI LTREAFLNAI RTNSAIGGST NAQPHLAAMA KHAGVELYPD
DWQVHGFDIP LLANVQPAGA YLGERFHRAG GTPAIMWELL QAGKLAGNCR TVTGRTIAEN
LEGKEARDRE VIKPFAEPLK ERAGFLVLKG NLFDFAIMKM SVVSEDFRRR YLEEPGHEGV
FEGRAVVFDG SEDYHKRIND PELGIDENTI LAIRGAGPIG WPGSAEVVNM QPPDHLLKRG
ISSLPTIGDG RQSGTADSPS ILNASPESAA GGGLAWLRTG DIIRIDFNHG RCDMLVEDAE
IERRKGDGIP PVPADATPWQ QIYRRSVTQL SDGAVLEGAA EFRQIAKNPP RHNH