Gene Rleg_3955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3955 
Symbol 
ID8014770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4031372 
End bp4033084 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content65% 
IMG OID644826524 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002977735 
Protein GI241206639 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGAACG GTCTTCGCCG CAAGCTGACG AGTTATGGCG ATCAGGGATT CTCCCTTTTC 
CTGCGCAAAG CATTCATCAA GGCCATGGGT TATTCGGACG ATGCGCTCGA CCGTCCGATC
GTCGGCATCG CCAACACCTA CAGTGATTTC AACCCCTGTC ACGGCAATGT GCCGCAACTC
ATCGAGGCAA CCAAGCGCGG CGTGATGCTG ACGGGTGCGA TGCCCATGGT GTTTCCGACG
ATCTCGATCC ATGAGAGCTT TGCCTCGCCG ACCTCGATGT ATCTGCGCAA TCTGATGGCG
ATGGAAACGG AAGAGATGAT CCGCGCCCAG CCGATGGATG CCGTCGTCCT GATCGGCGGC
TGCGACAAGA CCCTGCCGGC CCAGATCATG GCCGCCGCAA GCAGCGAGAT ACCGGCGATC
TTCCTGCCGA CCGGCCCGAT GGCCGTCGGC CACCACAAGG GCGAACGGCT TGGCGCCTGT
ACGGATTGTC GCCGTTTCTG GGGGCGCTTC CGCGCCGGAG AGATCGACGA GGCCGAGATC
GCCGAGGTCA ACAACAAGCT GGCCAGCTCG ATCGGCACCT GCACGGTGAT GGGCACGGCG
AGCACCATGG CGAACCTGAC CGAAGTCATG GGTCTCTGCC TTCCCCGCGC CGGTTCGGCG
CCCGCCGTCG AATCCGAACG GGTTCGCCTT GCCGAGGAGA CGGGCCGTGT CGCTGCGCGT
CTCGCCATGG ATGAGGCCGC GCCGACGGTG CGCGATATCC TGACGCCGCA GGCAGTCCGC
AACGGCCTCG TCGCGCTTCA GGCGATGGGC GGCTCGACCA ATGCCGTCGT CCACCTTACC
GCAATCACCG GCCGCCTCGG CCTGCGCCTC GACATGGCCG AGCTCGACCG GCTGGGCCGG
AGCATTCCCC TGCTCGTCGA CCTGCAGCCC TCCGGCCAGC ATTATATGGA GCAGTTCCAC
GAGGCTGGCG GCGTCCCGGC TCTGTTGAAG GCGGTGCGCC ATGAAATCGA CGGCAGCGCC
CCGACGGTCT ACGGCAAAAC GATCGGCGAA ATCATCGACG CTGTCGTCGA CGAGCCGGGC
CAGACCATCA TCCGCACCGT CGAGAAACCG TTGAAGCCGA TCGGAACGAT CGCCGTCCTG
CACGGCAATC TTGCGCCGCG TGGCGCCGTC ATCAAACAAT CCGCAGCCTC CAAGGATCTG
CTCCAGCATA TCGGCCGCGC CGTGGTCTTC GATTCCGTCG AGGATATGAC GCTGCGCATC
GACAGCGACG ATCTCGACGT CCATGCCGAT GACATCCTGG TTCTGCGCAA TGCCGGGCCA
AAAGGTGCGC CCGGCATGCC GGAAGCCGGC TATCTACCCA TCCCGCGCAA GCTCGCGCGG
CAGGGCGTGA AGGACATGGT GCGGATTTCC GATGCCCGCA TGAGCGGCAC GGCCTTCGGC
ACGATCATCC TCCATATCGC GCCTGAAGCC GCCGATGGCG GCCCTCTGGC GATCGTCCGC
ACCGGCGATC GCATTCGCCT CGACGTCGAG GGCAGGCGCA TCGACCTTGA CATCGACCAG
GCGGAATTCG ATCGCCGCAT GCTTGATGTC GTCAACCGGC CCTCCCCGGC CCCGGCGCGC
GGCTATGCCA GGCTCTATCA GGAGCGCGTG CTCCAGGCTG ACGAAGGCGC CGACTTCGAC
TTCCTGCAAA GCGAGGAGTT CGACGCGCGA TGA
 
Protein sequence
MSNGLRRKLT SYGDQGFSLF LRKAFIKAMG YSDDALDRPI VGIANTYSDF NPCHGNVPQL 
IEATKRGVML TGAMPMVFPT ISIHESFASP TSMYLRNLMA METEEMIRAQ PMDAVVLIGG
CDKTLPAQIM AAASSEIPAI FLPTGPMAVG HHKGERLGAC TDCRRFWGRF RAGEIDEAEI
AEVNNKLASS IGTCTVMGTA STMANLTEVM GLCLPRAGSA PAVESERVRL AEETGRVAAR
LAMDEAAPTV RDILTPQAVR NGLVALQAMG GSTNAVVHLT AITGRLGLRL DMAELDRLGR
SIPLLVDLQP SGQHYMEQFH EAGGVPALLK AVRHEIDGSA PTVYGKTIGE IIDAVVDEPG
QTIIRTVEKP LKPIGTIAVL HGNLAPRGAV IKQSAASKDL LQHIGRAVVF DSVEDMTLRI
DSDDLDVHAD DILVLRNAGP KGAPGMPEAG YLPIPRKLAR QGVKDMVRIS DARMSGTAFG
TIILHIAPEA ADGGPLAIVR TGDRIRLDVE GRRIDLDIDQ AEFDRRMLDV VNRPSPAPAR
GYARLYQERV LQADEGADFD FLQSEEFDAR