Gene Rleg_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1459 
Symbol 
ID8012548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1446089 
End bp1447927 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content64% 
IMG OID644824048 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002975290 
Protein GI241204194 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.141377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGTTT ACCGTTCCAG AACCACGACC CATGGCCGCA ACATGGCGGG CGCCCGCGGC 
CTTTGGCGCG CCACGGGCAT GAAGGATTCG GATTTCGGCA AGCCGATCAT CGCGGTGGTG
AATTCCTTCA CCCAGTTCGT GCCCGGCCAC GTGCACCTGA AGGACCTTGG CCAGCTCGTT
GCCCGCGAGA TCGAGGCGGC CGGCGGTGTC GCCAAGGAAT TCAACACGAT CGCCGTCGAT
GACGGCATCG CCATGGGCCA TGACGGCATG CTTTATTCGC TGCCCTCGCG TGAGCTCATC
GCCGACAGCG TCGAATATAT GGTCAATGCT CATTGCGCCG ACGCCATGGT CTGCATCTCC
AATTGCGACA AGATCACCCC CGGCATGCTG ATGGCGTCGC TGCGTCTCAA TATCCCGACC
GTCTTCGTCT CGGGCGGTCC GATGGAAGCC GGCAAGGTCG TGCTGCACGG CAAGACGCAT
GCGCTCGACC TGGTCGATGC CATGGTCGCC GCAGCCGATG ACAAGATCAG CGACGAGGAC
GTCCAGACCA TCGAACGCTC GGCCTGTCCG ACCTGTGGTT CCTGCTCCGG CATGTTCACC
GCCAATTCGA TGAACTGCCT GACGGAAGCC CTCGGCCTGT CGCTGCCCGG CAACGGCTCG
ACGCTTGCCA CCCATCTCGA CCGCAAGCGC CTCTTCGTCG AGGCCGGTCA TCTGATTGTC
GATCTCGCCC GCCGTTATTA CGAGCAGGAT GACGTCAAGG CGCTGCCGCG CACCATTGCC
TCCAAGCAGG CCTTCGAGAA TGCCATGACG CTCGATATCG CCATGGGCGG TTCCACCAAT
ACGGTCCTGC ACATTCTTGC CGCCGCCCAT GAAGGCGAGA TCGATTTCAA TATGGCCGAT
ATCGACGCGC TGTCGCGCCG CGTGCCGTGC CTGTCGAAGG TCGCACCCGC CAAGAGTGAC
GTGCATATGG AAGACGTCCA CCGCGCCGGC GGCATCATGT CGATCCTCGG CGAACTCGAC
AAGGGTGGTC TCTTGAACCG CGATTGCCCG ACGGTCCATG CCGAGACGCT GGGCGATGCG
ATCGATCGCT GGGATATCAC CCGCACGAAC AGCGAAACCG TGCGCAACTT CTATCGTGCC
GCACCCGGCG GCATCCCGAC CCAGGTCGCC TTCAGCCAGG AAGCCCGTTG GGACGATCTC
GACACCGATC GCGAGAACGG CGTCATCCGC TCGGTCGAGC ATCCCTTCTC CAAGGATGGC
GGCCTTGCCG TGCTCAAGGG CAACCTTGCG ATTGACGGCT GCATCGTCAA GACCGCTGGC
GTCGATGAAT CGATCCTGAA GTTCTCCGGC CCCGCCCGCG TCTTCGAAAG CCAGGATTCG
TCGGTCAAGG CGATCCTTGC CAACGAGGTG AAGGCCGGCG ACGTCGTCGT CATCCGCTAC
GAAGGTCCGA AGGGCGGCCC GGGCATGCAG GAAATGCTCT ATCCGACGAG CTATCTGAAG
TCGAAGGGCC TCGGCAAGGC ATGCGCGCTC ATCACCGACG GCCGCTTCTC CGGCGGCACT
TCCGGCCTCT CGATCGGCCA CGCCTCGCCG GAAGCGGCAA ATGGCGGCAC GATCGGCCTG
GTGCGCGAAG GCGACATGAT CGACATCGAC ATCCCCAACC GCACGATCAG CCTGCGTGTC
AGCGAGACTG AACTCGCCGC CCGCCGCGCC GAGCAGGACG CCAAGGGCTG GTACCCGGTC
GAAGTCCGCA AGCGCAATGT CACGACCGCG CTAAAGGCCT ACGCGGCCTT CGCAACGAGT
GCGGACCGCG GTGCCGTGCG CGATCTGAAC GCCCGCTGA
 
Protein sequence
MPVYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPT VFVSGGPMEA GKVVLHGKTH ALDLVDAMVA AADDKISDED
VQTIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHLDRKR LFVEAGHLIV
DLARRYYEQD DVKALPRTIA SKQAFENAMT LDIAMGGSTN TVLHILAAAH EGEIDFNMAD
IDALSRRVPC LSKVAPAKSD VHMEDVHRAG GIMSILGELD KGGLLNRDCP TVHAETLGDA
IDRWDITRTN SETVRNFYRA APGGIPTQVA FSQEARWDDL DTDRENGVIR SVEHPFSKDG
GLAVLKGNLA IDGCIVKTAG VDESILKFSG PARVFESQDS SVKAILANEV KAGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHASP EAANGGTIGL
VREGDMIDID IPNRTISLRV SETELAARRA EQDAKGWYPV EVRKRNVTTA LKAYAAFATS
ADRGAVRDLN AR