Gene Rleg2_2909 details


Gene Information

Locus tag: Rleg2_2909
Symbol:
ID: 6981653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2963606
End bp: 2965345
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 63%
IMG OID: 643397619
Product: dihydroxy-acid dehydratase
Protein accession: YP_002282403
Protein GI: 209550486
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
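
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2963606
END_BP = 2965345


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1740
print(protein_length(length_bp))  # 579
```

Both results agree with the Gene Length and Protein Length fields.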


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGA AAGCAGAATG GCCGCGCAAG CTGCGCTCGC AGGAATGGTA TGGCGGCACC 
AGCCGCGACG TAATCTACCA TCGCGGCTGG CTGAAGAACC AGGGTTATCC GCATGACCTG
TTCGATGGCC GTCCGGTCAT CGGCATCCTG AATACCTGGT CTGATATGAC GCCGTGTAAC
GGCCATCTGC GCGAACTCGC CGAGAAGGTG AAGGCGGGTG TCTGGGAGGC CGGCGGCTTC
CCGCTCGAGG TGCCGGTGTT CTCGGCATCC GAAAACACTT TCCGCCCGAC CGCGATGATG
TATCGCAACC TCGCCGCGTT GGCGGTGGAA GAGGCGATCC GCGGCCAGCC GATGGACGGC
TGCGTGCTCT TGGTCGGCTG CGATAAGACC ACGCCGTCGC TGCTCATGGG GGCTGCCTCC
TGCGACCTGC CGTCGATCGT CGTCACCGGC GGGCCGATGC TGAACGGCTA TTTCCGCGGT
GAGCGTGTCG GTTCGGGCAC GCATCTGTGG AAGTTCTCCG AAATGGTGAA GGCCGGCGAG
ATGACGCAGG CCGAGTTCCT CGAGGCTGAG GCGTCGATGA GCCGTTCGTC GGGCACCTGC
AACACCATGG GCACCGCCTC CACCATGGCC TCCATGGCCG AGGCGCTCGG CATGGCACTA
TCAGGCAATG CCGCGATCCC GGGCGTCGAT TCCCGCCGCA AGGTCATGGC GCAGCTGACC
GGCCGCCGGA TCGTACAGAT GGTCAAGGAC GACCTGAAGC CCTCCGAGAT CATGACGAAA
CAGGCTTTCG AAAACGCCAT CCGCACCAAT GCGGCGATCG GCGGATCGAC CAACGCCGTC
ATCCACCTGC TTGCGATTGC CGGCCGCGTC GGCATCGATC TGTCGCTCGA CGACTGGGAC
CGCTGCGGCC GCGACGTTCC CACAATCGTC AACCTGATGC CGTCGGGCAA GTACCTGATG
GAAGAGTTCT TCTATGCCGG CGGCCTGCCG GTGGTGCTGA AGCGCCTCGG CGAGGCGGGC
CTGCTGCATA AGGATGCGCT GACGGTTTCT GGCGAAACCG TCTGGGACGA GGTCAAGGAC
GTCGTCAACT GGAATGAGGA CGTCATCCTG CCGGCCGAAA AGGCGCTGAC CTCTTCGGGC
GGCATCGTCG TGCTGCGCGG CAATCTGGCG CCGAAGGGCG CGGTGCTGAA GCCTTCGGCG
GCCTCGCCGC ATCTGTTGGT GCACAAGGGC AGGGCAGTCG TGTTCGAGGA TATCGACGAC
TACAAGGCGA AGATCAACGA CGACAATCTC GACATCGACG AAAACTGCAT CATGGTCATG
AAGAATTGCG GGCCGAAGGG TTATCCCGGG ATGGCCGAAG TCGGCAACAT GGGACTGCCG
CCGAAGGTGC TGAAGAAGGG CATCCTCGAC ATGGTGCGCA TTTCCGACGC CCGCATGTCC
GGAACGGCCT ACGGCACAGT TGTGCTGCAC ACCTCGCCGG AAGCGGCGGT CGGCGGGCCG
CTCGCGGTCG TGAAAAACGG CGACATGATT GAGCTCGATG TGCCGAACCG TCGTCTGCAT
CTCGACATTT CCGACGAGGA ATTGGCGCGG CGGCTGGCCG AATGGCAGCC GAACCACGAC
CTGCCGACAT CGGGTTATGC CTTCCTGCAT CAGCAGCATG TCGAAGGGGC CGATACCGGC
GCCGACCTCG ACTTCCTCAA GGGATGTCGC GGAAACGCGG TCGGCAAAGA CAGCCACTAA
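
The record reports a GC content of 63% for this gene. A minimal sketch of how such a figure could be recomputed from the sequence text above is given here. It assumes GC content means the fraction of G and C bases over the full gene length, and it strips the spaces and line breaks that separate the 10-base blocks. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(raw_sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(raw_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence between the quotes, spacing included,
# and compare the rounded percentage with the GC content field.
example = "GGCC AATT"
print(round(100 * gc_fraction(example)))  # 50
```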
 
Protein sequence
MKKKAEWPRK LRSQEWYGGT SRDVIYHRGW LKNQGYPHDL FDGRPVIGIL NTWSDMTPCN 
GHLRELAEKV KAGVWEAGGF PLEVPVFSAS ENTFRPTAMM YRNLAALAVE EAIRGQPMDG
CVLLVGCDKT TPSLLMGAAS CDLPSIVVTG GPMLNGYFRG ERVGSGTHLW KFSEMVKAGE
MTQAEFLEAE ASMSRSSGTC NTMGTASTMA SMAEALGMAL SGNAAIPGVD SRRKVMAQLT
GRRIVQMVKD DLKPSEIMTK QAFENAIRTN AAIGGSTNAV IHLLAIAGRV GIDLSLDDWD
RCGRDVPTIV NLMPSGKYLM EEFFYAGGLP VVLKRLGEAG LLHKDALTVS GETVWDEVKD
VVNWNEDVIL PAEKALTSSG GIVVLRGNLA PKGAVLKPSA ASPHLLVHKG RAVVFEDIDD
YKAKINDDNL DIDENCIMVM KNCGPKGYPG MAEVGNMGLP PKVLKKGILD MVRISDARMS
GTAYGTVVLH TSPEAAVGGP LAVVKNGDMI ELDVPNRRLH LDISDEELAR RLAEWQPNHD
LPTSGYAFLH QQHVEGADTG ADLDFLKGCR GNAVGKDSH
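
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It assumes the sequence is given 5' to 3' on the coding strand, in frame from the first base. It uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw_sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw_sequence.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence above (each row starts in frame,
# since every full row holds 60 bases).
first_row = "ATGAAGAAGA AAGCAGAATG GCCGCGCAAG CTGCGCTCGC AGGAATGGTA TGGCGGCACC"
last_row = "GCCGACCTCG ACTTCCTCAA GGGATGTCGC GGAAACGCGG TCGGCAAAGA CAGCCACTAA"
print(translate_cds(first_row))  # MKKKAEWPRKLRSQEWYGGT
print(translate_cds(last_row))   # ADLDFLKGCRGNAVGKDSH
```

The two outputs match the first 20 and last 19 residues of the protein sequence, and the final TAA codon is the stop that is excluded from the 579 aa count.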