Gene Rleg_3170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3170 
Symbol 
ID8014069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3171689 
End bp3173428 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content63% 
IMG OID644825736 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002976964 
Protein GI241205868 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TAGCGGAATG GCCGCGCAGG CTGAGGTCGC AGGAATGGTA CGGCGGTACG 
AGCCGCGACG TGATTTATCA CCGCGGCTGG CTTAAGAACC AGGGTTATCC GCACGACCTG
TTCGACGGCC GGCCGGTGAT CGGCATCCTC AACACCTGGT CGGATATGAC CCCCTGCAAC
GGTCATCTCA GAGAGCTCGC CGAGAAGGTG AAGGCGGGCG TATGGGAGGC CGGCGGCTTC
CCGCTCGAGG TGCCGGTGTT CTCGGCATCC GAAAACACCT TCCGCCCGAC CGCGATGATG
TACCGCAATC TTGCCGCGTT GGCGGTAGAA GAGGCGATCC GCGGCCAGCC GATGGATGGC
TGCGTATTGC TGGTGGGCTG CGACAAGACC ACGCCGTCGC TTATCATGGG GGCGGCTTCC
TGCGACCTGC CTTCTATCGT CGTCACAGGC GGACCGATGC TAAACGGCTA TTTCCGTGGC
GAACGGGTCG GCTCGGGCAC GCATCTGTGG AAGTTTTCCG AAATGGTGAA GGCCGGCGAG
ATGACGCAGG CCGAATTCCT CGAGGCTGAG GCCTCGATGA GCCGTTCGTC GGGCACCTGC
AACACCATGG GCACCGCCTC CACCATGGCT TCCATGGCCG AGGCACTCGG CATGGCTCTG
TCAGGCAACG CCGCGATCCC GGGCGTCGAT TCCCGCCGCA AGGTCATGGC GCAGCTGACC
GGCCGGCGTA TCGTGCAGAT GGTCAAGGAT GACCTGAAGC CCTCCGAGAT CATGACGAAG
CAAGCCTTCG AGAATGCCAT CCGCACCAAC GCGGCCATCG GCGGATCGAC CAACGCCGTC
ATCCACCTGC TCGCCATTGC CGGCCGTGTC GGCATCGATC TTTCGCTTGA CGACTGGGAT
CGCTGCGGCC GTGACGTCCC GACCATCGTC AACCTGATGC CGTCGGGCAA GTATCTGATG
GAGGAGTTCT TCTATGCCGG CGGCCTGCCG GTGGTGCTGA AGCGCCTCGG CGAGGCGGGC
CTGCTGCACA AGGATGCGCT GACGGTATCC GGCGAAACCG TCTGGGACGA GGTCAAGGAC
GTCGTCAACT GGAACGAAGA CGTTATCCTT CCGGCCGAAA AGGCGCTGAC AGCTTCGGGC
GGCATCGTCG TGCTGCGCGG CAATCTCGCG CCGAAGGGCG CGGTGTTGAA GCCTTCGGCG
GCTTCGCCAC ATCTGCTGGT GCACCGGGGC AGGGCGGTCG TGTTTGAGGA CATCGACGAC
TACAAGGCCA AGATAAACGA CGAGAACCTC GACATCGACG AAACCTGCAT CATGGTCATG
AAGAACTGTG GGCCGAAGGG CTATCCCGGG ATGGCCGAGG TCGGCAACAT GGGTCTGCCG
CCGAAGGTGC TCAAGAAGGG CATCCTCGAC ATGGTGCGTA TTTCCGACGC CCGCATGTCC
GGAACTGCCT ACGGTACCGT CGTGCTGCAC ACCTCGCCGG AAGCGGCGGT CGGCGGTCCG
CTCGCAGTCG TGAAAAACGG CGACATGATC GAGCTGGATG TGCCGAACCG TCGTCTGCAT
CTCGACATTT CCGATGAGGA ACTGGCGCGG CGGCTCGCTG AGTGGCAGCC GAACCATGAT
CTGCCCACAT CCGGCTATGC CTTCCTGCAT CAGCAGCATG TCGAAGGGGC CGATACCGGC
GCCGATCTCG ACTTCCTGAA GGGATGTCGT GGAAACGCGG TCGGCAAGGA CAGCCATTAA
 
Protein sequence
MKKIAEWPRR LRSQEWYGGT SRDVIYHRGW LKNQGYPHDL FDGRPVIGIL NTWSDMTPCN 
GHLRELAEKV KAGVWEAGGF PLEVPVFSAS ENTFRPTAMM YRNLAALAVE EAIRGQPMDG
CVLLVGCDKT TPSLIMGAAS CDLPSIVVTG GPMLNGYFRG ERVGSGTHLW KFSEMVKAGE
MTQAEFLEAE ASMSRSSGTC NTMGTASTMA SMAEALGMAL SGNAAIPGVD SRRKVMAQLT
GRRIVQMVKD DLKPSEIMTK QAFENAIRTN AAIGGSTNAV IHLLAIAGRV GIDLSLDDWD
RCGRDVPTIV NLMPSGKYLM EEFFYAGGLP VVLKRLGEAG LLHKDALTVS GETVWDEVKD
VVNWNEDVIL PAEKALTASG GIVVLRGNLA PKGAVLKPSA ASPHLLVHRG RAVVFEDIDD
YKAKINDENL DIDETCIMVM KNCGPKGYPG MAEVGNMGLP PKVLKKGILD MVRISDARMS
GTAYGTVVLH TSPEAAVGGP LAVVKNGDMI ELDVPNRRLH LDISDEELAR RLAEWQPNHD
LPTSGYAFLH QQHVEGADTG ADLDFLKGCR GNAVGKDSH