Gene Rleg2_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1689 
Symbol 
ID6980426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1719389 
End bp1720315 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content63% 
IMG OID643396413 
Product5-dehydro-4-deoxyglucarate dehydratase 
Protein accessionYP_002281203 
Protein GI209549286 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR03249] 5-dehydro-4-deoxyglucarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.448176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACC CGATTGAATT GAAGAAGGCC GTCGGTAGTG GTCTCCTCTC GTTTCCGGTG 
ACGCATTTCG ACGATCAGCT GACATTCGAC GAGGCGAAAT ACCGTCGTCA TGTCGAATGG
CTTTCGGGTT TTGACGCGGC TGCGCTGTTT GCCGCCGGCG GCACGGGAGA GTTCTTCTCC
CTCAATCCGG CCGAAATCCC GCAGGTCGTC CGCGCCGCCA AGGCCTCGGC CGGCAAGACG
CCGATCATCT CGGGCACCGG CTACGGCACG TCGCTCGCCA TCGAGATCGC AAAGGCGGCC
GAGAAGGCAG GCGCGGACGG GCTGCTGCTG CTGCCGCCCT ATCTGATGTT TGCCGAGCAG
GCCGGCCTGA TCGCCCATGT CAAGGCAGTC TGCCAATCGG TCGGCATCGG CGTCATCGTC
TATAACCGCG ACAACGCCGT CCTGACCGCC GAGAGCATCG CGCGGCTTGC GGAGGAATGC
CCGAACCTGA TCGGTTTCAA GGACGGTGTC GGCGATGTCG ACAAGGTGAT CGAGATCACC
ACGCTGCTCG GCGACCGGCT GGTCTATGTC GGCGGCATGC CGACCCACGA GGTCTATGCG
CAAGCCTATT TCGCCGCCGG TGTAACGACC TATTCCTCGG CCGTCTTCAA CTTCGTCCCG
GCGCTGGCCC AGCGCTTTTA CGGCGCCTTG CGGACCGGCG ATCAGGCGAC CGTCGACGAA
ATCCTGAAGA GCTTCTTCTT CCCCTTCGTC GCCTTGCGCA ACCGCAAGAA GGGTTATGCC
GTCTCGATCA TCAAGGCCGG TCTGCGCGTG CTGGGGCAGA ACCCAGGCCC GGTGCGGCCG
CCGCTGACGG ATCTCAACCA GGAAGAACTG GCGCTCTTGG ACAAGATCGT CCAGGCCAAC
GGCGTCTCGC GGATCGCGGC GGAGTAG
 
Protein sequence
MMNPIELKKA VGSGLLSFPV THFDDQLTFD EAKYRRHVEW LSGFDAAALF AAGGTGEFFS 
LNPAEIPQVV RAAKASAGKT PIISGTGYGT SLAIEIAKAA EKAGADGLLL LPPYLMFAEQ
AGLIAHVKAV CQSVGIGVIV YNRDNAVLTA ESIARLAEEC PNLIGFKDGV GDVDKVIEIT
TLLGDRLVYV GGMPTHEVYA QAYFAAGVTT YSSAVFNFVP ALAQRFYGAL RTGDQATVDE
ILKSFFFPFV ALRNRKKGYA VSIIKAGLRV LGQNPGPVRP PLTDLNQEEL ALLDKIVQAN
GVSRIAAE