Gene Rleg_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4084 
Symbol 
ID8014883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4156802 
End bp4158211 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content65% 
IMG OID644826653 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_002977864 
Protein GI241206768 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.774814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAC CGCGTACCCT CTACGACAAG ATCTGGGACG ATCATCTGGT CGACGAACAG 
CCGGACGGCA CCTGTCTTCT CTACATCGAC CGCCACCTGG TCCACGAAGT CACCTCGCCG
CAGGCGTTCG AAGGCCTGCG CATGACCGGC CGCAAGGTTC GCGCACCGGA AAAGACGCTC
GCCGTCGTCG ACCATAACGT TCCGACCTCG CCCGACCGCC ATCTCGGCAT CAAGAACGAG
GAAAGCCGCA TCCAGGTGGA AGCGCTGGCC ACCAACGCCG CCGAATTCGG CGTCGAATAT
TATTCGGCAA GCGACAAGCG CCAGGGCATC GTCCACATCG TCGGTCCGGA ACAGGGCTTC
ACCCTGCCCG GCATGACCAT CGTCTGCGGC GACAGCCACA CCTCGACGCA TGGCGCCTTC
GGGGCGCTGG CGCACGGAAT CGGCACCTCC GAGGTCGAGC ATGTGCTGGC GACCCAGACG
CTGATCCAGA AGAAGGCCAA GAACATGCTG GTGCGGGTCG ACGGCCTGCT TCCGCCGCAC
GTCACCGCCA AGGACATCAT CCTTGCCATC ATCGGCGAGA TCGGCACGGC CGGCGGCACC
GGCCACGTCA TCGAATTTGC CGGTGAAGCG ATCCGCGCGC TGTCGATGGA AGGCCGCATG
ACCGTCTGCA ACATGACGAT CGAGGGCGGC GCCCGCGCCG GCCTGATCGC CCCGGACGAA
AAGACCTTCG AATACATCAA GGGCAAGCCG CGCGCGCCGA AGGGCGAGGC GCTGGAACAG
GCGATCGCCT ACTGGAAGAC GCTGCAAACC GACGAGGGCG CTCATTACGA CCGCGTCGTC
GTCCTCGACG CCGCCAGCCT GCCGCCGATC GTTTCCTGGG GCTCCTCGCC CGAGGATGTC
ATCTCCGTCC AGGGCATCGT TCCGAACCCC GATGACATCC AGGACGAAAC CAAGCGCACC
TCCAAGTGGC GCGCGCTCGA CTATATGGGC CTGAAGCCGG GCACGAAGAT GACCGACATC
ACGCTCGACC GCGTCTTCAT CGGCTCCTGC ACCAACGGCC GCATCGAAGA TCTGCGCGAA
GTCGCCAAGG TCGTCGAAGG CAAGACGGTT GCCTCAACCG TCGACGCGAT GATCGTGCCG
GGCTCCGGCC TCGTCAAGGA ACAGGCGGAA GCCGAAGGCC TCGACAAGAT CTTCAAGGCC
GCCGGTTTCG ACTGGCGCGA ACCGGGCTGC TCCATGTGCC TTGCGATGAA CGACGACCGT
CTGAAGCCGG GCGAACGCTG CGCTTCGACC TCAAACCGCA ACTTCGAGGG CCGTCAGGGC
TTCAAGGGCC GCACGCACCT CGTCTCGCCG GCAATGGCCG CTGCAGCCGC GATCGCCGGG
CACTTCGTCG ATATCCGCGA GTGGAACTGA
 
Protein sequence
MSAPRTLYDK IWDDHLVDEQ PDGTCLLYID RHLVHEVTSP QAFEGLRMTG RKVRAPEKTL 
AVVDHNVPTS PDRHLGIKNE ESRIQVEALA TNAAEFGVEY YSASDKRQGI VHIVGPEQGF
TLPGMTIVCG DSHTSTHGAF GALAHGIGTS EVEHVLATQT LIQKKAKNML VRVDGLLPPH
VTAKDIILAI IGEIGTAGGT GHVIEFAGEA IRALSMEGRM TVCNMTIEGG ARAGLIAPDE
KTFEYIKGKP RAPKGEALEQ AIAYWKTLQT DEGAHYDRVV VLDAASLPPI VSWGSSPEDV
ISVQGIVPNP DDIQDETKRT SKWRALDYMG LKPGTKMTDI TLDRVFIGSC TNGRIEDLRE
VAKVVEGKTV ASTVDAMIVP GSGLVKEQAE AEGLDKIFKA AGFDWREPGC SMCLAMNDDR
LKPGERCAST SNRNFEGRQG FKGRTHLVSP AMAAAAAIAG HFVDIREWN