Gene Rleg2_1361 details

Gene Information

Locus tag: Rleg2_1361
Symbol:
ID: 6980089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1379692
End bp: 1381530
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 64%
IMG OID: 643396082
Product: dihydroxy-acid dehydratase
Protein accession: YP_002280881
Protein GI: 209548964
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
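
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 1379692
END_BP = 1381530


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1839 612
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.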


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0137226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGCCT ACCGTTCCAG AACCACGACC CACGGCCGCA ACATGGCAGG CGCGCGCGGC 
CTTTGGCGCG CCACGGGCAT GAAGGATTCG GATTTCGGCA AGCCGATTAT CGCGGTGGTG
AATTCTTTCA CCCAGTTCGT ACCCGGCCAC GTGCACCTGA AGGACCTCGG CCAGCTCGTT
GCCCGCGAAA TCGAGGCGGC CGGCGGTGTC GCCAAGGAAT TCAACACGAT CGCCGTCGAC
GACGGCATCG CCATGGGCCA TGACGGCATG CTCTATTCGC TGCCCTCGCG CGAACTCATC
GCCGACAGCG TCGAATACAT GGTCAATGCC CATTGCGCCG ACGCCATGGT CTGCATCTCC
AATTGCGACA AGATCACCCC CGGCATGCTG ATGGCGTCGT TGCGCCTCAA CATACCCACA
GTCTTCGTCT CGGGCGGCCC GATGGAAGCG GGCAAGGTGG TGCTGCACGG CAAGACGCAT
GCACTCGACC TCGTCGATGC CATGGTCGCC GCAGCCGATG AAAAGATCAG TGACGAGGAC
GTTCAGACCA TCGAGCGCTC GGCCTGTCCG ACCTGCGGCT CCTGCTCCGG CATGTTTACC
GCCAATTCGA TGAACTGCCT GACCGAGGCG CTCGGCCTGT CGCTGCCCGG CAACGGTTCG
ACGCTCGCAA CCCACGCCGA CCGCAAGCGC CTCTTCGTCG AGGCCGGTCA TCTGATCGTC
GATCTCGCCC GCCGTTACTA CGAGCAGGAC GATATCAAGG CGCTGCCGCG CACCATCGCC
TCCAAGCAGG CCTTCGAGAA TGCCATGGCG CTCGATATCG CCATGGGCGG CTCGACCAAT
ACGGTCCTGC ACATCCTTGC TGCTGCCCAT GAAGGCGAAA TCGATTTCAC CATGGCCGAT
ATCGACGCGC TCTCGCGCCG AGTGCCCTGC CTGTCGAAGG TCGCACCCGC CAAGAGCGAT
GTTCATATGG AAGACGTGCA CCGCGCCGGC GGCATCATGT CGATCCTCGG AGAGCTCGAT
AAGGGCGGTC TCTTGAACCG CAATTGCCCG ACAGTGCATG CCGAGACGCT GGGCGATGCG
ATCGACCGCT GGGATATCAC CCGCACCACC AGCGAAACGG TCCGCAACTT CTATCGTGCC
GCACCCGGCG GCATCCCGAC CCAGGTTGCC TTCAGCCAGG AGGCCCGCTG GGACGAACTC
GACACCGACC GCCAGAATGG CGTCATCCGC TCGGTCGAAC ATCCTTTCTC TAGGGATGGC
GGCCTTGCCG TGCTCAAGGG CAATCTCGCG GTCGACGGAT GCATCGTCAA GACGGCCGGC
GTCGATGAAT CGATCCTGAA ATTTTCAGGC CCGGCCCGTG TCTTCGAAAG CCAGGATGCC
TCCGTGAAGG CGATCCTCGC CAACGAAGTG AAGGCCGGCG ACGTCGTCGT CATTCGCTAC
GAAGGCCCGA AGGGCGGCCC CGGCATGCAG GAAATGCTCT ATCCGACGAG CTATCTGAAG
TCGAAGGGCC TCGGCAAGGC GTGCGCGCTG ATCACCGACG GCCGCTTCTC CGGCGGTACC
TCCGGCCTCT CGATCGGCCA CGCCTCGCCG GAAGCGGCCA ATGGCGGTAC GATCGGCCTG
GTGCGCGAAG GCGACATGAT CGACATCGAC ATCCCGAACC GCACGATCAG CCTGCGCGTG
GATGAGGCCG AACTCGCCGC CCGCCGCGCC GATCAGGACG CCAAGGGCTG GCATCCCGCA
GAAGTGCGCA AGCGCAACGT CACGACGGCG CTGAAGGCTT ATGCTGCCTT TGCGACGAGC
GCGGACCGCG GCGCCGTGCG CGATCTGAAC GCCCGCTGA
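
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch: `gc_content` is a hypothetical helper, and the record does not state how its own percentage was calculated or rounded, so a small difference from the listed value is possible.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Spaces and line breaks (the 10-base block separators) are ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence only; paste the full sequence to
# compare against the GC content field of the record.
first_row = "ATGCCAGCCT ACCGTTCCAG AACCACGACC CACGGCCGCA ACATGGCAGG CGCGCGCGGC"
print(f"{gc_content(first_row):.0%}")
```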
 
Protein sequence
MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPT VFVSGGPMEA GKVVLHGKTH ALDLVDAMVA AADEKISDED
VQTIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHADRKR LFVEAGHLIV
DLARRYYEQD DIKALPRTIA SKQAFENAMA LDIAMGGSTN TVLHILAAAH EGEIDFTMAD
IDALSRRVPC LSKVAPAKSD VHMEDVHRAG GIMSILGELD KGGLLNRNCP TVHAETLGDA
IDRWDITRTT SETVRNFYRA APGGIPTQVA FSQEARWDEL DTDRQNGVIR SVEHPFSRDG
GLAVLKGNLA VDGCIVKTAG VDESILKFSG PARVFESQDA SVKAILANEV KAGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHASP EAANGGTIGL
VREGDMIDID IPNRTISLRV DEAELAARRA DQDAKGWHPA EVRKRNVTTA LKAYAAFATS
ADRGAVRDLN AR
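
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below builds the codon table from the compact NCBI-style amino acid string, in which table 11 assigns the same amino acids as the standard code. It is a simplification: alternative start codons, which table 11 permits, are not converted to M, which is adequate here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons are ordered with the first base changing slowest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Spaces and line breaks in the input are ignored. Start codons other
    than ATG are translated as ordinary codons (a simplification).
    """
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGCCAGCCT ACCGTTCCAG AACCACGACC"))  # MPAYRSRTTT
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence should reproduce the remainder up to the terminal TGA stop codon.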