Gene Rleg2_1060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1060 
Symbol 
ID6979779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1077917 
End bp1079155 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content62% 
IMG OID643395772 
Productaminodeoxychorismate lyase 
Protein accessionYP_002280580 
Protein GI209548663 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.017221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGATA CGACGAACCA GAGCAACGAT ACCCCGCAGG GCGAGAACGG CCGGCAAGCG 
CAGAAGGGAC CGATCATCCC GAAGTCGCCG AGCGAAGCCC TGCGTCCCGA GCGCGTTCCG
GAGCCGCCGA AACGGTCCAA GAAAGCCCGC GGCCAGGTCG TTCTTTTCCT GAACTTCATC
ATGACGATGG CGGTATTGGT CTGCGTCGTC GCCGTCATCG GCTTTTATTA CGCCACATCG
ACCTACCGGA ACCCCGGTCC GCTGCAGACC AACACCAATT TCATCATCCG CAACGGCGCC
GGTCTCGCCG AAATTGCCTC CAACCTCGAG CGCAATGCGA TCATCAGCGA TGCCCGCATC
TTCCGCTATA TCACCGCAAC GCATCTGTCT GCGGGCGAGA GCCTCAAGGC CGGTGAATAT
GAGATCAAGG CCAGAGCCTC CATGAGCGAT ATCATGGAGC TTCTGAAGTC GGGCAAATCC
ATTCTCTATT CCGTTTCCTT CCCCGAGGGC CTGACGGTCC GCCAGATGTT CAACCGCATG
CTGGAGGATC AGGTACTGGA AGGCGACCTG CCGGCCGCAC TGCCGGCCGA GGGCAGCCTG
CGCCCGGATA CCTACAAGTT CTCGCGCGGC ACCAAGCGCG CGGAAATCAT CCAGCAGATG
GCGGCGGCAC AGCAGAAAAT CGTCGATCAG ATCTGGGACA AGCGCGACTC CTCCCTGCCG
CTGCGATCCA AGGAAGAATT CGTCACGCTC GCCTCGATCG TCGAAAAGGA AACCGGCGTT
GCCGACGAAC GCGCCCATGT CGCCTCCGTT TTCCTGAACC GGCTCGGCAA AGGCATGCGC
CTGCAGTCCG ATCCGACGAT CATCTACGGT CTCTTCGGCG GCGACGGCAA ACCGGCCGAC
CGGCCGATCT ACCAGTCGGA CCTGAAGCGC GAGACGCCAT ACAATACCTA TGTCATCAAG
GGGCTGCCGC CGACGCCGAT CGCCAATCCC GGTAAGGATG CGCTTGAGGC CGTCGCCAAT
CCCTGGAAGA CGCAGGACCT CTATTTCGTC GCCGACGGCA CCGGTGGCCA TGTTTTCGCG
GCGACGCTCG AGGAGCACAA TGCCAACGTC AAGCGCTGGC GCAAGCTCGA AGCCGACAAG
GGCTCGGACC CCAACATCGC CGTCGACGGC CAGCCGGAAG AGCAGCCGGC GGATGACGGC
GCTGCCGTCG TGCCGCCGAA GAAAAAGAAG ATCAACTGA
 
Protein sequence
MSDTTNQSND TPQGENGRQA QKGPIIPKSP SEALRPERVP EPPKRSKKAR GQVVLFLNFI 
MTMAVLVCVV AVIGFYYATS TYRNPGPLQT NTNFIIRNGA GLAEIASNLE RNAIISDARI
FRYITATHLS AGESLKAGEY EIKARASMSD IMELLKSGKS ILYSVSFPEG LTVRQMFNRM
LEDQVLEGDL PAALPAEGSL RPDTYKFSRG TKRAEIIQQM AAAQQKIVDQ IWDKRDSSLP
LRSKEEFVTL ASIVEKETGV ADERAHVASV FLNRLGKGMR LQSDPTIIYG LFGGDGKPAD
RPIYQSDLKR ETPYNTYVIK GLPPTPIANP GKDALEAVAN PWKTQDLYFV ADGTGGHVFA
ATLEEHNANV KRWRKLEADK GSDPNIAVDG QPEEQPADDG AAVVPPKKKK IN