Gene Information |
Locus tag | Rleg2_1060 |
Symbol | |
ID | 6979779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1077917 |
End bp | 1079155 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643395772 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_002280580 |
Protein GI | 209548663 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
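The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic follows, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the Gene Information table.
gene_len, protein_len = cds_lengths(1077917, 1079155)
print(gene_len, protein_len)  # 1239 412
```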
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.017221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGATA CGACGAACCA GAGCAACGAT ACCCCGCAGG GCGAGAACGG CCGGCAAGCG CAGAAGGGAC CGATCATCCC GAAGTCGCCG AGCGAAGCCC TGCGTCCCGA GCGCGTTCCG GAGCCGCCGA AACGGTCCAA GAAAGCCCGC GGCCAGGTCG TTCTTTTCCT GAACTTCATC ATGACGATGG CGGTATTGGT CTGCGTCGTC GCCGTCATCG GCTTTTATTA CGCCACATCG ACCTACCGGA ACCCCGGTCC GCTGCAGACC AACACCAATT TCATCATCCG CAACGGCGCC GGTCTCGCCG AAATTGCCTC CAACCTCGAG CGCAATGCGA TCATCAGCGA TGCCCGCATC TTCCGCTATA TCACCGCAAC GCATCTGTCT GCGGGCGAGA GCCTCAAGGC CGGTGAATAT GAGATCAAGG CCAGAGCCTC CATGAGCGAT ATCATGGAGC TTCTGAAGTC GGGCAAATCC ATTCTCTATT CCGTTTCCTT CCCCGAGGGC CTGACGGTCC GCCAGATGTT CAACCGCATG CTGGAGGATC AGGTACTGGA AGGCGACCTG CCGGCCGCAC TGCCGGCCGA GGGCAGCCTG CGCCCGGATA CCTACAAGTT CTCGCGCGGC ACCAAGCGCG CGGAAATCAT CCAGCAGATG GCGGCGGCAC AGCAGAAAAT CGTCGATCAG ATCTGGGACA AGCGCGACTC CTCCCTGCCG CTGCGATCCA AGGAAGAATT CGTCACGCTC GCCTCGATCG TCGAAAAGGA AACCGGCGTT GCCGACGAAC GCGCCCATGT CGCCTCCGTT TTCCTGAACC GGCTCGGCAA AGGCATGCGC CTGCAGTCCG ATCCGACGAT CATCTACGGT CTCTTCGGCG GCGACGGCAA ACCGGCCGAC CGGCCGATCT ACCAGTCGGA CCTGAAGCGC GAGACGCCAT ACAATACCTA TGTCATCAAG GGGCTGCCGC CGACGCCGAT CGCCAATCCC GGTAAGGATG CGCTTGAGGC CGTCGCCAAT CCCTGGAAGA CGCAGGACCT CTATTTCGTC GCCGACGGCA CCGGTGGCCA TGTTTTCGCG GCGACGCTCG AGGAGCACAA TGCCAACGTC AAGCGCTGGC GCAAGCTCGA AGCCGACAAG GGCTCGGACC CCAACATCGC CGTCGACGGC CAGCCGGAAG AGCAGCCGGC GGATGACGGC GCTGCCGTCG TGCCGCCGAA GAAAAAGAAG ATCAACTGA
|
Protein sequence | MSDTTNQSND TPQGENGRQA QKGPIIPKSP SEALRPERVP EPPKRSKKAR GQVVLFLNFI MTMAVLVCVV AVIGFYYATS TYRNPGPLQT NTNFIIRNGA GLAEIASNLE RNAIISDARI FRYITATHLS AGESLKAGEY EIKARASMSD IMELLKSGKS ILYSVSFPEG LTVRQMFNRM LEDQVLEGDL PAALPAEGSL RPDTYKFSRG TKRAEIIQQM AAAQQKIVDQ IWDKRDSSLP LRSKEEFVTL ASIVEKETGV ADERAHVASV FLNRLGKGMR LQSDPTIIYG LFGGDGKPAD RPIYQSDLKR ETPYNTYVIK GLPPTPIANP GKDALEAVAN PWKTQDLYFV ADGTGGHVFA ATLEEHNANV KRWRKLEADK GSDPNIAVDG QPEEQPADDG AAVVPPKKKK IN
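Both sequences are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any downstream use. The following is a minimal sketch, using only the Python standard library, of joining the blocks and computing a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, for illustration only.
first_blocks = "GTGAGCGATA CGACGAACCA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.55
```

Applied to the full gene sequence, the cleaned string should be 1239 characters long, matching the Gene Length field.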