Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3701 |
Symbol | |
ID | 8014539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3749922 |
End bp | 3751295 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826264 |
Product | beta-galactosidase |
Protein accession | YP_002977483 |
Protein GI | 241206387 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATC CGAAGAAACT CGCGGAGCGC TTTCCCGGCG ACTTCATTTT CGGCGTTGCC ACCGCTGCCT TCCAGATCGA AGGCGCCAGC AAAGCTGATG GCCGCAAGCC ATCCATCTGG GATGCTTTCT GCAATATGCC TGGCCGCGTC CATAATCGCG ATAACGGCGA TGTTGCCTGC GATCACTACA ATCGCCTGGA ACAGGACCTC GATCTCATCA AGGAGATGGG TGTCGAAGCC TACCGGTTCT CGATTGCCTG GCCGCGCATT ATCCCGGACG GCACGGGTCC GGTGAACGAG GCCGGCCTCG ATTTCTACGA CCGGCTGGTC GACGGCTGCA AGGCGCGCGG TATCAAGACC TTCGCGACGC TCTACCACTG GGATCTGCCT CTGTTGCTCG CCGGCGAGGG CGGGTGGACC GCGCGCTCGA CCGCCTATGC GTTTCAGCGC TATGCCAAGA CGGTGATGAA CCGCCTTGGC GATCGCCTCG ACCGGGTCGC GACCTTCAAC GAACCCTGGT GCATCGTCTG GCTCAGCCAC CTCTACGGCA TCCATGCTCC GGGCGAGCGC AACATGCAGG CTGCCCTTCA CGCCATGCAT TACATGAACC TTGCCCACGG CCTCGGCGTC GAGGCGATCC GCTCGGAAGC GCCCAATGTG CCGGTCGGCC TCGTGCTCAA TGCCGCCTCG ATCATTCCCG GCTCCGACAG CCCGGCCGAC CTTGCCGCGG GCGAGCGGGC GCATCAGTTC CACAACGGCG CTTTCTTCGA TCCCGTCTTC AAGGGTGAAT ATCCGAAGGA ATTCGTCGCG GCGCTCGGCG ACCGCATGCC TGTCATCGAG GATGGCGACC TGAAGCTCAT CAGCCAGAAA CTCGATTGGT GGGGCCTGAA CTATTATAAG CCCGAACGCG TCACCGACGA TGCCGAACGC AAAGGCGATT TCCCCTGGAC GGTGGAGGCG CCGCCGGCAA GCGACGTCAA GACCGATATC GGCTGGGAAA TCTATGCGCC GGGCCTGAAG CTCTCGATCG AGGATCTCTA CCGCCGCTAC GAACTGCCGG AATGCTACAT CACCGAGAAC GGCGCCTGCG ACAACACCGA TGTCATCGAC GGCGAGGTCG ACGATACGAT GCGCCTCGAC TATGTCGGAG ATCACCTCGA AATCGTTGCC GGCCTCATCA AGGACGGCTA TCCCCTACGC GGCTATTTCG CCTGGAGCCT GATGGATAAT TTCGAATGGG CGGAAGGCTA CCGCATGCGC TTCGGCCTGG TCCATGTCGA TTATGAAACC CAGCTGCGCA CGGTGAAGAA GAGCGGCAAG TGGTATCGCC AACTCGCGGC GCAATTCCCG AAGGGCAATC ACAAGGCGGT TTAG
|
Protein sequence | MIDPKKLAER FPGDFIFGVA TAAFQIEGAS KADGRKPSIW DAFCNMPGRV HNRDNGDVAC DHYNRLEQDL DLIKEMGVEA YRFSIAWPRI IPDGTGPVNE AGLDFYDRLV DGCKARGIKT FATLYHWDLP LLLAGEGGWT ARSTAYAFQR YAKTVMNRLG DRLDRVATFN EPWCIVWLSH LYGIHAPGER NMQAALHAMH YMNLAHGLGV EAIRSEAPNV PVGLVLNAAS IIPGSDSPAD LAAGERAHQF HNGAFFDPVF KGEYPKEFVA ALGDRMPVIE DGDLKLISQK LDWWGLNYYK PERVTDDAER KGDFPWTVEA PPASDVKTDI GWEIYAPGLK LSIEDLYRRY ELPECYITEN GACDNTDVID GEVDDTMRLD YVGDHLEIVA GLIKDGYPLR GYFAWSLMDN FEWAEGYRMR FGLVHVDYET QLRTVKKSGK WYRQLAAQFP KGNHKAV
|
| |