Gene Information |
Locus tag | Rleg2_3403 |
Symbol | |
ID | 6982157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3515223 |
End bp | 3516596 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398121 |
Product | beta-galactosidase |
Protein accession | YP_002282896 |
Protein GI | 209550979 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
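
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as inclusive and 1-based. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check follows; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3515223
END_BP = 3516596


def gene_length(start: int, end: int) -> int:
    """Feature length under inclusive, 1-based coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1374 457
```

Both results match the "Gene Length" and "Protein Length" rows of the record.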
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.968269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.135443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATG CGAAGACGCT TGCAAGCCGC CTTCCCGGCG ATTTCACCTT CGGCGTCGCC ACCGCCGCCT TCCAGATCGA GGGTGCTGGT AAGGCCGACG GCCGCAAGCC ATCGATCTGG GATGCTTTCT GCAATATGCC CGGCCGTGTC TATAATCGCG ACAATGGCGA CGTCGCCTGC GACCACTATA ACCGGCTAGA GCAGGATCTC GATCTCATCA AGGATATGGG TGTCGAAGCC TACCGCTTCT CGATCGCCTG GCCGCGCATC ATCCCCGACG GCACCGGTGC GGTGAACGAG GCCGGGCTCG ATTTCTACGA TCGGCTGGTC GACGGCTGCA AGGCGCGCGG GATCAAGACC TTTGCGACGC TCTATCACTG GGACCTGCCA CTAATGCTTG CCGGCGACGG CGGCTGGACG GCGCGCTCGA CCGCCTATGC CTTTCAGCGC TACGCCAAGA CGGTGATGAA CCGGCTTGGC GATCGTCTCG ATGCCGTCGC GACCTTCAAC GAGCCCTGGT GCATCGTCTG GCTGAGCCAC CTCTACGGCA TCCACGCGCC GGGCGAGCGC AATATTCAGG CCGCCCTTCA CGCCATGCAC TACATGAACC TCGCCCACGG TCTCGGCGTC GAGGCGATCC GTGCGGAAGC CCCTGCGGTG CCCGTCGGGC TCGTGCTCAA CGCTGCCTCG ATCATCCCCG GTTCCGAGGG CCCGGCCGAT CTTGCCGCCA CTGAGCGCGC GCATCAGTTT CACAACGGCG CTTTCTTCGA TCCCGTCTTC AAGGGCGAAT ACCCCAAGGA ATTCGTTGAG GCGCTCGGCG ACCGCATGCC TGTCATCGAG GACGGCGACA TGACGCTGAT CAGCCAGAAA CTCGACTGGT GGGGTCTGAA TTATTACACG CCCGAGCGCG TCACTGACGA TGCCGAACGC AACGGCGATT TCCCCTGGAC GGTGAAAGCG CCGCCGGCAA GCGACGTCAA AACCGATATC GGCTGGGAAA TCTATGCGCC GGGATTGAAG CTGCTGGTCG AAAACCTTTA CCGCCGCTAC GAACTGCCGG AATGCTACAT CACTGAGAAC GGCGCTTGCG ACAACACCGG TGTCGTCGAC GGCGAAGTCG ACGATACGAT GCGTCTCGAT TATCTCGGCG ACCATCTCGA TGTCGTGGCC GGCCTTATCA AGGACGGTTA TCCCATGCGC GGCTATTTCG CCTGGAGCCT GATGGACAAT TTCGAATGGG CAGAAGGCTA CCGCATGCGC TTCGGCCTCG TCCATGTCGA TTATCAGACC CAGTTGCGTA CGGTGAAGAA GAGCGGCAAG TGGTATCGCG AACTCGCAGC ACAATTCCCG AAGGGCAATC ACAAGGCGGG TTAG
|
Protein sequence | MIDAKTLASR LPGDFTFGVA TAAFQIEGAG KADGRKPSIW DAFCNMPGRV YNRDNGDVAC DHYNRLEQDL DLIKDMGVEA YRFSIAWPRI IPDGTGAVNE AGLDFYDRLV DGCKARGIKT FATLYHWDLP LMLAGDGGWT ARSTAYAFQR YAKTVMNRLG DRLDAVATFN EPWCIVWLSH LYGIHAPGER NIQAALHAMH YMNLAHGLGV EAIRAEAPAV PVGLVLNAAS IIPGSEGPAD LAATERAHQF HNGAFFDPVF KGEYPKEFVE ALGDRMPVIE DGDMTLISQK LDWWGLNYYT PERVTDDAER NGDFPWTVKA PPASDVKTDI GWEIYAPGLK LLVENLYRRY ELPECYITEN GACDNTGVVD GEVDDTMRLD YLGDHLDVVA GLIKDGYPMR GYFAWSLMDN FEWAEGYRMR FGLVHVDYQT QLRTVKKSGK WYRELAAQFP KGNHKAG
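
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate the coding sequence. It is a hedged illustration, not the database's own method: the helper names are hypothetical, and the codon table uses the amino acid assignments that NCBI translation table 11 shares with the standard table. Although the record gives the strand as "-", the listed gene sequence begins with ATG and ends with TAG, so the sketch assumes it is already in coding orientation and does not reverse-complement it.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (T, C, A, G at each codon position);
# translation table 11 uses the same assignments as the standard table.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; unknown codons become 'X', stops become '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First three blocks of the gene sequence above.
prefix = join_blocks("ATGATCGATG CGAAGACGCT TGCAAGCCGC")
print(len(prefix))        # 30
print(translate(prefix))  # MIDAKTLASR
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `join_blocks` and `gc_fraction` would give a value to compare against the "GC content" row.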