Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0140 |
Symbol | |
ID | 6978850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 134362 |
End bp | 136818 |
Gene Length | 2457 bp |
Protein Length | 818 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394851 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002279668 |
Protein GI | 209547751 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.143076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAAA AGACCGAGCT CAATTCCGGC TGGACGCTTC ACTGCAACGA TACAGGAAGG CCTGGCCTGC CGGAAACAAT CCCGGCGACG GTGCCGGGCT GCGTGCATCT CGATCTTCTC GCCAACCGGC TGATCCCCGA TCCCTATATC GACGTCAACG AGATCACCAA TGACTGGATC GGCAAGACCG ACTGGACCTA TCGCTGCACA TTCGAGGCCG CGCCTGACGA CGACACGGTG CAGGAACTCG TCTTCGACGG GCTCGATACG ATCGCGGTGA TCGCGCTGAA CGGCGAGGAG ATCGGCCGCA GCTTCAACAT GCACCGTACC TATCGCTTCG ATATTTCCGG GCTTCTGAAG GTGGGGGCCA ACGACCTCGC GGTCAGCTTC CGCTCCGCCT ATGCCTATGG CGCCGAGATG GAGAAGCACT ACGGCTACCG GCCTAACAAC TATCCGGGGC CGGGCAATCT GATGCGCAAG ATGGCCTGCA ATTTCGGCTG GGACTGGGGT CCGACGCTGG TGACGGCGGG ACTCTGGAAG AGGGTCAGGC TGGAAAGCTG GGATCAGGCG CGGCTTGCCG AAACACGGGT CTCGGCCACG CTTGCCGGCG GCGACGGGCT GGTGAAGGTG CATGCGAGGC TGGCGCGCCA TGGGGACGCG AAGCCGTGCC GGCTGGTCGC GACGATCGGC GGTGTGACGA CGACGGTTGC GATCGGCGCC GGAGAAGACA CCATCGCCTT CGAGCTTCGC CTGCCCTCGC CAAAACTCTG GTGGCCGCAT CATCTCGGCG CCCAGCCGCT CTATCCCCTG ACGCTCGAAC TGATCGACGA TGCCGGCGGC GACCTGCTCG ACAGCTATCA GCGGGCGCTT GGTTTCCGCT CGTTGAGGCT TGACACCTCG GCCGATGCGC ACGGCTCGGC CTTCACCTTC GTCATCAACG ACGTGCCGCT GTTCATCGCC GGCGCGAACT GGATTCCCGA CGATTGTTTC CCTTCGCGGG TGACGGCGGG GCGCTACGCC GCACGGATCG ACGAGGCGAA GGCCGCCAAT ATCCACATGC TGCGCGTCTG GGGCGGCGGC ATCTTCGAGC GCGACGAATT CTACGAGGCC TGCGACCGCA TGGGCATGCT GGTCTGGCAG GATTTCCTCT TTGCCTGCGC CGCCTATCCG GAGGAGGAGC CGCTGAAGAG CGAGGTCGAG GCCGAAGTGC GCGATAATGT CGTGCGGCTG ATGCCGCATG CCAGCCTGAT CCTCTGGAAC GGCAACAATG AGAATATCTG GGGCTTCGAC GAATGGGGCT GGCGGCCGGT CATCAAGGCC GATGAAAGCT GGGGGCTCGG TTATTATCTC GACCTGCTGC CGAGGCTCTC AGCCGAGCTC GATCCCGACC GGCCTTATTA TCCCGGCAGC CCCTATTCCG GTTCGATGGA GATCGCGCCG AATGCCGATG CGCATGGCTG CAAACATATC TGGGACGTCT GGAACGATGT CGGCTACGAG GTCTACCGCG ACTATGTCCC GCGCTTCTGC TCCGAATTCG GCTGGCAGGC GCCGGCCGCC TGGGCGACGA TCGAAGAAAG CGTGCACGAC CAGCCGCTGA CGCCGCAATC GAACGGCGTC TTCCACCATC AGAAGGCCAC CCAAGGCAAT GACAAGCTGA TCCGCGGCCT CTCCGGCCAC CTGCCGGAAC CGCAAACGAT GGACGACTGG CACTTCGCCA CCCAGCTCAA CCAGGCCCGC GCCATCCGCT TCGGCATCGA GCACATGCGC TCGCACCGCG ATATCTGCAA GGGCGCGGTG GTCTGGCAGT TCAACGATTG TTGGCCGGTG ACCTCCTGGG CCGCACTCGA CTCGGCCGGG CGCCGCAAGC CGCTCTGGTA TGCGCTGAGG GCCGCCTATG ATCCGCGCCT GCTGACCATT CAGCCGCGCG GCGACGGGCT TTCGGCGGTG GCGGTTAATG AGAGAACGCT GTTCTGGCGG GCGAAGATCA GCGGCAGGCG TTTGCGGCTC GACGGCAGCG TGCTGGCAGA GTTCGAATTC TGGCGGCTGC TCTGCGACCG CTTCGAGGCA AAAGAGTTTC CGCTGCCCGA AGATATCGTC AGTCCTGGCT TACCGAAAGA GGAGGTCGTT GTCGTCGAAA TGCTCGACAG GCGGGCCTTT CATTATTTCG TCGAGGATAT CGAGCTTGCC CTGCCGGCGC CGCGGCTGAG CGTCGATGTT GCTGCGAGCG ATGGCGGGTT TGCGGTTACG GTGACGGCTG AGAGTTTCCT CAAGGATCTC TGCCTGATGG CGGATCGGCT GGATCCGGAC GCCGTGGTCG ATACGATGCT GGTGACGCTG CTGCCGGGCG AGAGTCATGT GTTTGCGGTG AAGACGGCGA AGGGGATTTC GGCCAATGAC ATTGTCGTCG GTACAGTGCT GCGGTCGGCC AATGATCTTG TCGCAGGGCG GCAATAA
|
Protein sequence | MIEKTELNSG WTLHCNDTGR PGLPETIPAT VPGCVHLDLL ANRLIPDPYI DVNEITNDWI GKTDWTYRCT FEAAPDDDTV QELVFDGLDT IAVIALNGEE IGRSFNMHRT YRFDISGLLK VGANDLAVSF RSAYAYGAEM EKHYGYRPNN YPGPGNLMRK MACNFGWDWG PTLVTAGLWK RVRLESWDQA RLAETRVSAT LAGGDGLVKV HARLARHGDA KPCRLVATIG GVTTTVAIGA GEDTIAFELR LPSPKLWWPH HLGAQPLYPL TLELIDDAGG DLLDSYQRAL GFRSLRLDTS ADAHGSAFTF VINDVPLFIA GANWIPDDCF PSRVTAGRYA ARIDEAKAAN IHMLRVWGGG IFERDEFYEA CDRMGMLVWQ DFLFACAAYP EEEPLKSEVE AEVRDNVVRL MPHASLILWN GNNENIWGFD EWGWRPVIKA DESWGLGYYL DLLPRLSAEL DPDRPYYPGS PYSGSMEIAP NADAHGCKHI WDVWNDVGYE VYRDYVPRFC SEFGWQAPAA WATIEESVHD QPLTPQSNGV FHHQKATQGN DKLIRGLSGH LPEPQTMDDW HFATQLNQAR AIRFGIEHMR SHRDICKGAV VWQFNDCWPV TSWAALDSAG RRKPLWYALR AAYDPRLLTI QPRGDGLSAV AVNERTLFWR AKISGRRLRL DGSVLAEFEF WRLLCDRFEA KEFPLPEDIV SPGLPKEEVV VVEMLDRRAF HYFVEDIELA LPAPRLSVDV AASDGGFAVT VTAESFLKDL CLMADRLDPD AVVDTMLVTL LPGESHVFAV KTAKGISAND IVVGTVLRSA NDLVAGRQ
|
| |