Gene Information |
Locus tag | Rleg2_4209 |
Symbol | |
ID | 6982982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4386426 |
End bp | 4387475 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398940 |
Product | Cellulase |
Protein accession | YP_002283697 |
Protein GI | 209551780 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
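The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1050 bp) and that the unspliced CDS ends in a stop codon that is not counted as a residue. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon, so it does not contribute a residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4386426, 4387475)
print(length, protein_length_aa(length))  # 1050 349
```

Both values agree with the Gene Length and Protein Length fields of this record.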
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGGACGA GACGACATCT GACGGCGCTG CTGCTTGCGG CCGCTCTCGT CCCCTCGCCG ACCCTTGCGG CCGACCCGCC CTGCTATCGC GGCGTCAATC TTTCCGGCGG CGAATATGGT GAGCGCGACG GCATTTACGG CACGAATTAT AACTATCCCA GCGAAGAGAC GATCCGCTAT TTCGCCGAAA AGGGCATGAC GATCGTCCGG CTGCCCTTCC GCTGGGAGCG GTTGCAGCCG GCGCTGGGCG GCCGGCTCGA CGAGGACGAA CTCAAGCGGA TCAAGGATAC CGTCGGGCTG ATCCGCAAGC ACGGCATGGC CGTGCTGCTC GACCCGCATA ATTTCGGCTA TTACGACAAG GTGCAGGTCG GCACGGCGCC GGCGACGGAT GCCGCCTTCG GTGATTTCTG GGCAAGGCTT GCGGTCGAAT TCGCCAATCA GGACGGCGTG CTCTTCGGCC TGATGAACGA GCCGCACGAT ATCAAGGCAA CGGACTGGCT CGAGGCCGCC AATGCGGCGA TCCGCAGCAT CCGCGCCGTC GGCGCCCGCA ACCTCATCCT GGTGCCGGGC ACGGCCTGGA GCGGCGCTCA CAGCTGGGAG GAGGATGTGA TCGGCGGCGC CAACGGCACG GTGATGCTCG GCGTGCGCGA TCCGCTCGAC TTCTACGCCT ATGAGGTCCA CCAGTATCTC GACATTGATT CCTCCGGAAC CCATCCGACC TGCGAGGGTG CTACCGGCGC TGTCGAAGCG ATCGCCGGCG TCACCGCCTG GCTGAAGAAG AACCACAAGC GCGGCTTTCT CGGCGAGTTC GGCGCCGCTG CCGACAAGGA CTGCATGAGC GGGCTGACCG AGATCTATTC CACCATGTCC GATAATGGCG ACGCCTGGCT CGGCTGGTCC TATTGGGCCG CAGGCGAATG GTGGCCGGCC GACGAGCCGT TCAACGTCCA GCCGCGAAAG GGCGCTGAGC GGCCGCAGAT GCGGCTTCTC GTCAATTCGG CAAAAGCCAA AGCCGGCGCC TGCGCCAGCG TCAAGCCAGC GGGGAAGTGA
Protein sequence | MRTRRHLTAL LLAAALVPSP TLAADPPCYR GVNLSGGEYG ERDGIYGTNY NYPSEETIRY FAEKGMTIVR LPFRWERLQP ALGGRLDEDE LKRIKDTVGL IRKHGMAVLL DPHNFGYYDK VQVGTAPATD AAFGDFWARL AVEFANQDGV LFGLMNEPHD IKATDWLEAA NAAIRSIRAV GARNLILVPG TAWSGAHSWE EDVIGGANGT VMLGVRDPLD FYAYEVHQYL DIDSSGTHPT CEGATGAVEA IAGVTAWLKK NHKRGFLGEF GAAADKDCMS GLTEIYSTMS DNGDAWLGWS YWAAGEWWPA DEPFNVQPRK GAERPQMRLL VNSAKAKAGA CASVKPAGK
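The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the gene sequence could be cleaned and then checked for GC content and for a terminal stop codon. The stop codons listed are those of translation table 11; nothing here is taken from the database's own tooling.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the display spacing (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The first two ten-base blocks of the gene sequence in this record.
example = clean_sequence("ATGAGGACGA GACGACATCT")
print(len(example), gc_fraction(example))  # 20 0.5
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of this record.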