Gene Information |
Locus tag | Rleg_4499 |
Symbol | |
ID | 8015260 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4633305 |
End bp | 4634354 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827075 |
Product | Cellulase |
Protein accession | YP_002978276 |
Protein GI | 241207180 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
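The length fields above follow from the coordinates. A minimal sketch, assuming Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon (the function names are illustrative and not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4633305, 4634354)
print(length_bp, protein_length(length_bp))  # 1050 349
```

Both values agree with the Gene Length and Protein Length fields reported for Rleg_4499.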
Plasmid Coverage Information |
Number of covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.591469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Number of covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGA CACGACATCT GACAGCGCTG CTGTTTGCGG CCGCTCTCAC CCCGTCGCCG GTCCTTGCGG CTGAGGCCCC TTGCTACCGC GGCGTCAATT TGTCCGGCGG TGAATATGGC GAGCGCGGCG GCATCTACGG CACCAACTAC ACCTACCCGA GCGAAGACAC GATCGGCTAT TTCGCCAAGA AGGGCATGAC GATTATCCGG CTGCCCTTCC GCTGGGAGCG GCTGCAGCCC GCACTCGGCG GGCGGCTCGA CGAGGATGAG CTCAAGCGGA TCAAAGATAC GATCGGCCTG ATCCGCAAGC ACGGCATGGC GGTTCTGCTC GACCCGCATA ATTTCGGCTA TTACGACAAG ACCCAGGTCG GCACAGCGCC GGCGACGGAT GCCGCCTTCG GTGACTTCTG GGCAAGGCTC GCCGTCGAAT TCGCCAATCA GGACGGCGTT CTCTTCGGCC TGATGAACGA ACCGCACGAC ATCAAGGCGA CCGACTGGCT GGATGCGGCC AATGCGGCGA TCCGCAGCAT CCGCGCTGTC GGCGCGCGCA ACCTCATCTT GGTGCCGGGC ACCGCCTGGA GCGGCGCGGG CAGCTGGGAA AAGGATGTGA TCGGCGGCGC CAACGGCACG GTGATGCTCG GTGTGCGCGA TCCGCTCAAT TTCTACGCCT ATGAGGTCCA CCAGTATCTC GATGCCGATT CCTCCGGCAC CCATCCGACC TGTGAAGGTG CGTCCGCCGC GGTCGCGGCG ATCAACGGCG TTACCGCCTG GTTGAAGCAG AACCACAAGC GCGGTTTTCT CGGCGAATTT GGCGCCTCCA CCGACAAGGA CTGCATGAGC GGGCTGACCG AAATCTACGC CACCATGTCC GGCAATAGCG ATGTGTGGCT CGGCTGGTCC TACTGGGCGG CCGGCGATTG GTGGCCGGCG GACGAGCCGT TCAACGTCCA GCCGCGCAAG GGCCCTGAGC GGCCGCAGAT GCGGCTTCTT GCCGAGGCGG CAAAAGCCGG TGCCGGCATT TGCTCCGCCG TCAAACCCGC GGGGAAATGA
|
Protein sequence | MRTTRHLTAL LFAAALTPSP VLAAEAPCYR GVNLSGGEYG ERGGIYGTNY TYPSEDTIGY FAKKGMTIIR LPFRWERLQP ALGGRLDEDE LKRIKDTIGL IRKHGMAVLL DPHNFGYYDK TQVGTAPATD AAFGDFWARL AVEFANQDGV LFGLMNEPHD IKATDWLDAA NAAIRSIRAV GARNLILVPG TAWSGAGSWE KDVIGGANGT VMLGVRDPLN FYAYEVHQYL DADSSGTHPT CEGASAAVAA INGVTAWLKQ NHKRGFLGEF GASTDKDCMS GLTEIYATMS GNSDVWLGWS YWAAGDWWPA DEPFNVQPRK GPERPQMRLL AEAAKAGAGI CSAVKPAGK
|
| |
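The sequence fields can be cross-checked against each other and against the GC content field. The sketch below is an illustration under stated assumptions: the gene sequence is taken to be the coding strand as displayed (it starts with ATG even though the gene lies on the - strand), the spaces every 10 characters are display formatting only, and the helper names clean, gc_fraction and translate are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code, so the codon table is built from the NCBI amino-acid string.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid string in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two display blocks and the final 12 bases of the gene sequence.
print(translate("ATGAGGACGA CACGACATCT"))  # MRTTRH
print(translate("GCGGGGAAATGA"))           # AGK
```

The two fragments reproduce the start (MRTTRH) and end (AGK) of the protein sequence. Passing the full Gene sequence field to translate and comparing the result with the cleaned Protein sequence field is one way to confirm that the two rows agree; gc_fraction on the full field can likewise be compared with the reported 64%.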