Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4332 |
Symbol | |
ID | 8015112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4452810 |
End bp | 4454678 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826908 |
Product | putative cellulose synthase protein |
Protein accession | YP_002978111 |
Protein GI | 241207015 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00407382 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00579604 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAATG TTTCGCAAAT CGGCCATGGC GTCAGGACCG CCAAAGCCGT CAATGCTGAT GCATTCGATC TTGTTTTCAC CGGCTGGAAC CGTGTCGCCT ATGGCTTCGG CATTCTCTGC TGGCTGACGG CACTCGGCTT TTTCTGGATC TGGTGGTGCC AGTCCGCCCA TATCATTTCT TGGGCCACAT TCGTGCTCGT CACCCTCGTG CTTGCCTGGA TCACGCTTGT ACCGGCCTAT TTCATCCTGA TCTTTGTCGA TGCGAGGACG GTCAGCCCTC GCGCAGGGCT GCCAGAAGGG CGGGTGGCTA TGGTCGTCAC CAAGGCGCCG TCCGAGCCCT TTGCCGTCGT CCGCACCACT TTGCAGGCGA TGCTCGACCA GATCGGCGTC GATTTCGATG TCTGGTTGGC TGACGAGGAC CCTTCGGAGG AAACCAGGCG TTGGTGTGCG GCGCACGGCG TGCTGATCTC CACCCGAAAG GGTGTGGCGG AGTATCATCG CACCACCTGG CCGCGCCGGA CGCGTTGCAA GGAAGGCAAC CTCGCCTATT TCTACGACCA CTTCGGCTAT GCGCGCTACG ATTTCGTCGC CCAGTTCGAT GCCGACCATG TGCCGACGCC GACCTACCTC CGTGAAGTCC TGCGCCCCTT TGCCGATCCC GGCATCGGTT ATGTCTCCGC TCCCAGCATC TGCGATGCCA ATGCCGGGGC AAGATGGGCG GCGCGCGGCA GGCTCTATGC CGAGGCAAGC CTGCACGGTT CGCTGCAGAC GGGTTACAAC AATGGCTGGG CGCCTCTTTG CATCGGTTCC CATTATGCGG TTCGCACATC AGCCCTTCGT CAGATCGGCG GCCTCGGCCC AGAACTTGCG GAAGACCATT CGACAACGCT GATGATGAAT GCCGGCGGCT GGCGTGGCGT GCATGCGGTC GATGCGATCG CTCATGGCGA CGGACCGGCG AGCTTCGCCG ATCTCGTCGT CCAGGAATTC CAGTGGTCGC GCAGCCTGGT CACCATTCTG CTGCAACATT CCCGCCGCCA CATCATGCAC CTGCCTTGGC GGCTGAGGTT CCAGTTCGTG TTCTCGCAGC TCTGGTACCC GCTCTTTTCC GTCTTCATGG CCATGATGTT CCTGCTGCCG GTTGCCGCGC TGCTGACGGG CCGTGTCTTC GTCAATGTCA CCTATCCGGA TTTCCTGCTG CATTTCGTGC CGATGTCGAT CGTACTGACG CTGTTTGCCT TCTTCTGGCG GGCAACGGCA ACCTTCAGGC CGCACGATGC GAAACTGCTC GGCTGGGAAG GGCTTGCCTT CATCTTTCTG CGCTGGCCGT GGTCGTTGGC CGGCAGTCTT GCTGCTGTTC GCGACTATAT CTGCGGCTCC TTCGTCGATT TCCGCATCAC GCCGAAGGGA AAGCAGCAGC AGCGGTCTTT GCCGCTGCGC GTCATCTCGC CCTATATCGG GCTCGCTGCC CTTTCCGCCG CTGCAATGAT GTTTGCGACC GATGCCGCGG CCGCCCAGGG CTTTTATGTT TTCGCCATGA TCAACCTCTC GGTCTACCTG TCGCTGACGG TGCTGATCGT CGTTCGCCAT GCGGTCGAAA ACGATCTGCC GCTGCTGCCG CAATCGCGCG GGCTGTGGCT TGCGACAGCC ACCGGACTGG CGATTTTCGT CGCTGGCGGC ACGCAGGCGG GGAGCCACGG CCTGCGCGGC CTCGAAGCTC TTTCGCACGG CCAGACGTTT GTCAGTTTCA CCGAGACGCA ATTTGCCGTG GCCGGCGCCG GCCTCGGCGG CGGCAAGACC AGGATCGTGA AATTCCACCT GAGATGGAAT GGTTTCGGCA GAACCGGCCG AGACGAACAG GGTGCGTGA
|
Protein sequence | MSNVSQIGHG VRTAKAVNAD AFDLVFTGWN RVAYGFGILC WLTALGFFWI WWCQSAHIIS WATFVLVTLV LAWITLVPAY FILIFVDART VSPRAGLPEG RVAMVVTKAP SEPFAVVRTT LQAMLDQIGV DFDVWLADED PSEETRRWCA AHGVLISTRK GVAEYHRTTW PRRTRCKEGN LAYFYDHFGY ARYDFVAQFD ADHVPTPTYL REVLRPFADP GIGYVSAPSI CDANAGARWA ARGRLYAEAS LHGSLQTGYN NGWAPLCIGS HYAVRTSALR QIGGLGPELA EDHSTTLMMN AGGWRGVHAV DAIAHGDGPA SFADLVVQEF QWSRSLVTIL LQHSRRHIMH LPWRLRFQFV FSQLWYPLFS VFMAMMFLLP VAALLTGRVF VNVTYPDFLL HFVPMSIVLT LFAFFWRATA TFRPHDAKLL GWEGLAFIFL RWPWSLAGSL AAVRDYICGS FVDFRITPKG KQQQRSLPLR VISPYIGLAA LSAAAMMFAT DAAAAQGFYV FAMINLSVYL SLTVLIVVRH AVENDLPLLP QSRGLWLATA TGLAIFVAGG TQAGSHGLRG LEALSHGQTF VSFTETQFAV AGAGLGGGKT RIVKFHLRWN GFGRTGRDEQ GA
|
| |