Gene Information |
Locus tag | Rleg2_4003 |
Symbol | |
ID | 6982773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4170458 |
End bp | 4172326 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398732 |
Product | putative cellulose synthase protein |
Protein accession | YP_002283491 |
Protein GI | 209551574 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
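The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 4170458
end_bp = 4172326

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1869 bp) and Protein Length (622 aa).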
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCG GTTCGGAAAT CGGTCAACGC GCCAGGACGG CGAAAGCTGC CTCCGCGAAT GCTTTCGATG CTGTCTTCGC CGGCTGGAGC CGCGTCGTTT ACGGTTTCGG CATCCTCTGC TGGCTGGCGG CACTCGGCTA TTTCTGGATC TGGTGGTGCC AGTCGGCTCA CATCATTTCC TGGACGGCCT TCGTGCTCAT CACTCTCGTG CTGGGCTGGA TCACGCTTGT GCCGGCTTAT TTCATCCTGA TCTTCCTCGA TGCGAGAACG GTCAGCCCTC ACGCGCGGCT GCCGGAAGGG CGGGTGGCGA TGGTGGTCAC CAAGGCGCCA TCCGAGCCCT TTGCCGTGGT CCGCGCAACA CTGCGGGCTA TGCTCGATCA GATCGGCGTC GATTTCGACG TATGGCTGGC CGACGAAGAT CCTTCACAGG AAACCCGGCG CTGGTGTGCG GAGCACGGCG TGCTGATCTC CACCCGAAAG GGGGTGGCGG AATATCATCG CGCCACCTGG CCGCGCCGGA CGCGCTGCAA GGAAGGCAAT CTCGCCTATT TCTACGATCA CTTCGGCTAT GCGCGTTACG ATTTCGTCGC CCAGTTCGAT GCCGATCATG TGCCGACGCC GACCTATCTC CGTGAGGTCC TGCGTCCCTT CGCCGACCCT GGCATCGGCT ATGTCTCGGC GCCCAGCATC TGCGATGCCA ATGCCGCTAC GAGCTGGGCC GCGCGCGGCA GGCTCTATGC CGAAGCCAGC CTGCACGGTT CGCTGCAGAC GGGTTACAAC AATGGCTGGG CGCCGCTTTG CATCGGCTCG CATTATGCCG TTCGCACCTC AGCACTTCGG GAGATCGGCG GCCTCGGCCC GGAACTCGCC GAAGACCATT CGACGACACT GATGATGAAT GCCGGCGGCT GGCGTGGCGT ACATGCCGTC GATGCGATCG CCCATGGCGA CGGCCCGGCG AATTTTGCCG ATCTCGTCGT CCAGGAATTC CAGTGGTCGC GCAGCCTCGT CACCATTCTG CTGCAGCATT CCGGCCGTTA CATCGAGCAT CTGCCGTGGC GGCTGAAATT TCAGTTCGTG TTCTCGCAGC TGTGGTATCC GCTCTTTTCC GCCTTCATGG CCGTCATGTT CCTGCTGCCG GTTGCCGCCT TGCTGACGGG CCATGTCTTC GTCAACGTCA CCTATCCGGA TTTCCTGCTG CATTTCGCAC CGATATCCGC CGTGCTGACG CTGTTTTCCG TGTTCTGGCG GGCGACGGGC ACTTTCAGGC CGTATGATGC GAAACTGTTC GGCTGGGAAG GGCTCGCCTT CATCTTCCTG CGCTGGCCGT GGTCGCTCGC CGGCAGCCTG GCCGCCATGC GCGATCGCAT CTCCGGCTCC TTCGTCGATT TCCGCATCAC GCCGAAGGGA CAGCAGCAGC AGCATTCTCT GCCTCTGCGC GTCGTCGCGC CCTATATCGC GCTCGCCGGG CTGTGCGCCG TCGCCATGGC GTTTGCCAGC AAGGCCGCTG CCGCCCAGGG CTTCTACATC TTCGCCGCGA TGAACCTGTC GATCTATCTG TCGCTGACGG TGCTGATCGT CGTTCGCCAT GCAATCGAGA ACGGTCTGCC GCTGCTGCCG CAATCGCATG GGCTGCGGCT CGCGACCGCC ATGGGGCTTG CCATCTGGGT TACCGGCGGC ATGCAACTGG GCGGCCATGG CCTCCGCAGC CTCGAAGCCC TTTCCCATGG CCAGCCATTC GTCAGCTTCA CCGAAACGCA ATTTGCCGTA GCCGGCGCCG GCCTCGGCGG CGGCAAGACC 
AAGATCACGA AATTCCGTCT CAAATGGAAT GGGTTCGGCA ACATGGGACG GGACCAACAG GGTGTCTGA
|
Protein sequence | MSIGSEIGQR ARTAKAASAN AFDAVFAGWS RVVYGFGILC WLAALGYFWI WWCQSAHIIS WTAFVLITLV LGWITLVPAY FILIFLDART VSPHARLPEG RVAMVVTKAP SEPFAVVRAT LRAMLDQIGV DFDVWLADED PSQETRRWCA EHGVLISTRK GVAEYHRATW PRRTRCKEGN LAYFYDHFGY ARYDFVAQFD ADHVPTPTYL REVLRPFADP GIGYVSAPSI CDANAATSWA ARGRLYAEAS LHGSLQTGYN NGWAPLCIGS HYAVRTSALR EIGGLGPELA EDHSTTLMMN AGGWRGVHAV DAIAHGDGPA NFADLVVQEF QWSRSLVTIL LQHSGRYIEH LPWRLKFQFV FSQLWYPLFS AFMAVMFLLP VAALLTGHVF VNVTYPDFLL HFAPISAVLT LFSVFWRATG TFRPYDAKLF GWEGLAFIFL RWPWSLAGSL AAMRDRISGS FVDFRITPKG QQQQHSLPLR VVAPYIALAG LCAVAMAFAS KAAAAQGFYI FAAMNLSIYL SLTVLIVVRH AIENGLPLLP QSHGLRLATA MGLAIWVTGG MQLGGHGLRS LEALSHGQPF VSFTETQFAV AGAGLGGGKT KITKFRLKWN GFGNMGRDQQ GV
|
| |
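The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, translated, and checked for GC content follows. The function names are hypothetical. The amino-acid string is the standard codon assignment in TCAG order, which translation table 11 shares with the standard code for amino acids (it differs in which codons may act as starts). Only the first three and last nine characters of the gene sequence are used here as sample input.

```python
BASES = "TCAG"
# Codon-to-amino-acid assignments in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = dna[i:i + 3]
        index = 16 * BASES.index(first) + 4 * BASES.index(second) + BASES.index(third)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Sample input: the first three blocks and the final nine bases of the gene sequence.
head = "ATGAGCATCG GTTCGGAAAT CGGTCAACGC"
tail = "GGTGTCTGA"
print(translate(head), translate(tail))
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence followed by a single `*`, and `gc_percent` should land near the listed GC content, assuming the sequence text is pasted in unchanged.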