Gene Rleg_4332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4332 
Symbol 
ID8015112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4452810 
End bp4454678 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content62% 
IMG OID644826908 
Productputative cellulose synthase protein 
Protein accessionYP_002978111 
Protein GI241207015 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00407382 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00579604 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAATG TTTCGCAAAT CGGCCATGGC GTCAGGACCG CCAAAGCCGT CAATGCTGAT 
GCATTCGATC TTGTTTTCAC CGGCTGGAAC CGTGTCGCCT ATGGCTTCGG CATTCTCTGC
TGGCTGACGG CACTCGGCTT TTTCTGGATC TGGTGGTGCC AGTCCGCCCA TATCATTTCT
TGGGCCACAT TCGTGCTCGT CACCCTCGTG CTTGCCTGGA TCACGCTTGT ACCGGCCTAT
TTCATCCTGA TCTTTGTCGA TGCGAGGACG GTCAGCCCTC GCGCAGGGCT GCCAGAAGGG
CGGGTGGCTA TGGTCGTCAC CAAGGCGCCG TCCGAGCCCT TTGCCGTCGT CCGCACCACT
TTGCAGGCGA TGCTCGACCA GATCGGCGTC GATTTCGATG TCTGGTTGGC TGACGAGGAC
CCTTCGGAGG AAACCAGGCG TTGGTGTGCG GCGCACGGCG TGCTGATCTC CACCCGAAAG
GGTGTGGCGG AGTATCATCG CACCACCTGG CCGCGCCGGA CGCGTTGCAA GGAAGGCAAC
CTCGCCTATT TCTACGACCA CTTCGGCTAT GCGCGCTACG ATTTCGTCGC CCAGTTCGAT
GCCGACCATG TGCCGACGCC GACCTACCTC CGTGAAGTCC TGCGCCCCTT TGCCGATCCC
GGCATCGGTT ATGTCTCCGC TCCCAGCATC TGCGATGCCA ATGCCGGGGC AAGATGGGCG
GCGCGCGGCA GGCTCTATGC CGAGGCAAGC CTGCACGGTT CGCTGCAGAC GGGTTACAAC
AATGGCTGGG CGCCTCTTTG CATCGGTTCC CATTATGCGG TTCGCACATC AGCCCTTCGT
CAGATCGGCG GCCTCGGCCC AGAACTTGCG GAAGACCATT CGACAACGCT GATGATGAAT
GCCGGCGGCT GGCGTGGCGT GCATGCGGTC GATGCGATCG CTCATGGCGA CGGACCGGCG
AGCTTCGCCG ATCTCGTCGT CCAGGAATTC CAGTGGTCGC GCAGCCTGGT CACCATTCTG
CTGCAACATT CCCGCCGCCA CATCATGCAC CTGCCTTGGC GGCTGAGGTT CCAGTTCGTG
TTCTCGCAGC TCTGGTACCC GCTCTTTTCC GTCTTCATGG CCATGATGTT CCTGCTGCCG
GTTGCCGCGC TGCTGACGGG CCGTGTCTTC GTCAATGTCA CCTATCCGGA TTTCCTGCTG
CATTTCGTGC CGATGTCGAT CGTACTGACG CTGTTTGCCT TCTTCTGGCG GGCAACGGCA
ACCTTCAGGC CGCACGATGC GAAACTGCTC GGCTGGGAAG GGCTTGCCTT CATCTTTCTG
CGCTGGCCGT GGTCGTTGGC CGGCAGTCTT GCTGCTGTTC GCGACTATAT CTGCGGCTCC
TTCGTCGATT TCCGCATCAC GCCGAAGGGA AAGCAGCAGC AGCGGTCTTT GCCGCTGCGC
GTCATCTCGC CCTATATCGG GCTCGCTGCC CTTTCCGCCG CTGCAATGAT GTTTGCGACC
GATGCCGCGG CCGCCCAGGG CTTTTATGTT TTCGCCATGA TCAACCTCTC GGTCTACCTG
TCGCTGACGG TGCTGATCGT CGTTCGCCAT GCGGTCGAAA ACGATCTGCC GCTGCTGCCG
CAATCGCGCG GGCTGTGGCT TGCGACAGCC ACCGGACTGG CGATTTTCGT CGCTGGCGGC
ACGCAGGCGG GGAGCCACGG CCTGCGCGGC CTCGAAGCTC TTTCGCACGG CCAGACGTTT
GTCAGTTTCA CCGAGACGCA ATTTGCCGTG GCCGGCGCCG GCCTCGGCGG CGGCAAGACC
AGGATCGTGA AATTCCACCT GAGATGGAAT GGTTTCGGCA GAACCGGCCG AGACGAACAG
GGTGCGTGA
 
Protein sequence
MSNVSQIGHG VRTAKAVNAD AFDLVFTGWN RVAYGFGILC WLTALGFFWI WWCQSAHIIS 
WATFVLVTLV LAWITLVPAY FILIFVDART VSPRAGLPEG RVAMVVTKAP SEPFAVVRTT
LQAMLDQIGV DFDVWLADED PSEETRRWCA AHGVLISTRK GVAEYHRTTW PRRTRCKEGN
LAYFYDHFGY ARYDFVAQFD ADHVPTPTYL REVLRPFADP GIGYVSAPSI CDANAGARWA
ARGRLYAEAS LHGSLQTGYN NGWAPLCIGS HYAVRTSALR QIGGLGPELA EDHSTTLMMN
AGGWRGVHAV DAIAHGDGPA SFADLVVQEF QWSRSLVTIL LQHSRRHIMH LPWRLRFQFV
FSQLWYPLFS VFMAMMFLLP VAALLTGRVF VNVTYPDFLL HFVPMSIVLT LFAFFWRATA
TFRPHDAKLL GWEGLAFIFL RWPWSLAGSL AAVRDYICGS FVDFRITPKG KQQQRSLPLR
VISPYIGLAA LSAAAMMFAT DAAAAQGFYV FAMINLSVYL SLTVLIVVRH AVENDLPLLP
QSRGLWLATA TGLAIFVAGG TQAGSHGLRG LEALSHGQTF VSFTETQFAV AGAGLGGGKT
RIVKFHLRWN GFGRTGRDEQ GA