Gene Rleg_0207 details

Gene Information

Locus tag: Rleg_0207
Symbol:
ID: 8011435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 217195
End bp: 219102
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 62%
IMG OID: 644822800
Product: putative cellulose synthase protein
Protein accession: YP_002974057
Protein GI: 241202961
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
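
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the record does not state either convention. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(217195, 219102)
print(length_bp)                  # 1908
print(protein_length(length_bp))  # 635
```

Both values match the Gene Length and Protein Length fields reported in this record.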


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.489481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.127812
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGAAC TTGCGATTAT CCAAGCGACG CACGCTCCGG AACCCTCCCA CGAGCCGCTT 
CTTGTCCCGG TTCTGACCGG ACATCGGCGG ACCGAATATC TGTTCTCGGC GGCCGTATGG
GCCTGCGCCT TCGCCTATTT CTGGATCTGG TGGCTGGAGC CGCGCCATCA CGTCGATGCT
TTCGGCACAA TTACGGTGAG CCTCGTTCTT GCCTGGGTTA CCGCGCTGCC GGCCTATTTC
ATCGTCGTCT TCTATCGGGC GGCAAGACCG AACGGTCCCT TGCGCCTGCC GGCGGGCAGT
CGAGTGGCCA TGGTCGTGAC CAAGGCGCCG GCCGAACCCT TCTCGGTCGT CAGTGAAACC
CTGCTGGCAA TGCTTGCCCA GGAGGTGGAA CACGACACCT GGCTTGCCGA CGAAGACCCT
TCCCCCGAGA CTCTCGACTG GTGCTCCCGT CACGGGGTCC TCGTCTCGAC CCGCAAGGGG
CGATCCGATT ATCATCGGAC GTCGTGGCCC CGACGCACGC GGTGCAAGGA AGGCAACCTC
GCCTTCTTCT ACGACCACTA CGGCTACAGC CGCTACGATT TCGTGGCGCA ACTCGATGCG
GATCATGTTC CCGCGCCCGA CTATCTCTTC CACATGCTGC GTCCGTTCGG CGATCCAAAG
GTCGGCTATG TCTCGGCCCC CAGCATTTGC GACAAGAACG CTTCCGAAAG CTGGTCGGCA
CGTGGAAGGC TCTATGCCGA GGCCAGCATG CATGGTTCGC TCCAAGCCGG TTACAATGGC
GGCCTGGCGC CGATGTGCAT AGGGTCGCAT TACGCGGTAC GTACCGTCGC CCTCAAACAG
ATCGGCGGCC TCGGTCCGGA GTTGGCTGAA GACCATTCGA CGACATTGAT GATGAATGCC
GGCGGCTGGC GAGGTGTGCA TGCGCTGGAT GCCATCGCCC ATGGTGACGG CCCCAGAACC
TTCAGCGATC TCGTGACGCA GGAATTCCAG TGGTCGCGCA GCCTGGTCAT GGTGCTGTTG
CGGTACTCGC CGAGCCTCCT CGGCCGGCTG CCGGCCCGGC TCAAGTTCCA GTTCCTCTTT
TCGCAACTCT GGTATCCGCT TTTTGCGTTC TTTATGTTGC TCATGTTTGC TTTGCCGATC
ATCGCGCTCG TGCGCGGCCA GAATTTCGTG ACCGTGACCT ATCCCGATTT CCTGGCGCAT
TTCGCACCGC TTTCTATCGC TCTCGTGGTG ATGGCTTATC GCTGGCGCGC CAGCAGCTCG
TTCCGCCCCT ACGACGCCAA GATCCTGAGC TGGGAATGCA TGCTCTTTCT CTTCGCCCGC
TGGCCCTGGG CGCTCGCCGG CACGCTGGCT GCGGTGCACG ATTTTGTGAC AGGCTCCTTC
GTCGATTTCC GCGTCACGCC GAAAGGCCGG TCCGAAGTCG ATCTGCTGCC GGTGCGTGTC
CTGGCGCCCT ACGCTGTGCT GGCGCTCATG GCGGTATTGC CTGCCCTGGC AATCGCCGAT
GCCGGCGCCT CCAAGGGCGG TTACTTCTTT ACCATCCTGA ACGCCGCAAT CTACTGCGCG
CTGCTGTTGG TCATCGTCGG GCGTCATTCG AAGGAGAATA CTGTTGCGGC GGCCCCGCGT
TTCTACCGTC CGGCGATGGC GACCGTGCTT CTGGCGCTCG TCGCGTTGCC GGGGGCTGCA
ACCATCATGC GCGGAAAGGA TGCGACCGAG GCACTCGCCT GGGGCAGCGG CCGCCTTCAG
CTGTTCGAAG AGCGCTACGC CGCCTCCGGT GCAGGCCGGG GCGGGACCGC CCTGCGCACG
ATCTTCTTCA AACCACGATG GATTTCCGAT CCGGAGGAAA TCCAACTGAG AACGGATGCC
GCTCACCCGG ATGTGCGGCC GAGCGTGGAG GAAGTACACA ATGCATAG
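
The GC content field reported for this gene (62%) can be recomputed from the gene sequence. The sketch below is a minimal illustration, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases, and contains only A, C, G and T. The function name is hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return gc_count / len(bases)


# First two blocks of the gene sequence: 9 of 20 bases are G or C.
print(gc_fraction("GTGACCGAAC TTGCGATTAT"))
```

Running the same function over the complete gene sequence gives the value to compare with the reported percentage.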
 
Protein sequence
MTELAIIQAT HAPEPSHEPL LVPVLTGHRR TEYLFSAAVW ACAFAYFWIW WLEPRHHVDA 
FGTITVSLVL AWVTALPAYF IVVFYRAARP NGPLRLPAGS RVAMVVTKAP AEPFSVVSET
LLAMLAQEVE HDTWLADEDP SPETLDWCSR HGVLVSTRKG RSDYHRTSWP RRTRCKEGNL
AFFYDHYGYS RYDFVAQLDA DHVPAPDYLF HMLRPFGDPK VGYVSAPSIC DKNASESWSA
RGRLYAEASM HGSLQAGYNG GLAPMCIGSH YAVRTVALKQ IGGLGPELAE DHSTTLMMNA
GGWRGVHALD AIAHGDGPRT FSDLVTQEFQ WSRSLVMVLL RYSPSLLGRL PARLKFQFLF
SQLWYPLFAF FMLLMFALPI IALVRGQNFV TVTYPDFLAH FAPLSIALVV MAYRWRASSS
FRPYDAKILS WECMLFLFAR WPWALAGTLA AVHDFVTGSF VDFRVTPKGR SEVDLLPVRV
LAPYAVLALM AVLPALAIAD AGASKGGYFF TILNAAIYCA LLLVIVGRHS KENTVAAAPR
FYRPAMATVL LALVALPGAA TIMRGKDATE ALAWGSGRLQ LFEERYAASG AGRGGTALRT
IFFKPRWISD PEEIQLRTDA AHPDVRPSVE EVHNA
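
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial code named in the Translation table field), GTG is one of several alternative initiation codons and is read as methionine at the start of a CDS; elsewhere it encodes valine. The sketch below illustrates this with a hand-built codon table rather than a library call. The set of start codons and the function name are assumptions made for illustration, and the function expects an in-frame sequence containing only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiation codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCGAAC TTGCGATTAT CCAAGCGACG"))  # MTELAIIQAT
```

The output matches the first ten residues of the protein sequence in this record.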