Gene Rleg2_4003 details

Gene Information

Locus tag: Rleg2_4003
Symbol:
ID: 6982773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4170458
End bp: 4172326
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 63%
IMG OID: 643398732
Product: putative cellulose synthase protein
Protein accession: YP_002283491
Protein GI: 209551574
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
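
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4170458
end_bp = 4172326
gene_length_bp = 1869
protein_length_aa = 622


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)                # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions the listed gene length (1869 bp) and protein length (622 aa) are consistent with the listed coordinates.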


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCATCG GTTCGGAAAT CGGTCAACGC GCCAGGACGG CGAAAGCTGC CTCCGCGAAT 
GCTTTCGATG CTGTCTTCGC CGGCTGGAGC CGCGTCGTTT ACGGTTTCGG CATCCTCTGC
TGGCTGGCGG CACTCGGCTA TTTCTGGATC TGGTGGTGCC AGTCGGCTCA CATCATTTCC
TGGACGGCCT TCGTGCTCAT CACTCTCGTG CTGGGCTGGA TCACGCTTGT GCCGGCTTAT
TTCATCCTGA TCTTCCTCGA TGCGAGAACG GTCAGCCCTC ACGCGCGGCT GCCGGAAGGG
CGGGTGGCGA TGGTGGTCAC CAAGGCGCCA TCCGAGCCCT TTGCCGTGGT CCGCGCAACA
CTGCGGGCTA TGCTCGATCA GATCGGCGTC GATTTCGACG TATGGCTGGC CGACGAAGAT
CCTTCACAGG AAACCCGGCG CTGGTGTGCG GAGCACGGCG TGCTGATCTC CACCCGAAAG
GGGGTGGCGG AATATCATCG CGCCACCTGG CCGCGCCGGA CGCGCTGCAA GGAAGGCAAT
CTCGCCTATT TCTACGATCA CTTCGGCTAT GCGCGTTACG ATTTCGTCGC CCAGTTCGAT
GCCGATCATG TGCCGACGCC GACCTATCTC CGTGAGGTCC TGCGTCCCTT CGCCGACCCT
GGCATCGGCT ATGTCTCGGC GCCCAGCATC TGCGATGCCA ATGCCGCTAC GAGCTGGGCC
GCGCGCGGCA GGCTCTATGC CGAAGCCAGC CTGCACGGTT CGCTGCAGAC GGGTTACAAC
AATGGCTGGG CGCCGCTTTG CATCGGCTCG CATTATGCCG TTCGCACCTC AGCACTTCGG
GAGATCGGCG GCCTCGGCCC GGAACTCGCC GAAGACCATT CGACGACACT GATGATGAAT
GCCGGCGGCT GGCGTGGCGT ACATGCCGTC GATGCGATCG CCCATGGCGA CGGCCCGGCG
AATTTTGCCG ATCTCGTCGT CCAGGAATTC CAGTGGTCGC GCAGCCTCGT CACCATTCTG
CTGCAGCATT CCGGCCGTTA CATCGAGCAT CTGCCGTGGC GGCTGAAATT TCAGTTCGTG
TTCTCGCAGC TGTGGTATCC GCTCTTTTCC GCCTTCATGG CCGTCATGTT CCTGCTGCCG
GTTGCCGCCT TGCTGACGGG CCATGTCTTC GTCAACGTCA CCTATCCGGA TTTCCTGCTG
CATTTCGCAC CGATATCCGC CGTGCTGACG CTGTTTTCCG TGTTCTGGCG GGCGACGGGC
ACTTTCAGGC CGTATGATGC GAAACTGTTC GGCTGGGAAG GGCTCGCCTT CATCTTCCTG
CGCTGGCCGT GGTCGCTCGC CGGCAGCCTG GCCGCCATGC GCGATCGCAT CTCCGGCTCC
TTCGTCGATT TCCGCATCAC GCCGAAGGGA CAGCAGCAGC AGCATTCTCT GCCTCTGCGC
GTCGTCGCGC CCTATATCGC GCTCGCCGGG CTGTGCGCCG TCGCCATGGC GTTTGCCAGC
AAGGCCGCTG CCGCCCAGGG CTTCTACATC TTCGCCGCGA TGAACCTGTC GATCTATCTG
TCGCTGACGG TGCTGATCGT CGTTCGCCAT GCAATCGAGA ACGGTCTGCC GCTGCTGCCG
CAATCGCATG GGCTGCGGCT CGCGACCGCC ATGGGGCTTG CCATCTGGGT TACCGGCGGC
ATGCAACTGG GCGGCCATGG CCTCCGCAGC CTCGAAGCCC TTTCCCATGG CCAGCCATTC
GTCAGCTTCA CCGAAACGCA ATTTGCCGTA GCCGGCGCCG GCCTCGGCGG CGGCAAGACC
AAGATCACGA AATTCCGTCT CAAATGGAAT GGGTTCGGCA ACATGGGACG GGACCAACAG
GGTGTCTGA
 
Protein sequence
MSIGSEIGQR ARTAKAASAN AFDAVFAGWS RVVYGFGILC WLAALGYFWI WWCQSAHIIS 
WTAFVLITLV LGWITLVPAY FILIFLDART VSPHARLPEG RVAMVVTKAP SEPFAVVRAT
LRAMLDQIGV DFDVWLADED PSQETRRWCA EHGVLISTRK GVAEYHRATW PRRTRCKEGN
LAYFYDHFGY ARYDFVAQFD ADHVPTPTYL REVLRPFADP GIGYVSAPSI CDANAATSWA
ARGRLYAEAS LHGSLQTGYN NGWAPLCIGS HYAVRTSALR EIGGLGPELA EDHSTTLMMN
AGGWRGVHAV DAIAHGDGPA NFADLVVQEF QWSRSLVTIL LQHSGRYIEH LPWRLKFQFV
FSQLWYPLFS AFMAVMFLLP VAALLTGHVF VNVTYPDFLL HFAPISAVLT LFSVFWRATG
TFRPYDAKLF GWEGLAFIFL RWPWSLAGSL AAMRDRISGS FVDFRITPKG QQQQHSLPLR
VVAPYIALAG LCAVAMAFAS KAAAAQGFYI FAAMNLSIYL SLTVLIVVRH AIENGLPLLP
QSHGLRLATA MGLAIWVTGG MQLGGHGLRS LEALSHGQPF VSFTETQFAV AGAGLGGGKT
KITKFRLKWN GFGNMGRDQQ GV
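
The gene and protein sequences above are related through translation table 11, whose sense-codon assignments match the standard genetic code (the tables differ in which start codons are permitted; this gene starts with ATG, so that difference does not matter here). The following is a minimal sketch of how the two blocks could be compared and how a GC fraction could be computed. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the code assumes the sequence is pasted in as shown, with spaces between 10-base groups.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAGCATCG GTTCGGAAAT CGGTCAACGC"
print(translate(first_block))  # MSIGSEIGQR
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.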