Gene Rleg2_1206 details


Gene Information

Locus tag: Rleg2_1206
Symbol:
ID: 6979926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1218145
End bp: 1219188
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 64%
IMG OID: 643395919
Product: Cellulase
Protein accession: YP_002280726
Protein GI: 209548809
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
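
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record; the coordinate convention (1-based,
# inclusive) is an assumption that happens to match the listed gene length.
START_BP = 1218145
END_BP = 1219188


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
```

Under these assumptions the computed values agree with the listed gene length of 1044 bp and protein length of 347 aa.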


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.364245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGGT GGCGCGCGCT CCTGCTGGCG GCCTCTGTCG CGCTTGCACC GGCTCTGCCG 
GCCACCGCGC AGCAGGCGAT GATCAATGCC GACGCGTGGT CGGCCTACAA GGCGAAGTTT
CTCGATCCGA GCGGCCGCAT CGTCGACAAC GGCAACGGCA ACATCAGTCA CAGCGAAGGG
CAGGGCTACG GCCTGCTGCT CGCCTATCTC TCGGCAAGCC CGGCCGATTT CGAGCAGATC
TGGTATTTTA CCCGTACCGA GCTGCTGCTG CGCGACGACG GCCTGGCCGT TTGGAAATGG
GATCCGAACG TCAAGCCGCA CGTGGCCGAC ACCAACAATG CCACCGACGG TGACATGCTG
ATCGCCTATG CGCTGGCGCT TGCCGGCACG GCATGGAAAC GGGAAGACTA TATCCTCGCT
GCCTCCCGCA TGGCGCAGGC GCTGCTTGCC GAAACCGTCG GCAGCTCGCA GGGCCGCACC
TTGCTGATGC CGGGAACCGA AGGGTTTACC GGCAGCGACC GCGACGATGG TCCCGTCGTC
AACCCGTCCT ACTGGATTTA TGAGGCGATC CCGGTGATGG CAGCGCTCGC GCCGTCGGAT
GCCTGGCAAA AACTGTCGAA TGACGGCGTG GAGCTGTTGA AGACGATGCA ATTCGGCCCG
CGCAAGCTTC CCGCCGAATG GGTGAGCCTG CACGACAAGC CGCGGCCGGC AGAGGGGTTC
GACGCCGAAT TCGGCTACAA CGCCATCCGC ATCCCGCTCT ATCTCGCCCG CGGCGGCATC
ACCGACAAGG CACTGCTCAT CCGCCTGCAA AAGGGGATGT CGCAAGACGG CGTTCCCGCC
ACCATTGATC TGACCACCGG CCGGCCGAAG ACCGTGCTGT CGGACCCCGG TTATAGAATT
GTTAACGATG TTGTGGCCTG TGTTGTCGAT GGGACCAGGC TGCCGAGTTC GGCGCTGCAG
TTTGCCCCCG CGCTCTATTA TCCGTCCACC CTTCAACTGC TGGGGCTGGC CTATATCGGG
GAGAAGCATC CGGAGTGTCT GTGA
 
Protein sequence
MRRWRALLLA ASVALAPALP ATAQQAMINA DAWSAYKAKF LDPSGRIVDN GNGNISHSEG 
QGYGLLLAYL SASPADFEQI WYFTRTELLL RDDGLAVWKW DPNVKPHVAD TNNATDGDML
IAYALALAGT AWKREDYILA ASRMAQALLA ETVGSSQGRT LLMPGTEGFT GSDRDDGPVV
NPSYWIYEAI PVMAALAPSD AWQKLSNDGV ELLKTMQFGP RKLPAEWVSL HDKPRPAEGF
DAEFGYNAIR IPLYLARGGI TDKALLIRLQ KGMSQDGVPA TIDLTTGRPK TVLSDPGYRI
VNDVVACVVD GTRLPSSALQ FAPALYYPST LQLLGLAYIG EKHPECL
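
The gene and protein sequences above are related by translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs mainly in the start codons it permits. The sketch below shows one way to translate a sequence pasted in the 10-base block layout used here and to compute its GC percentage. The helper names `clean`, `translate`, and `gc_content` are illustrative assumptions, and the sketch does not handle alternative start codons such as GTG or TTG.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)
```

For example, `translate("ATGAGGCGGT GGCGCGCGCT C")` returns `"MRRWRAL"`, matching the first seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.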