Gene Rleg_4499 details


Gene Information

Locus tag: Rleg_4499
Symbol:
ID: 8015260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4633305
End bp: 4634354
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 64%
IMG OID: 644827075
Product: Cellulase
Protein accession: YP_002978276
Protein GI: 241207180
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
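
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are inclusive and that the gene length counts the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
print(gene_length(4633305, 4634354))
print(protein_length(1050))
```

Under these assumptions the two results agree with the Gene Length and Protein Length fields.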


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.591469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGACGA CACGACATCT GACAGCGCTG CTGTTTGCGG CCGCTCTCAC CCCGTCGCCG 
GTCCTTGCGG CTGAGGCCCC TTGCTACCGC GGCGTCAATT TGTCCGGCGG TGAATATGGC
GAGCGCGGCG GCATCTACGG CACCAACTAC ACCTACCCGA GCGAAGACAC GATCGGCTAT
TTCGCCAAGA AGGGCATGAC GATTATCCGG CTGCCCTTCC GCTGGGAGCG GCTGCAGCCC
GCACTCGGCG GGCGGCTCGA CGAGGATGAG CTCAAGCGGA TCAAAGATAC GATCGGCCTG
ATCCGCAAGC ACGGCATGGC GGTTCTGCTC GACCCGCATA ATTTCGGCTA TTACGACAAG
ACCCAGGTCG GCACAGCGCC GGCGACGGAT GCCGCCTTCG GTGACTTCTG GGCAAGGCTC
GCCGTCGAAT TCGCCAATCA GGACGGCGTT CTCTTCGGCC TGATGAACGA ACCGCACGAC
ATCAAGGCGA CCGACTGGCT GGATGCGGCC AATGCGGCGA TCCGCAGCAT CCGCGCTGTC
GGCGCGCGCA ACCTCATCTT GGTGCCGGGC ACCGCCTGGA GCGGCGCGGG CAGCTGGGAA
AAGGATGTGA TCGGCGGCGC CAACGGCACG GTGATGCTCG GTGTGCGCGA TCCGCTCAAT
TTCTACGCCT ATGAGGTCCA CCAGTATCTC GATGCCGATT CCTCCGGCAC CCATCCGACC
TGTGAAGGTG CGTCCGCCGC GGTCGCGGCG ATCAACGGCG TTACCGCCTG GTTGAAGCAG
AACCACAAGC GCGGTTTTCT CGGCGAATTT GGCGCCTCCA CCGACAAGGA CTGCATGAGC
GGGCTGACCG AAATCTACGC CACCATGTCC GGCAATAGCG ATGTGTGGCT CGGCTGGTCC
TACTGGGCGG CCGGCGATTG GTGGCCGGCG GACGAGCCGT TCAACGTCCA GCCGCGCAAG
GGCCCTGAGC GGCCGCAGAT GCGGCTTCTT GCCGAGGCGG CAAAAGCCGG TGCCGGCATT
TGCTCCGCCG TCAAACCCGC GGGGAAATGA
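
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. A minimal sketch, assuming plain A/C/G/T text as input; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input in the same block layout as the record.
toy = "ATGC GGCC"
print(len(clean_sequence(toy)), gc_fraction(toy))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the fraction with the GC content field.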
 
Protein sequence
MRTTRHLTAL LFAAALTPSP VLAAEAPCYR GVNLSGGEYG ERGGIYGTNY TYPSEDTIGY 
FAKKGMTIIR LPFRWERLQP ALGGRLDEDE LKRIKDTIGL IRKHGMAVLL DPHNFGYYDK
TQVGTAPATD AAFGDFWARL AVEFANQDGV LFGLMNEPHD IKATDWLDAA NAAIRSIRAV
GARNLILVPG TAWSGAGSWE KDVIGGANGT VMLGVRDPLN FYAYEVHQYL DADSSGTHPT
CEGASAAVAA INGVTAWLKQ NHKRGFLGEF GASTDKDCMS GLTEIYATMS GNSDVWLGWS
YWAAGDWWPA DEPFNVQPRK GPERPQMRLL AEAAKAGAGI CSAVKPAGK
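
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the method used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative start codons are not handled, which is acceptable here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGGACGA CACGACATCT GACAGCGCTG"))  # MRTTRHLTAL
```

The first ten residues match the start of the protein sequence, and the final line of the gene sequence translates to the C-terminal residues CSAVKPAGK followed by the TGA stop codon.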