Gene Rleg2_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4209 
Symbol 
ID6982982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4386426 
End bp4387475 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content65% 
IMG OID643398940 
ProductCellulase 
Protein accessionYP_002283697 
Protein GI209551780 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACGA GACGACATCT GACGGCGCTG CTGCTTGCGG CCGCTCTCGT CCCCTCGCCG 
ACCCTTGCGG CCGACCCGCC CTGCTATCGC GGCGTCAATC TTTCCGGCGG CGAATATGGT
GAGCGCGACG GCATTTACGG CACGAATTAT AACTATCCCA GCGAAGAGAC GATCCGCTAT
TTCGCCGAAA AGGGCATGAC GATCGTCCGG CTGCCCTTCC GCTGGGAGCG GTTGCAGCCG
GCGCTGGGCG GCCGGCTCGA CGAGGACGAA CTCAAGCGGA TCAAGGATAC CGTCGGGCTG
ATCCGCAAGC ACGGCATGGC CGTGCTGCTC GACCCGCATA ATTTCGGCTA TTACGACAAG
GTGCAGGTCG GCACGGCGCC GGCGACGGAT GCCGCCTTCG GTGATTTCTG GGCAAGGCTT
GCGGTCGAAT TCGCCAATCA GGACGGCGTG CTCTTCGGCC TGATGAACGA GCCGCACGAT
ATCAAGGCAA CGGACTGGCT CGAGGCCGCC AATGCGGCGA TCCGCAGCAT CCGCGCCGTC
GGCGCCCGCA ACCTCATCCT GGTGCCGGGC ACGGCCTGGA GCGGCGCTCA CAGCTGGGAG
GAGGATGTGA TCGGCGGCGC CAACGGCACG GTGATGCTCG GCGTGCGCGA TCCGCTCGAC
TTCTACGCCT ATGAGGTCCA CCAGTATCTC GACATTGATT CCTCCGGAAC CCATCCGACC
TGCGAGGGTG CTACCGGCGC TGTCGAAGCG ATCGCCGGCG TCACCGCCTG GCTGAAGAAG
AACCACAAGC GCGGCTTTCT CGGCGAGTTC GGCGCCGCTG CCGACAAGGA CTGCATGAGC
GGGCTGACCG AGATCTATTC CACCATGTCC GATAATGGCG ACGCCTGGCT CGGCTGGTCC
TATTGGGCCG CAGGCGAATG GTGGCCGGCC GACGAGCCGT TCAACGTCCA GCCGCGAAAG
GGCGCTGAGC GGCCGCAGAT GCGGCTTCTC GTCAATTCGG CAAAAGCCAA AGCCGGCGCC
TGCGCCAGCG TCAAGCCAGC GGGGAAGTGA
 
Protein sequence
MRTRRHLTAL LLAAALVPSP TLAADPPCYR GVNLSGGEYG ERDGIYGTNY NYPSEETIRY 
FAEKGMTIVR LPFRWERLQP ALGGRLDEDE LKRIKDTVGL IRKHGMAVLL DPHNFGYYDK
VQVGTAPATD AAFGDFWARL AVEFANQDGV LFGLMNEPHD IKATDWLEAA NAAIRSIRAV
GARNLILVPG TAWSGAHSWE EDVIGGANGT VMLGVRDPLD FYAYEVHQYL DIDSSGTHPT
CEGATGAVEA IAGVTAWLKK NHKRGFLGEF GAAADKDCMS GLTEIYSTMS DNGDAWLGWS
YWAAGEWWPA DEPFNVQPRK GAERPQMRLL VNSAKAKAGA CASVKPAGK