Gene Rleg2_5779 details

Gene Information

Locus tag: Rleg2_5779
Symbol:
ID: 6977168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 189732
End bp: 190754
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 65%
IMG OID: 643393234
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_002278052
Protein GI: 209546162
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
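
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming a final stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(189732, 190754)
print(length_bp)                  # 1023
print(protein_length(length_bp))  # 340
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.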


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.028314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0213376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCTCGA CCCCGCTCGC CCTTTTCGTC GGCCTTCCCA ATCCGGTCCT TTCGGATGAA 
GAATTCGCCC TGTTTCGCGA AACCAATCCG CTCGGCCTCT TTGTCGGCCG GCGCAATCAG
CGCGAACCGG AGCAGACGAA GCGCCTGATC GAACGCTTTC GCGAGGCCGT CGGCCGCGAC
GATGCGCCTG TTTTTACCGA CCAGGAAGGC GGCCGCGTGC AGCATCTCGA TGCCGGCCCC
TGGCCGCTCT TCCGCAGCTT CGGCCAGTTC GCCGAACTGG CGCGCCGGGA TTTCGCACTC
GGCAAAAAAG CATTGCGCCT TTCCTCCCAG GCCATGGGGG CGATGATGAC GGAACTCGGC
CTTTCCAGCG GCTGCTCGCC CGTTCTCGAC CTCGTCTTCG AGACGACGAG TGCGGTCATC
GGCGCCCGCT CTTTCGGCCC CGATCCTGAT GTCATCGCCG CCCTCGGCCC CGAGGTGATC
GACGGCCTGC TCGAGGCCGG CAATATGCCT GTGATGAAGC ACATTCCCGG CCATGGCCGC
GCGACGCTGG ATTCCCACAA AGAGCGTCCC GTAGTCGATG CCAGCCGCGT GACGCTCGCT
GCGACCGATT TCAAGCCCTT CGTGGCGCTG AAGGATACGC CCTGGGCCAT GGTCGCCCAT
GTCGTCTACT CCGCCTACGA CAAGGAGCGG CCCGCCTCCG TCTCGCCGGT CATGCACGAC
GTCATCCGCA ACGAGATGGG CTATGAAGGC GTGCTGATTT CCGACTGCAT CTTCATGGAA
TCGCTCTCCG GCACCCTGCC GGAACGCGTC AGACAGGTGC TCGACGCCGG CTTCGACATC
GCCCTCCACA GCCATGGCGA CGTCAGGGAA AGCGAGGCCG CCGCCAAGGC CGCCCGACCG
CTGACGGACG CCGCTCTCAA GCGGATCGCC GCCGGCACGG CCCGCCTCGG CAATCTCAAG
GTCGACGTCC GCGCCGCCCA CCGCCAAGTC GAAGACATGT TTGCAAGCGC GCTGGTCTCC
TGA
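
The gene sequence is displayed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before the sequence can be measured or analysed. The sketch below shows one way to do that in Python and to compute a GC fraction of the kind reported in the GC content field. The function names are hypothetical, and the sample input is only the first two blocks of the sequence above, not the full gene.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; an empty sequence gives 0.0."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two 10-base blocks of the gene sequence, as displayed.
sample = normalize_sequence("TTGTCCTCGA CCCCGCTCGC")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.7
```

Passing the complete gene sequence through `normalize_sequence` first would allow the same functions to be compared against the listed gene length and GC content.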
 
Protein sequence
MSSTPLALFV GLPNPVLSDE EFALFRETNP LGLFVGRRNQ REPEQTKRLI ERFREAVGRD 
DAPVFTDQEG GRVQHLDAGP WPLFRSFGQF AELARRDFAL GKKALRLSSQ AMGAMMTELG
LSSGCSPVLD LVFETTSAVI GARSFGPDPD VIAALGPEVI DGLLEAGNMP VMKHIPGHGR
ATLDSHKERP VVDASRVTLA ATDFKPFVAL KDTPWAMVAH VVYSAYDKER PASVSPVMHD
VIRNEMGYEG VLISDCIFME SLSGTLPERV RQVLDAGFDI ALHSHGDVRE SEAAAKAARP
LTDAALKRIA AGTARLGNLK VDVRAAHRQV EDMFASALVS