Gene Rleg_3756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3756 
Symbol 
ID8014588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3807616 
End bp3809526 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content64% 
IMG OID644826319 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_002977538 
Protein GI241206442 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.782699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATT ACCATCTGGA AGCAAGCTGG AGCCCGATCG AGGGCAGTTT CGGGCGCCTC 
ACCTTCATGC TTTTCAATCT TTCGACCGAG CCGCTGTCCG GCTTCTCGCT CGCCTATACG
TCAGAGACGC GGGTTGCCGA CAAACATGTC TGCGACGGCG GCAGCCTCAA GCGGCGGGTC
GCGCATTTCC AAGAATTCCT GCCGCCCGAA GACCTGAGCG TGCCGCCCGG TGGGCGCTGG
CGCTTCACTG TCGAGGGACT GACCAGGGAG CCGAAACATG TCACGGCCGG CGTCAAGTCG
GCCTATCTGA CACTTGGCGA CGGACGCCAC TTTCCTGTTG GTTTCGGCGA TCTCATGCTC
GAAGGCCGGG ATGGTGGCGT GGCGCCGCCG CTTCTGCCGC CGGGCCGGGC CGAGGAACCT
TATTCGCTAC TGCCCTGGCC GCTGGCGCTC GGGTTGAAGG CGGGAGAGCT GCCGGTCGTG
CTTTATCCGG CCGAGCGGAC GCGCCCTGAT GCGGTCAAGG CGCTCTCGCT GATTCTGGAG
CTCTACCAGC GGCTCTACCC GGCCGACAAT ATGCCGTTTT CCCTCGGTGC CGTCGAAGGC
GGGCGGGGCA TTCGTTTCGT CACCGAATCG TCGATCGCCG CCTTCGCTTA CGAATTGCGT
TTTACCGCGC ATGAGATCGT GCTTTCGAGT GCGGATGCCG CCGGGCGGCA TTACGGGCTG
ATCAGCCTGG CGCAACTGCT GCACGGCGCC CGCGCCGATC CCGAGCGCTT CAAATTCCCC
AATTTCGGCG CGATCGCCGA CCAGCCGCGT TATGACTGGC GCGGCTGCCA TCTCGATGTG
TCCAGGCAGT TCTATCCGGT GGCAGACGTC GTGCGGCTGA TCGATATTCT CGCCTGGAAC
AAGCTCAACA TCTTCCACTG GCATCTGACC GATGACGAAG CGTGGCGGCT GGAGATCAAG
GCCTATCCCG CGCTGACGGA GATCGGCGCC CGGCGCGGGC CGGATGAAGT GCTCGTGCCG
CAGCTCGGCG ACGGGGCGCA AACGCGCTCC GGTCATTACA CGCAGGAGGA TGCCAAGCGG
ATCGTTGCGC ATGCAGCCTC GCTGCATATC GAGGTACTGC CGGAAATCGA TATTCCGGGC
CACAGCATGG CGACGCTGTT CTCGCTGCCC GAGCTCGTCG ACGGCCAGGA GGCGCCGGAT
AGTTACCGCT CGGTGCAGGG TTATCCGAAC AACGCCCTCA ATCCGGCGGT GGAATTCACC
TATGAATTTC TCGGTAAGGT GTTCGACGAG ATGGTGACGC TGTTTCCCGG CGAATATCTC
CATATCGGCG GCGACGAAGT GGCGCACGGC TCCTGGCTTT CCTCGCCGCT CTGCAAGACG
CTGATGGAGA GGGAGAAACT TGCCGGCACT GCCGAGCTGC AATCCTATTT CCTGAAACGT
ATCAAAGCCA TGCTGTCGGA TCGCGGCAAG AAACTCGTCG GCTGGAACGA GGTTTCGCAT
GGCGGCGGCG TCGACCGCGA CGGCACGCTG CTGATGGCCT GGGAAAAGCC CGCCGTCGGC
ATCGAGCTGG CACAGGAGGG CTACGACGTG GTGATGACGC CGGGCCAGGC CTATTATCTC
GACATGGCGC AAGCGGAAGC CTGGGGCGAG CCCGGCGCGA GCTGGGCGGG CTTCAGCCTG
CCGGAACACA CCTACGCTTA CGAGGCCGAG GGCGAGCTGC CGGCGGCGCT GCAGGAGAAG
ATGCGCGGCA TCCAGGCCTG CATCTGGACT GAAAATTTCC TCTCGCGTGC CTATTTCAAC
CGGCTGGTTT TCCCGCGTCT CCCAGCGGTC GCCGAGGCTG CTTGGACGCC TTCTGCGCGC
AAGGACTGGG ATCGGTTCGC AGCGATCGTG CGGATGTGGC CGGTGCTTTA A
 
Protein sequence
MADYHLEASW SPIEGSFGRL TFMLFNLSTE PLSGFSLAYT SETRVADKHV CDGGSLKRRV 
AHFQEFLPPE DLSVPPGGRW RFTVEGLTRE PKHVTAGVKS AYLTLGDGRH FPVGFGDLML
EGRDGGVAPP LLPPGRAEEP YSLLPWPLAL GLKAGELPVV LYPAERTRPD AVKALSLILE
LYQRLYPADN MPFSLGAVEG GRGIRFVTES SIAAFAYELR FTAHEIVLSS ADAAGRHYGL
ISLAQLLHGA RADPERFKFP NFGAIADQPR YDWRGCHLDV SRQFYPVADV VRLIDILAWN
KLNIFHWHLT DDEAWRLEIK AYPALTEIGA RRGPDEVLVP QLGDGAQTRS GHYTQEDAKR
IVAHAASLHI EVLPEIDIPG HSMATLFSLP ELVDGQEAPD SYRSVQGYPN NALNPAVEFT
YEFLGKVFDE MVTLFPGEYL HIGGDEVAHG SWLSSPLCKT LMEREKLAGT AELQSYFLKR
IKAMLSDRGK KLVGWNEVSH GGGVDRDGTL LMAWEKPAVG IELAQEGYDV VMTPGQAYYL
DMAQAEAWGE PGASWAGFSL PEHTYAYEAE GELPAALQEK MRGIQACIWT ENFLSRAYFN
RLVFPRLPAV AEAAWTPSAR KDWDRFAAIV RMWPVL