Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3756 |
Symbol | |
ID | 8014588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3807616 |
End bp | 3809526 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826319 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_002977538 |
Protein GI | 241206442 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.782699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATT ACCATCTGGA AGCAAGCTGG AGCCCGATCG AGGGCAGTTT CGGGCGCCTC ACCTTCATGC TTTTCAATCT TTCGACCGAG CCGCTGTCCG GCTTCTCGCT CGCCTATACG TCAGAGACGC GGGTTGCCGA CAAACATGTC TGCGACGGCG GCAGCCTCAA GCGGCGGGTC GCGCATTTCC AAGAATTCCT GCCGCCCGAA GACCTGAGCG TGCCGCCCGG TGGGCGCTGG CGCTTCACTG TCGAGGGACT GACCAGGGAG CCGAAACATG TCACGGCCGG CGTCAAGTCG GCCTATCTGA CACTTGGCGA CGGACGCCAC TTTCCTGTTG GTTTCGGCGA TCTCATGCTC GAAGGCCGGG ATGGTGGCGT GGCGCCGCCG CTTCTGCCGC CGGGCCGGGC CGAGGAACCT TATTCGCTAC TGCCCTGGCC GCTGGCGCTC GGGTTGAAGG CGGGAGAGCT GCCGGTCGTG CTTTATCCGG CCGAGCGGAC GCGCCCTGAT GCGGTCAAGG CGCTCTCGCT GATTCTGGAG CTCTACCAGC GGCTCTACCC GGCCGACAAT ATGCCGTTTT CCCTCGGTGC CGTCGAAGGC GGGCGGGGCA TTCGTTTCGT CACCGAATCG TCGATCGCCG CCTTCGCTTA CGAATTGCGT TTTACCGCGC ATGAGATCGT GCTTTCGAGT GCGGATGCCG CCGGGCGGCA TTACGGGCTG ATCAGCCTGG CGCAACTGCT GCACGGCGCC CGCGCCGATC CCGAGCGCTT CAAATTCCCC AATTTCGGCG CGATCGCCGA CCAGCCGCGT TATGACTGGC GCGGCTGCCA TCTCGATGTG TCCAGGCAGT TCTATCCGGT GGCAGACGTC GTGCGGCTGA TCGATATTCT CGCCTGGAAC AAGCTCAACA TCTTCCACTG GCATCTGACC GATGACGAAG CGTGGCGGCT GGAGATCAAG GCCTATCCCG CGCTGACGGA GATCGGCGCC CGGCGCGGGC CGGATGAAGT GCTCGTGCCG CAGCTCGGCG ACGGGGCGCA AACGCGCTCC GGTCATTACA CGCAGGAGGA TGCCAAGCGG ATCGTTGCGC ATGCAGCCTC GCTGCATATC GAGGTACTGC CGGAAATCGA TATTCCGGGC CACAGCATGG CGACGCTGTT CTCGCTGCCC GAGCTCGTCG ACGGCCAGGA GGCGCCGGAT AGTTACCGCT CGGTGCAGGG TTATCCGAAC AACGCCCTCA ATCCGGCGGT GGAATTCACC TATGAATTTC TCGGTAAGGT GTTCGACGAG ATGGTGACGC TGTTTCCCGG CGAATATCTC CATATCGGCG GCGACGAAGT GGCGCACGGC TCCTGGCTTT CCTCGCCGCT CTGCAAGACG CTGATGGAGA GGGAGAAACT TGCCGGCACT GCCGAGCTGC AATCCTATTT CCTGAAACGT ATCAAAGCCA TGCTGTCGGA TCGCGGCAAG AAACTCGTCG GCTGGAACGA GGTTTCGCAT GGCGGCGGCG TCGACCGCGA CGGCACGCTG CTGATGGCCT GGGAAAAGCC CGCCGTCGGC ATCGAGCTGG CACAGGAGGG CTACGACGTG GTGATGACGC CGGGCCAGGC CTATTATCTC GACATGGCGC AAGCGGAAGC CTGGGGCGAG CCCGGCGCGA GCTGGGCGGG CTTCAGCCTG CCGGAACACA CCTACGCTTA CGAGGCCGAG GGCGAGCTGC CGGCGGCGCT GCAGGAGAAG ATGCGCGGCA TCCAGGCCTG CATCTGGACT GAAAATTTCC TCTCGCGTGC CTATTTCAAC CGGCTGGTTT TCCCGCGTCT CCCAGCGGTC GCCGAGGCTG CTTGGACGCC TTCTGCGCGC AAGGACTGGG ATCGGTTCGC AGCGATCGTG CGGATGTGGC CGGTGCTTTA A
|
Protein sequence | MADYHLEASW SPIEGSFGRL TFMLFNLSTE PLSGFSLAYT SETRVADKHV CDGGSLKRRV AHFQEFLPPE DLSVPPGGRW RFTVEGLTRE PKHVTAGVKS AYLTLGDGRH FPVGFGDLML EGRDGGVAPP LLPPGRAEEP YSLLPWPLAL GLKAGELPVV LYPAERTRPD AVKALSLILE LYQRLYPADN MPFSLGAVEG GRGIRFVTES SIAAFAYELR FTAHEIVLSS ADAAGRHYGL ISLAQLLHGA RADPERFKFP NFGAIADQPR YDWRGCHLDV SRQFYPVADV VRLIDILAWN KLNIFHWHLT DDEAWRLEIK AYPALTEIGA RRGPDEVLVP QLGDGAQTRS GHYTQEDAKR IVAHAASLHI EVLPEIDIPG HSMATLFSLP ELVDGQEAPD SYRSVQGYPN NALNPAVEFT YEFLGKVFDE MVTLFPGEYL HIGGDEVAHG SWLSSPLCKT LMEREKLAGT AELQSYFLKR IKAMLSDRGK KLVGWNEVSH GGGVDRDGTL LMAWEKPAVG IELAQEGYDV VMTPGQAYYL DMAQAEAWGE PGASWAGFSL PEHTYAYEAE GELPAALQEK MRGIQACIWT ENFLSRAYFN RLVFPRLPAV AEAAWTPSAR KDWDRFAAIV RMWPVL
|
| |