Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3454 |
Symbol | |
ID | 6982208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3569716 |
End bp | 3571626 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643398172 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_002282947 |
Protein GI | 209551030 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATT ATAGTCTTGA GACAAGCTGG CGCCCGGTCG AAGGCAGCTT CGGGCGCTTG ACCTTCACGC TCCACAATCT TTCGGCCGAG CCGCTGTCCG GCTTCTCGCT TGCCTATACA TCGGAGACGC GGGTTGCCGA CAAGCATGTC TGCGACGGCG CCAGCCTCAA GCGGCAGGTC GCGCATTTCC ATGAGTTCCT GCCGTCCGAG CAATTGACGG TGCCGCCGGG TGGGCGCTGG CGGTTCACTG TGGAGGGGCT GAGCCGAGAG CCGAAACATG TCACCGCTGG CGTTAAATCC GCCTATCTGA CACTTGCCGA CGGTCGTCAC CTTCCCGTCG ATTTCGGCGA TCTCATGCTC GAAGGCCGGG ATGGCGGTGT GTCGCCGCCG CTCTTGCCGC CGGGGAAAGC CGATGAACCC TATGCGCTGC TCCCCTGGCC GCTGGCGCTC GGGCTGAAGG CGGGTGACTT GCCGGCCACC CTCTATCCGG CCGAGAGGAC GCGGCCTGAT GCGATCGAGG CCCTCTCCCA GGTTCTTTCT CTTTACCAGC GCCTCTACCC AGTCGAAGAC GCGCCGTTTT CGCTGAGGGC CGTCGAAGGC GGACGGCGCA TTCGTTTCGT CGCCGAATCG TCGATCGCTG CCTTCGCCTA CGAGCTGCGT TTCACCGCGC ACGAGATCGT GCTTTCGAGC GCCGATGAAG CGGGGCGGCG CTACGGTCTC ATCAGCCTGG CGCAGCTGCT GCACGGGGCC CGCGCCGATG GCGAACGATT TAAATTCCCC AATTTCGGCA CGATCGCCGA CCAGCCGCGT TATGACTGGC GCGGCTGCCA TCTCGATGTG TCCCGGCAGT TCTATCCGGT CGCCGACATC CTGCGGCTGA TCGATATTCT TGCCTGGAAC AAGCTCAATA TCTTTCATTG GCACCTGACC GACGACGAAG CCTGGCGGCT GGAGATCAAG GCCTATCCAG CGCTGACGGA GATCGGCGCG CGGCGCGGGC CGGACGAGGT CCTCGTGCCA CAGCTCGGCG ACGGAGCGGA AACACGCGCC GGCCATTACA CGCAGGATGA TATCAGGCGG ATCGTTGCGC ATGCCGCCTC GCTCGGTGTC GAGGTCGTGC CGGAAATCGA CATTCCCGGC CACAGCACCG CAACCTTGCT CTCGCTGCCC GAGCTCGCCG ACGGGCAGGA AGCGCCGGAC AGCTATCGCG CGGTCCAAGG TTATCCCAAC AATGCGCTCA ACCCTGCGGT CGAATTCACC TATGAATTTC TCGGCAAGGT GTTCGACGAG ATCGTGGCGC TGTTTCCGAG CGAGTATCTC CACATTGGCG GCGACGAGGT GGCAGAGGGC GCTTGGCTAT CCTCCCCTCT CTGCCAGGCG CTGATGAAGC GGGAGAAGCT TGCCGGCACC GCCGAGCTGC AATCCTATTT CCTGAAACGC ATCAAGGCGA TGCTCTCGGA GCGCGGCAGG AAGCTCGCCG GCTGGAACGA GGTCTCGCAT GGCGGCGGCA TCGACCCCGA CGGCACGCTG CTGATGGCCT GGGAAAAGCC CGCCGTCGGC ATCGAGCTGG CGCAGCAAGG CTACGACGTG GTGATGACGC CGGGACAGGC TTATTATCTC GACATGGCGC AGGCCGAAGC CTGGGCGGAG CCCGGTGCAG CATGGGCGGG CTATTCGCCG CCGGAACATA GCTACACTTA CGAGGCCGAG GGTGAGTTGC CGGAGGCGCT GCAGGAGAAG ATGCGCGGTA TCCAGGCCTG CATCTGGACT GAAAACTTCA TCTCACGCGC CTATTTCAAC CGGCTGGTCT TTCCGCGTCT TCCGGCCGTC GCCGAGGCCG CATGGACACC GTCGGGCCGG AAGGATTGGG ACCGGTTCGC CGCGATCGTG CGGATGTGGC CTATGCTTTA G
|
Protein sequence | MADYSLETSW RPVEGSFGRL TFTLHNLSAE PLSGFSLAYT SETRVADKHV CDGASLKRQV AHFHEFLPSE QLTVPPGGRW RFTVEGLSRE PKHVTAGVKS AYLTLADGRH LPVDFGDLML EGRDGGVSPP LLPPGKADEP YALLPWPLAL GLKAGDLPAT LYPAERTRPD AIEALSQVLS LYQRLYPVED APFSLRAVEG GRRIRFVAES SIAAFAYELR FTAHEIVLSS ADEAGRRYGL ISLAQLLHGA RADGERFKFP NFGTIADQPR YDWRGCHLDV SRQFYPVADI LRLIDILAWN KLNIFHWHLT DDEAWRLEIK AYPALTEIGA RRGPDEVLVP QLGDGAETRA GHYTQDDIRR IVAHAASLGV EVVPEIDIPG HSTATLLSLP ELADGQEAPD SYRAVQGYPN NALNPAVEFT YEFLGKVFDE IVALFPSEYL HIGGDEVAEG AWLSSPLCQA LMKREKLAGT AELQSYFLKR IKAMLSERGR KLAGWNEVSH GGGIDPDGTL LMAWEKPAVG IELAQQGYDV VMTPGQAYYL DMAQAEAWAE PGAAWAGYSP PEHSYTYEAE GELPEALQEK MRGIQACIWT ENFISRAYFN RLVFPRLPAV AEAAWTPSGR KDWDRFAAIV RMWPML
|
| |