Gene Information |
Locus tag | Rleg_6613 |
Symbol | |
ID | 8022863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 41729 |
End bp | 43750 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644833482 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_002984616 |
Protein GI | 241666532 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
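The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon. The function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a single terminal stop codon.
    """
    # Inclusive coordinates: both the start and the end base are counted.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but contributes no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_length, protein_length = cds_lengths(41729, 43750)
print(gene_length, protein_length)  # prints: 2022 673
```

Under these assumptions the result agrees with the Gene Length (2022 bp) and Protein Length (673 aa) fields.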
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.179225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACAT CGCGACCGAA AGCCCTGCGG CTTGAAACCC TTTGGAACCC GCCTGCTGAC GGTAAGGAAT TCTCCTACGT CCTGCGTCTC AAGAACCTCG GCACCGAGCC GCTTTCGAAT TTCTCCCTCT GCGTGAGCGG CCCAGGCCGT GTCGATCCGG CCGGTCGTGT CGAAGGCGCC ACGGTTTCGC AGCGGCTCTC GAATTTCACC GAATTCCAGC CACCGGCCAA TTTCGTTCTC GGCGCCGGCG AGACGTGGAC GATATCGGTC CATGCACTGA GCTGGCAGTT CCGTCACTGG ACGGACGGCG CGACAAGCGG TTATCTCGCG CTTTCCGATG GAAGCACGAT CGTGCTCAGC ATAGAGCCGA CGCGATCTTC GGTCAGCAAT GCGCCGCTGA AGCGTGGCGC CGAGATCTAT CCGGTTCCCA TCAATGCGCC CGTTCAGGTG TCGATCATCC CCTGGCCGAA CCATGTGGCG GTCACTTCCC GCCGGCCGCT GCCGGCCGGT TTTGCGCCGC AGTCGCAGAG CGCTGCAGGG GAGGCGGCAT CAAGAAGTTT CGCAGCGCTG GTCGAGCATC TCTTTGCCGT CGAAGGCATT ATGCGGAGCG AGGCGGAAGG CGCGGTTCCG GTTGCCCTGA AGGATGCCGC CGGGCTCGGG CCGGAGGCCT ATCGGCTGAG CTTCGAGGGT GAGGCGATCA CGATCGAGGC AAGCAGCCAG ACCGGCTTTC TCTACGGCCT CGTCACGCTC GGCCAGATCT GGCGCGGTGC AAGGCTGCAT CCCGAGGTCT TCCAGTTTCC GGCTTCCGGC GAGATCGTCG ATGAGCCGTC AATGGGCTGG CGCGGCCTGC ATCTCGACGT CGCCCGCCAG TTCTACGGTG CGGCTGAGGT CAAGAAACTG CTGGCGGTGC TTGCCTGGAA CAAGCTCAAC CGTTTCCACT GGCACCTTTC CGACGACGAA GCATGGCGCG TCGAGATCGA CGCCTATCCT GATTTGACCG CGGTCGGTGC CTGGCGCGGC CACGGTCTTG CCGTTCCGCC GCTGCTCGGT TCGAGCCCGG CCCGCACCGG CGGTTATTAC ACCAAGGCTT CGATCCGCGA GATCGTCGCC CATGCCAAGA GCTTCGGCGT GGAGATCGTG CCGGAGATCG ATGTCCCCGG CCATTGCTAC GCCATGCTGC AGGCGATACC GGAGCTGCGC GATCCGGCCG AGGCCGGCAG CTATTATTCG GTTCAGGGCT TTCCCGACAA TTGCATCAAT CCGGCCCGCG AGAAGACCTA TGAGATCATC GAAACGATCC TCTTAGAACT CATCGAGCTC TTTCCGTTCA AGGTCATCCA TCTCGGCGCC GACGAAGTTC CGCTTGGCGC GTGGTCCGGC TCGCCGGAAG CGCTCGAGCG CCTGCGCACG GTGGCGGGCG ACGAGGTTGC CGATGCGCAT GCCAAGCGGC TGAACGTCGT GACCAATACC CACGGTGCCG ACGACATCCA CGGCTCGGGC GCCGCCATCC TGCAGGCGGA GTTCCTGAAC CGCGTCCAGC GCTTCCTTGC GAGCAAGGGC TGCATCACCG GCGGCTGGGA AGAGGCGGCC CACGGCGATG TCATCGACAA ATCGAAGAGT TATCTCTGCA GCTGGCGCAA TGTCGAGGTC TCTGCCGAAC TTGCCGAGCG CGGCTACGAA ATGGTCGTCT GCCCCGGACA GGTCTACTAC CTCGACATGG CGCTCAGGCC CGACTGGGAC GAGCCCGGCG CCAGTTGGGC GGGGACTTCG GACGCCGAGA AGCTCTACAA TTTTGATCCC 
ATCGGCGGCT GGACGGCGAG CCAGAAACAG AAACTCCTCG GCATCCAGGC CTGCATCTGG TCCGAGCCGA TGACGGATCG CGCCGTTTTC GACCGCCTCG TCTTCCCCCG CCTTTCCGCA CTTGCCGAAA CGGGCTGGAC GAAGCCGTCA TCCAAGTCGT GGGAGCGCTT CAGGGCGCTC GCAGGACTGA TGCCGCTGCT CTACGGGCTG CAACAGTCGT AG
|
Protein sequence | MSTSRPKALR LETLWNPPAD GKEFSYVLRL KNLGTEPLSN FSLCVSGPGR VDPAGRVEGA TVSQRLSNFT EFQPPANFVL GAGETWTISV HALSWQFRHW TDGATSGYLA LSDGSTIVLS IEPTRSSVSN APLKRGAEIY PVPINAPVQV SIIPWPNHVA VTSRRPLPAG FAPQSQSAAG EAASRSFAAL VEHLFAVEGI MRSEAEGAVP VALKDAAGLG PEAYRLSFEG EAITIEASSQ TGFLYGLVTL GQIWRGARLH PEVFQFPASG EIVDEPSMGW RGLHLDVARQ FYGAAEVKKL LAVLAWNKLN RFHWHLSDDE AWRVEIDAYP DLTAVGAWRG HGLAVPPLLG SSPARTGGYY TKASIREIVA HAKSFGVEIV PEIDVPGHCY AMLQAIPELR DPAEAGSYYS VQGFPDNCIN PAREKTYEII ETILLELIEL FPFKVIHLGA DEVPLGAWSG SPEALERLRT VAGDEVADAH AKRLNVVTNT HGADDIHGSG AAILQAEFLN RVQRFLASKG CITGGWEEAA HGDVIDKSKS YLCSWRNVEV SAELAERGYE MVVCPGQVYY LDMALRPDWD EPGASWAGTS DAEKLYNFDP IGGWTASQKQ KLLGIQACIW SEPMTDRAVF DRLVFPRLSA LAETGWTKPS SKSWERFRAL AGLMPLLYGL QQS
|
| |
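The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any analysis. The sketch below, which uses only the Python standard library, strips that whitespace, computes a GC fraction, and translates a coding sequence. It assumes the sequence contains only A, C, G and T and starts with ATG. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which this sketch does not model. The helper names `clean`, `gc_fraction` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGTCCACAT CGCGACCGAA AGCCCTGCGG"
print(translate(prefix))  # prints: MSTSRPKALR
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.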