Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1924 |
Symbol | |
ID | 6275292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2334650 |
End bp | 2336644 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613984 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001878518 |
Protein GI | 187736406 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.234624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00104389 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGATTAT CCCTTTCCGC GCCTTTGCTG GCTTCCGGCC TGATGCTTTG TTTTACCTCC TCCCATGGAG ACTGGACATC CCCGCACCCT CTCCAGCAGG AAAAAGCCGC TCCGGTCCCA CTGATTCCCT TCCCGTCCCA GGTGGACTGG AAAACGGGAA CATGCCCTAA AAAGGCTCCC GTTTCCGTGA AAAAAAACGC GTCCCTGTCT AAGACGCTGG GCAAGGAAGG GTATGAACTG CGGATCCGCC CCAACGGCAT TCTGATTAAG GCCGCTGACG ATGCGGGCGT TTTTTATGCC CGCAGAACGC TGGACCAGCT GGGAGCCAGG GGAGATTACC CGTGCTGCGA CATCAAGGAC AGCCCCGCGT TCGCCATCCG GTGCTTCATG CATGACGCAG GCCGCCATTT CCGGACTGTT GAAACACTCA AGGCGGATAT TGATGAAATG GCCCGGCTGA AAATAAACGC TTTTCACTGG CATCTGACGG ATTACCCCGC ATGGCGCATC CAGTGCAAAA AGTACCCTGT TTTGAATGAT CCCTCCAAGA GGATTCAGGG ACGGGACGTT AACGGCACCT ACTCCTACGA CCAGATACGG GACTTATTCC GCTACGCCAG GGAGCGCCAT GTCCAGATTA TTCCGGAAAT CGATATGCCC GGGCACAGCA CCTATTTTAA AAACTGCTTT GGCTTCCCGA TGCATGATCC CAGAGGCATC AACATTCTGG AAGAACTGCT GGAAGAATTT TGCAGGGAAA TTCCGGCGGA AATGTCTCCC TACCTTCATA TCGGAGCGGA TGAAATCCGC ATTCCCAACG GAAAGCAGTT TGCGGACCGG ATGGCGGCCA AGGTCAAATC CCTGGGACGG CAGCCCATCC AATGGGCGGG CAACAATGAC CTGCCTGTGT CCGGAGACAG TTATGCCCAG CTGTGGAATG ATGAAAATTC CGTAGGCCTG CCGGATCCAG CCAGGCAGAA GAATCCCTAC TTCGACTCCA CGGCAGGATA CGTCAATTCC TTTGACCCCG GCATTCTGGT GCGCAGGAAT TTCTTCCGCC AGCCCTGCGG AACAGCCAGG GGAAACAACC ATTCCCTGGG CGTCATCCAG TGTCTATGGC CGGATACCCG GGTGGAGAAC AAAAAAAACA TTCCTGTCCA GAGCCCCCAG TGGCCTGCCA TGTTCGCCAT GGCGGAACGC AGTTGGAAAG GAATTCCAGA GGATGGTTCC CGTTTTGCCG GCAGTCTGCC GGAAAAGAAC ACGGAGGCTT ATCAGGCCTT CTCTCTGTTT GAAAAACGCA TGGAAGCCCT GGCCGGAAGC AGGCCTTTCC CTTACTGGAG GGATTCTTTC GTGGAATGGA CGGTATTCGG CCCCGTTCCG CAGGACAGGC AGGAAGAAGT AAGAAATAAC CTGCTGGCGG GCAAATCTCC CGCAGGACTG TCTTCCGTCC AGACGCGAGG AGGCAATTTG TATTTCCGTA CCCGTGCGGG TGCGGAGGGT CTATTTTCCA AGACAAAACC GGGGAATACG GTCTGGGCGG AGACGACGTT TCACGCGCCC GTGGAGGGCA CCATGCACGC TATGGTGGGG TTTGACGCTC CGGCCCGTTC CACACGGCGC TGTTCCGGAG TACCGGCTGC CGGGGAGTGG TCCCAGTGCG GCACCAGAAT ATGGGTAAAC GGCAAGGAAA TGAAAAATCC CCAGACTTAC AAACTGGCAG GCCAACGGCG TTACGAAAAG CATACATGGA ATTCGCCCGC TAATGAGATA CCCTTTGACA ACGAGGAATT CTGGTGGGCC CGGCCTCCTG TTCCCTTTCA AGTGAAGGCC GGGGAAAACA GGATCCTGAT AGAACAACCG TACACAGGAG AATTCCAGTC GTGGGGAGTC AGCTTCATTC CGGTGAAAAA AGCGGGAGAC CGCTGGATTG CCGACCCAAG CTATTATGCC AAACCCAGGA GAGAGAAACA GGATGACGTC TCTCCGGTGC CATAA
|
Protein sequence | MRLSLSAPLL ASGLMLCFTS SHGDWTSPHP LQQEKAAPVP LIPFPSQVDW KTGTCPKKAP VSVKKNASLS KTLGKEGYEL RIRPNGILIK AADDAGVFYA RRTLDQLGAR GDYPCCDIKD SPAFAIRCFM HDAGRHFRTV ETLKADIDEM ARLKINAFHW HLTDYPAWRI QCKKYPVLND PSKRIQGRDV NGTYSYDQIR DLFRYARERH VQIIPEIDMP GHSTYFKNCF GFPMHDPRGI NILEELLEEF CREIPAEMSP YLHIGADEIR IPNGKQFADR MAAKVKSLGR QPIQWAGNND LPVSGDSYAQ LWNDENSVGL PDPARQKNPY FDSTAGYVNS FDPGILVRRN FFRQPCGTAR GNNHSLGVIQ CLWPDTRVEN KKNIPVQSPQ WPAMFAMAER SWKGIPEDGS RFAGSLPEKN TEAYQAFSLF EKRMEALAGS RPFPYWRDSF VEWTVFGPVP QDRQEEVRNN LLAGKSPAGL SSVQTRGGNL YFRTRAGAEG LFSKTKPGNT VWAETTFHAP VEGTMHAMVG FDAPARSTRR CSGVPAAGEW SQCGTRIWVN GKEMKNPQTY KLAGQRRYEK HTWNSPANEI PFDNEEFWWA RPPVPFQVKA GENRILIEQP YTGEFQSWGV SFIPVKKAGD RWIADPSYYA KPRREKQDDV SPVP
|
| |