Gene Amuc_1924 details


Gene Information

Locus tag: Amuc_1924
Symbol:
ID: 6275292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2334650
End bp: 2336644
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 56%
IMG OID: 642613984
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_001878518
Protein GI: 187736406
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
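
The start, end, gene length, and protein length values in this record are consistent with one another. The following Python sketch shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length_bp(2334650, 2336644)
print(length, protein_length_aa(length))  # prints: 1995 664
```

Both results match the Gene Length and Protein Length values listed in the record.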


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.234624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.00104389
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCGATTAT CCCTTTCCGC GCCTTTGCTG GCTTCCGGCC TGATGCTTTG TTTTACCTCC 
TCCCATGGAG ACTGGACATC CCCGCACCCT CTCCAGCAGG AAAAAGCCGC TCCGGTCCCA
CTGATTCCCT TCCCGTCCCA GGTGGACTGG AAAACGGGAA CATGCCCTAA AAAGGCTCCC
GTTTCCGTGA AAAAAAACGC GTCCCTGTCT AAGACGCTGG GCAAGGAAGG GTATGAACTG
CGGATCCGCC CCAACGGCAT TCTGATTAAG GCCGCTGACG ATGCGGGCGT TTTTTATGCC
CGCAGAACGC TGGACCAGCT GGGAGCCAGG GGAGATTACC CGTGCTGCGA CATCAAGGAC
AGCCCCGCGT TCGCCATCCG GTGCTTCATG CATGACGCAG GCCGCCATTT CCGGACTGTT
GAAACACTCA AGGCGGATAT TGATGAAATG GCCCGGCTGA AAATAAACGC TTTTCACTGG
CATCTGACGG ATTACCCCGC ATGGCGCATC CAGTGCAAAA AGTACCCTGT TTTGAATGAT
CCCTCCAAGA GGATTCAGGG ACGGGACGTT AACGGCACCT ACTCCTACGA CCAGATACGG
GACTTATTCC GCTACGCCAG GGAGCGCCAT GTCCAGATTA TTCCGGAAAT CGATATGCCC
GGGCACAGCA CCTATTTTAA AAACTGCTTT GGCTTCCCGA TGCATGATCC CAGAGGCATC
AACATTCTGG AAGAACTGCT GGAAGAATTT TGCAGGGAAA TTCCGGCGGA AATGTCTCCC
TACCTTCATA TCGGAGCGGA TGAAATCCGC ATTCCCAACG GAAAGCAGTT TGCGGACCGG
ATGGCGGCCA AGGTCAAATC CCTGGGACGG CAGCCCATCC AATGGGCGGG CAACAATGAC
CTGCCTGTGT CCGGAGACAG TTATGCCCAG CTGTGGAATG ATGAAAATTC CGTAGGCCTG
CCGGATCCAG CCAGGCAGAA GAATCCCTAC TTCGACTCCA CGGCAGGATA CGTCAATTCC
TTTGACCCCG GCATTCTGGT GCGCAGGAAT TTCTTCCGCC AGCCCTGCGG AACAGCCAGG
GGAAACAACC ATTCCCTGGG CGTCATCCAG TGTCTATGGC CGGATACCCG GGTGGAGAAC
AAAAAAAACA TTCCTGTCCA GAGCCCCCAG TGGCCTGCCA TGTTCGCCAT GGCGGAACGC
AGTTGGAAAG GAATTCCAGA GGATGGTTCC CGTTTTGCCG GCAGTCTGCC GGAAAAGAAC
ACGGAGGCTT ATCAGGCCTT CTCTCTGTTT GAAAAACGCA TGGAAGCCCT GGCCGGAAGC
AGGCCTTTCC CTTACTGGAG GGATTCTTTC GTGGAATGGA CGGTATTCGG CCCCGTTCCG
CAGGACAGGC AGGAAGAAGT AAGAAATAAC CTGCTGGCGG GCAAATCTCC CGCAGGACTG
TCTTCCGTCC AGACGCGAGG AGGCAATTTG TATTTCCGTA CCCGTGCGGG TGCGGAGGGT
CTATTTTCCA AGACAAAACC GGGGAATACG GTCTGGGCGG AGACGACGTT TCACGCGCCC
GTGGAGGGCA CCATGCACGC TATGGTGGGG TTTGACGCTC CGGCCCGTTC CACACGGCGC
TGTTCCGGAG TACCGGCTGC CGGGGAGTGG TCCCAGTGCG GCACCAGAAT ATGGGTAAAC
GGCAAGGAAA TGAAAAATCC CCAGACTTAC AAACTGGCAG GCCAACGGCG TTACGAAAAG
CATACATGGA ATTCGCCCGC TAATGAGATA CCCTTTGACA ACGAGGAATT CTGGTGGGCC
CGGCCTCCTG TTCCCTTTCA AGTGAAGGCC GGGGAAAACA GGATCCTGAT AGAACAACCG
TACACAGGAG AATTCCAGTC GTGGGGAGTC AGCTTCATTC CGGTGAAAAA AGCGGGAGAC
CGCTGGATTG CCGACCCAAG CTATTATGCC AAACCCAGGA GAGAGAAACA GGATGACGTC
TCTCCGGTGC CATAA
 
Protein sequence
MRLSLSAPLL ASGLMLCFTS SHGDWTSPHP LQQEKAAPVP LIPFPSQVDW KTGTCPKKAP 
VSVKKNASLS KTLGKEGYEL RIRPNGILIK AADDAGVFYA RRTLDQLGAR GDYPCCDIKD
SPAFAIRCFM HDAGRHFRTV ETLKADIDEM ARLKINAFHW HLTDYPAWRI QCKKYPVLND
PSKRIQGRDV NGTYSYDQIR DLFRYARERH VQIIPEIDMP GHSTYFKNCF GFPMHDPRGI
NILEELLEEF CREIPAEMSP YLHIGADEIR IPNGKQFADR MAAKVKSLGR QPIQWAGNND
LPVSGDSYAQ LWNDENSVGL PDPARQKNPY FDSTAGYVNS FDPGILVRRN FFRQPCGTAR
GNNHSLGVIQ CLWPDTRVEN KKNIPVQSPQ WPAMFAMAER SWKGIPEDGS RFAGSLPEKN
TEAYQAFSLF EKRMEALAGS RPFPYWRDSF VEWTVFGPVP QDRQEEVRNN LLAGKSPAGL
SSVQTRGGNL YFRTRAGAEG LFSKTKPGNT VWAETTFHAP VEGTMHAMVG FDAPARSTRR
CSGVPAAGEW SQCGTRIWVN GKEMKNPQTY KLAGQRRYEK HTWNSPANEI PFDNEEFWWA
RPPVPFQVKA GENRILIEQP YTGEFQSWGV SFIPVKKAGD RWIADPSYYA KPRREKQDDV
SPVP
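
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below is a minimal, dependency-free way to do that and to translate the DNA. It builds the codon map from the NCBI amino-acid string, which is the same for translation table 11 as for the standard code (the two tables differ in permitted start codons, not in codon assignments). The helper names `clean`, `gc_percent`, and `translate` are illustrative. Only the first and last rows of the gene sequence are used here to keep the example short.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGCGATTAT CCCTTTCCGC GCCTTTGCTG GCTTCCGGCC TGATGCTTTG TTTTACCTCC"
last_row = "TCTCCGGTGC CATAA"  # in frame: 1980 bases precede this row

print(translate(first_row))  # MRLSLSAPLLASGLMLCFTS
print(translate(last_row))   # SPVP
```

The two translated fragments match the first twenty and last four residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.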