Gene Amuc_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1669 
Symbol 
ID6274577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2021876 
End bp2023519 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content54% 
IMG OID642613727 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001878268 
Protein GI187736156 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.254995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.452028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTCA GACATGTTTC CCATGTGTTT CTTTTTCTGT TTTCCGGCTT ATGCGGCACG 
GCCGTGAATG CTCAGGATCA GATAATACCC CGGCCCGTGT CCGTGAAGGT AGAGGCCAAG
GGGGCGGCAA CGCCCGTGAA GCTGGGTGAA GGAATGCGCA TTATATGCAA GGAGAAGGAC
GGGGAGTTCC GGCGCCAGGC TCACTTGTTG CAGCAGTTTT TGTCACGCGG AACCGGATTG
GCTCTGGACG GGGCTGGCGG TACGGGAATA ATCCGCGTTG TGAAGGACGC TGCCCTGAAG
CAGTACGGGC CGGAGGCATA CCGTTTGGAA GCGGCTCCCG GAAATATCGT CATCAGCGCC
GCGACTCCCA AAGGAGTTTA TTATGCGGGG CAAAGTCTGG CGCAGATGCT GCCTGCCGCC
TTTTTCAACA ATGATGCGGA CAAAAAGAGC GTAAGTTGGA ATGTAGCGGA GAAGCCTTTT
TCCATATTGG ATTATCCCCG TTTTGCCTGG CGAGCCTTCA TGCTGGATGA AGCACGGCAT
TTTTTTGGAG AGGAGGAGGT GAAGAGGCTG ATTGACCAAA TGGGGCTCCT GAAAATGAAT
GTCCTGCACT GGCATTTGTC GGATGATGCC GGATGGCGCA TCCAGATTAA AAAATATCCC
AAACTGACAT CCGTGGGAGG CAAGAGGAGA GATACGGAGA TAGAGACGTG GGGAAGCGGC
AAGTATGAAG GCAGGCCCCA TGAAGGCTTT TATACGCAGG AACAGGTCAG GCGCATTGTC
CGGTATGCGG CCGACAGGAA TATAACGATT GTTCCTGAAA TAGATATTCC GGGCCATTCC
GCCGCCGCCA TCGTTTCCTA TCCGGAGTTG AAATTGTCCG CCAGGCCTTT CGCGGAAGTG
CCCGTGAGTT TCAATGACGG AGCGGCGTTT GATCCCACCA GCGAACGTAC CTACCAGTTC
CTGGGAGACA TCATGACAGA GCTGGCTTCC CTCTTTCCCG GAGGTATCAT TCACATCGGC
GGTGACGAGG TGCGGTACAA AAAGTACTGG GAAGGCGTGC CCCACATTGA GGCGTTCATG
AAGAAAAAAG GCATTAAGAC TTTTCCCGAC CTTCAAATCA TGTTTACGAA TCGGATTTCC
GGCATGCTGG CTGGGATGGG GCGCCGGATG ATGGGGTGGA ATGAAATACT GGGGTCCGAC
GTGCACAATG ACGGCGGCCG GGGAGCTGCG CTCGGCAAGC TGGATGCAAA TGCTATCATT
CATTTCTGGT ATGGGTCCGA TAAAATTGCC GCCAAGGCGA TCAGGGAAGG CCGCCAGGTG
GTGAATTCCA CTTCCCACAT GACTTACATT AACAAAGGAT ATGACAAGCT CCCTCTGTCC
AGGTCCTATT CTTTTGAGCC CGTTTTTGCG GGATTGAACC CGGAGCAGCA GAAGAATGTC
ATCGGGCTGG GCTGCCAGGT CTGGACGGAA TGGATTGCGG ACGTGGAGAA ACTGCACCGT
CATGTATTTC CCCGTATTGC CGCTTATGCG GAGACGGGGT GGACCCGGAA GGAGGATAAG
AATTTTCAGG ATTTCCAGAG GCGTTTGACG GGGTATGAAA AAATTTTGGA TGCCCAGGGC
ATCCGGCACG GTGAAAAGGA GTAG
 
Protein sequence
MMFRHVSHVF LFLFSGLCGT AVNAQDQIIP RPVSVKVEAK GAATPVKLGE GMRIICKEKD 
GEFRRQAHLL QQFLSRGTGL ALDGAGGTGI IRVVKDAALK QYGPEAYRLE AAPGNIVISA
ATPKGVYYAG QSLAQMLPAA FFNNDADKKS VSWNVAEKPF SILDYPRFAW RAFMLDEARH
FFGEEEVKRL IDQMGLLKMN VLHWHLSDDA GWRIQIKKYP KLTSVGGKRR DTEIETWGSG
KYEGRPHEGF YTQEQVRRIV RYAADRNITI VPEIDIPGHS AAAIVSYPEL KLSARPFAEV
PVSFNDGAAF DPTSERTYQF LGDIMTELAS LFPGGIIHIG GDEVRYKKYW EGVPHIEAFM
KKKGIKTFPD LQIMFTNRIS GMLAGMGRRM MGWNEILGSD VHNDGGRGAA LGKLDANAII
HFWYGSDKIA AKAIREGRQV VNSTSHMTYI NKGYDKLPLS RSYSFEPVFA GLNPEQQKNV
IGLGCQVWTE WIADVEKLHR HVFPRIAAYA ETGWTRKEDK NFQDFQRRLT GYEKILDAQG
IRHGEKE