Gene Amuc_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2018 
Symbol 
ID6274502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2450608 
End bp2452089 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content58% 
IMG OID642614078 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001878609 
Protein GI187736497 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.191644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00584388 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACGTC CCTTACCCAT TCTCGGCGGC ATTCTGCTAT CCTTCTCCCC TCCGGCAGAA 
GCAACAGCCC AATACAGCAT TATCCCTGAG CCGTCCAGAA CGGAACTCAG ACAGGAAACA
GCTAAAACTT TACAGCTTCT TTCCGACCAG GAAGTTCCGA CCCTGGAAAC GGACGCCTAC
CGGCTCACGG TCACCCCGCA GGGGGCGCAC CTTGCTTCCG GAGGAAGGGA AGGCAGAATT
TACGGGCTGG CAACCCTCCG CCAGCTCCGG GACCAGCTGG CGGGACAGCC GGAGGGCATT
CCCTGCGGCG TCATCACGGA CAAGCCGCGC TATCCGTGGC GCGGCCTCAT GGTGGATCCC
GCGCGGCATT TCATCCCCGC GGCCGATCTG AAAAAATTTG TGGATATGAT GGCCTACTAC
AAATTCAACA GGCTGCACCT GCATCTGACG GACAACCAGG GCTGGAGGCT GCCCGTGCCC
GGCTACCCCA AATTGAAAAG CGTCGCATCC AGGCGGGAGG AAAGCTTCGG AGACGGAATC
CCCCACGAAG GGATGTACAC CAAACAGGAA CTGAAGGAAC TGGTGGCGTA CTGCGCAGCG
CGCGGCATTG ATGTCATCCC TGAAATAGAC ATGCCGGGCC ACAACCAGGC GCTTCATGCC
GCCTACCCGG AATTTTTCTG CTTCCCCAAA CCGGACATGA ACGTGCGGAC GACAGCGGGA
AACAGCAAGG AACTGGTCTG TCCCCAGAAG CCGGAAGTCT GGAAATTTTA TGCCTCCGTC
TTTAATGAAC TCAAGGATAT CTTCCCGTCC GGTATCGTTC ATCTGGGCGG GGACGAGGCC
CCCACGGAAC TCTGGGAAAA ATGCCCTCTG TGCCGGGAAG CCCGGACCAG GGCAGCCATG
AAAGACGAAC AGGAACAGAT GAAAGCCTTT TTTGCGAAAA CGGCAGCTCT GCTTGCCAAA
AACGGGCAAA CGCCGCAATT CTGGTATGAG GGGAACGCCG GCATTTACCA TCCGGGGGAA
ACGGTTTACG CATGGCGGCA AGGCCAGGCC CTCCAGTCCA TTGAGAAGAC GAAAAAGGCG
GGATTGAACC TGATTATGGC CTCCAGCGAA TACTGTTACC TGGATTTTCC CCAGATTCAG
GGGCAGCGCA ACTGGGGATG GATGAAAACC ACCACCCTGC AAAAATGTTA TGACCTGGAT
CCCGCTTTTG GAAAACCGGA GAAAGAGGCA GGCCATATCC GGGGCGTGCA TGCCCCCGTA
TGGGCGGAAC GCCTGCCGGA CTTGAACCAC TTGCTTTACC GCGCCTATCC CCGCGCCTGC
GCCATTGCGG AAGCCGGCTG GTCACCGATG GGCGTGCGCT CCTGGGAAAA CTTCCGGCGC
AAGCTGGCCG ACCACCGTCA ATTCATCCTC AAACGCTTCA ATTATGATAT GGAGCGCACT
CAGGGGAATG AACCGGCCTT CCGCTGGGAA AACAACAAGT AA
 
Protein sequence
MARPLPILGG ILLSFSPPAE ATAQYSIIPE PSRTELRQET AKTLQLLSDQ EVPTLETDAY 
RLTVTPQGAH LASGGREGRI YGLATLRQLR DQLAGQPEGI PCGVITDKPR YPWRGLMVDP
ARHFIPAADL KKFVDMMAYY KFNRLHLHLT DNQGWRLPVP GYPKLKSVAS RREESFGDGI
PHEGMYTKQE LKELVAYCAA RGIDVIPEID MPGHNQALHA AYPEFFCFPK PDMNVRTTAG
NSKELVCPQK PEVWKFYASV FNELKDIFPS GIVHLGGDEA PTELWEKCPL CREARTRAAM
KDEQEQMKAF FAKTAALLAK NGQTPQFWYE GNAGIYHPGE TVYAWRQGQA LQSIEKTKKA
GLNLIMASSE YCYLDFPQIQ GQRNWGWMKT TTLQKCYDLD PAFGKPEKEA GHIRGVHAPV
WAERLPDLNH LLYRAYPRAC AIAEAGWSPM GVRSWENFRR KLADHRQFIL KRFNYDMERT
QGNEPAFRWE NNK