Gene Amuc_1032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1032 
Symbol 
ID6274083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1224627 
End bp1226183 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content55% 
IMG OID642613081 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001877639 
Protein GI187735527 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA TTTTGCTAGC CGCAGCGTTC CTGGGCTCCC TCTGCTGGGC CGGAACCAAT 
CCCTACAACA TTATTCCGGA GCCCGTCAAC GTGACGACAA CTTCCGGAAC TACCAAAAAC
CTCAAAATCG TCCATGAGCA AAAAGTCGCC GGACTGGGCA ATGAAGGGTA TGCCATGAAA
CTAACGCCCG GCGGCGTGGA ACTCCGTTAT ACCACGCCCA ACGGGAAGGC CATGGCCATG
GCGACCCTGT TCCAGCTCCA GGACCAGCTT TCGGATACTC CTGAGGGACT TCCCTGCGGC
AGCATCCAGG ATTCCCCCGA CTTCGGCTGG CGCGGCATGA TGGTTGACGT AGGCCGCTAC
CACTATCCCA TGAAGGAGAT CTACAATTTT GTGGACGCCA TGCATTATTA CAAATACAAC
GTCCTGCATC TCCATCTGAC GGAAGACCAG GGCTGGCGTC TCCCCGTTCC GGGCTACGAC
AAGCTCCGCA CCATCGGCGC CGTCCGCCCC TCCGCTCCGG AAAGCCAGAA CAACTCCCTG
CTGGCCAATG AAGGCATGTA TACCAAGAAG GAGCTCCAGG ACCTGGTAGC CTACTGCAAA
GCGCGCGGCA TCCAGGTACT GCCGGAAGTG GAAATGCCGG GTCATAACAT GGCCCTGGCC
GCGTCCTATC CCGAATTCTG CTGCAACACC AAACGGGCCC AGGTATGGAC GCACGGCGGT
GTTTCCTCCA AGCTGATTTG CCCGCAGAAA CCGGCCACTA AAAAGTTTCT TAAGGATACC
TTCAATACCG TCCAGCAGAT ATTCCCTTTC CCGTACATCC ACATCGGAGG TGACGAATGC
CCCATGGGGG ACTGGAAGAA GTGCCCGGAC TGCCAGGCCG CCCGAGCCAA AAAGGGCCAG
GGGGATAATG TGGAAGCCCA GATGAGCGAT TTCACGAAAA GCCTGACGGC CATGCTCGCC
AAGCACCGGA AAAAGCCCAT CCTGTGGTAT GACATCAACA AGAGCTATTA CCACAAGGGG
GAAACCGTCA TGTCCTGGCT GCCGGGAGAA TTCCCGCGTT GCATTGATAA GACGAAGGAA
CAGGGCATCG ACCTCATCGT CACCCCCCAG TTCAAGTATT ATCTGGCGCG TACCCAGATG
AAATTCCCGG CGGACGACGT GCGCGCCCGG CCCGGTGGAG CTCCCATCCT GCTGAAAGAC
TGCTACAACT TCGATCCCCG CAACGGACGG GACAAGAATG ACGTCAAGCA CATCAAGGGA
ATCAACCTCT GCATGTGGGC GGAATGGATT CCCTCCGGCG AATTGCTGAT GTACATGACC
TACCCCCGCG CCATGGCTGT TTCCGAAACC GCGTGGGGCA GCCACAAGAA CCGTCCAAGC
CTGGAAGAGT TTGAAAAGAA AATGGAAACC CACAAGAAAC ATTTCCAGAA GCGTTTCGGC
TATACTCTGG AACGCACTGT GGAAAACAAA CCCTACCGGG AAAAATTCAT CACCCAAGAG
GAAATCGAAC GTATTAACGA GAATTATAAA AAGGGCCAGC AAAACGCGGA CAAATAG
 
Protein sequence
MKSILLAAAF LGSLCWAGTN PYNIIPEPVN VTTTSGTTKN LKIVHEQKVA GLGNEGYAMK 
LTPGGVELRY TTPNGKAMAM ATLFQLQDQL SDTPEGLPCG SIQDSPDFGW RGMMVDVGRY
HYPMKEIYNF VDAMHYYKYN VLHLHLTEDQ GWRLPVPGYD KLRTIGAVRP SAPESQNNSL
LANEGMYTKK ELQDLVAYCK ARGIQVLPEV EMPGHNMALA ASYPEFCCNT KRAQVWTHGG
VSSKLICPQK PATKKFLKDT FNTVQQIFPF PYIHIGGDEC PMGDWKKCPD CQAARAKKGQ
GDNVEAQMSD FTKSLTAMLA KHRKKPILWY DINKSYYHKG ETVMSWLPGE FPRCIDKTKE
QGIDLIVTPQ FKYYLARTQM KFPADDVRAR PGGAPILLKD CYNFDPRNGR DKNDVKHIKG
INLCMWAEWI PSGELLMYMT YPRAMAVSET AWGSHKNRPS LEEFEKKMET HKKHFQKRFG
YTLERTVENK PYREKFITQE EIERINENYK KGQQNADK