Gene Amuc_0369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0369 
Symbol 
ID6274905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp436606 
End bp438603 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content54% 
IMG OID642612420 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001876989 
Protein GI187734877 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.868004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCA TTATCCCCGC ACTCGCCCTG TCCGCCTGCC TTCCCGCAGC CTTCTCACAA 
GAACAAATCA TCCCGAAACC GGCGGAAATC ACATTGTTCA CAGGAAGTCC GGCCCGTCTC
ACACCGGATT CCCTCATCAT CACGGAAACA CAGGACAAAG CTTTCCTGGA TCAGGCAGGG
CAATTGCAGC AGATGCTCAG CGCAGGGACG GGACTCCCCC TTCCCCTTAA ACCGGCCGGG
CAAGCGTCGA AAAAAGCCGC CTGCATTGTC ATCAAAAAAG ACCCAGCCCT GGCCGCCAGG
GGAGAAGAAG CCTACTCCAT CCAATCATCC CCAAGTGGAA TCATCCTTTC CGCAGCCGAT
GCCAGGGGCA TCTTTTACGC GGGGCAAAGC CTGGTCCAGA TGATGCCCTC CGTCTTCCAC
GACCGGACGG GGGATAAATC CGCCGTCCGG TGGAATATTT CTGAAACTCC GTTCCGCATA
ACGGACTACC CGCGATTCTC CTGGCGGGCG CTGATGATTG ATGAAGCACG CCACTTCTTT
GGCGAGAAAA CCATTAAACA GATCATCGAC CAAATGGCTC TGCTGAAAAT GAACATCCTG
CACTGGCACC TGACGGACGA CACAGGATGG CGCATTGAAA TCAAGAAGTA TCCGCGCCTG
ACCTCCATCG GCTCCAAACG CAGGGAATCA GAAATCGGCA CATGGAACAG CGGCAAGTCA
GACGGAACGC CGCATGAAGG CTTTTATACC CAGGAACAGA TCAGGGATAT CGTACAATAC
GCAGCCCGCC GAAACATCAC CATCGTTCCG GAAATTGAAA TGCCGGGCCA TGCCAGCGCC
GCCGCCGTAG CATACCCCTT CCTGAGCCTG AAAACTCCCG GGGAAGTGCC CACAACATTC
ATCGTCAATA CGGCCTTCGA TCCCACCTCG GAAAAAACTT ATGCCTTCCT GTCCGATGTT
TTGGATGAAG TCACGGCGAT CTTCCCCGGC AGAATCATCC ACATTGGCGG AGATGAAGTG
CGCTATGACA AGCAATGGAA GGGGGTTCCG GAAATTGAGG AATTCATGAA AAAAAACGGC
ATGAAAAGTT ATGCGGACGT CCAGATGCAT TTCACCAACC GCATGTCCGG CATTATTGCC
CAAAAAGGGC GCCGCATGAT GGGATGGAAT GAAATTTACG GACATGACGT CAATGGGGAC
GGAGGAGGAA AAGCCGGCGC CAAACTGGAT ACAAACGCCG TCATCCAGTT CTGGAAGGGC
AACACCAGCC TGGCTAAAAA CGCCATCCGG GACGGGCATG ACGTTATCAA TTCCCTCCAC
ACCTCCACCT ATTTGGATTA CAGCTACGGC AGCATTCCCC TGCAAAAGGC ATACGGGTTC
GAACCCGTTT TCCCCGGGTT GGAGAAACAG TACCATTCCA GAGTCAAGGG ACTGGGCGCC
CAAGTATGGA CGGAATGGAT TTCCACACCG GAACGCCTGC ACTACCAGGC ATTCCCCCGT
GCCTGCGCCT TTGCGGAAGT CGGCTGGACT CCCGCTGGTA AAAAGGATTT TCCGGATTTC
AAAAAACGCT TGAAAGCGTA TAGCGAGCGT ATGGATCTGA TGGGGATCAA GTTTGCCCGG
AACGTCATCA GCCAAATAGA CAAATCTGAC TTTTTCAATA CGCCCAGGAT CGGCACATGG
ACGCCTGCCA CCCTGACCCG GGAGGAACAT TCGTTTGACG TTACCAAACT GGTCAAAGCG
TCCGGCAAAC ACACCGTCAC CCTGCTGTAC GACAAAGGCG CCCACGCCAT CGAAATCGAA
TCCGTAGCCC TGTATGAAAA TTCCCGGGAA GTCTCCCGGG ACGCCCATGC AGGCAGAAGC
GGCGCCCATA AGGAAAATAT CCAGTACATT CTGAATGCCC CGGCCCCTAG GCAAGGAGCA
ACCTATACGG TCAAAGCAAA CTTCAAAGGG GCCGGAGGCC GGGATTCCCA CGGGACAGTG
TATTTTGAAA CGCCATAA
 
Protein sequence
MKFIIPALAL SACLPAAFSQ EQIIPKPAEI TLFTGSPARL TPDSLIITET QDKAFLDQAG 
QLQQMLSAGT GLPLPLKPAG QASKKAACIV IKKDPALAAR GEEAYSIQSS PSGIILSAAD
ARGIFYAGQS LVQMMPSVFH DRTGDKSAVR WNISETPFRI TDYPRFSWRA LMIDEARHFF
GEKTIKQIID QMALLKMNIL HWHLTDDTGW RIEIKKYPRL TSIGSKRRES EIGTWNSGKS
DGTPHEGFYT QEQIRDIVQY AARRNITIVP EIEMPGHASA AAVAYPFLSL KTPGEVPTTF
IVNTAFDPTS EKTYAFLSDV LDEVTAIFPG RIIHIGGDEV RYDKQWKGVP EIEEFMKKNG
MKSYADVQMH FTNRMSGIIA QKGRRMMGWN EIYGHDVNGD GGGKAGAKLD TNAVIQFWKG
NTSLAKNAIR DGHDVINSLH TSTYLDYSYG SIPLQKAYGF EPVFPGLEKQ YHSRVKGLGA
QVWTEWISTP ERLHYQAFPR ACAFAEVGWT PAGKKDFPDF KKRLKAYSER MDLMGIKFAR
NVISQIDKSD FFNTPRIGTW TPATLTREEH SFDVTKLVKA SGKHTVTLLY DKGAHAIEIE
SVALYENSRE VSRDAHAGRS GAHKENIQYI LNAPAPRQGA TYTVKANFKG AGGRDSHGTV
YFETP