Gene Amuc_0953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0953 
Symbol 
ID6274210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1134988 
End bp1139232 
Gene Length4245 bp 
Protein Length1414 aa 
Translation table11 
GC content60% 
IMG OID642613007 
Productsulfatase 
Protein accessionYP_001877566 
Protein GI187735454 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.462358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCCGC ACCGTTTTTT TTCCGTTCTC CTCTTTTTTC TTCTGTTCCT TGCTCCAGCC 
AGGTCTGCTG ATGTTGAATA TGTCTGGAGG GGTGTCGATT ACCAGTGGGG TTCCCTGTCC
AACTGGAGTG TCGGAGGCAT TGCCGCCTCT TCCGCGCCAG GGGCCGCCGC TTCCGCCTAT
GAACATTGGA TGGTAACCAA TGGAACGGAT TCCGCGGGAA GGACGGATGT CGGCGGATTG
GGCGCGGGGG GCCGCTATTT GAAGGGCGTC AGGATTGAGG GGCTTAACAG CCAGCCGGAA
GGCAAAATCC CATTGTTCAT TAAAAACACG AACAAGGATG TATATTTGCG CGTGGAGGAG
GGCGGCATTA CCGTGGAAAA TGCCGGAGAA GGCGGTTATT CTGCGGATTT CGGGGTCGCC
CAGCTGCGCG TAGCCGCGGA CCAGGAGTGG CATGTGGCTG AAGGGCGCAG CCTGTATGTG
GGGCATGACG ATGACGCTCC GTCCGGGGGA TTGTGTTCCC TGACTTCGGA AGGCGATGTT
CCGCGGCGCG TGACAGTGAC GGGCGGAGGC GCCGTGCGCA TCGGGGAAGG GATGCTGCTG
AACAATATTT CCGGCCTCAT CGGTTTTGTG CTGAACGCCG GAAAGGGAAT TCCGACGCTG
GATCTGGCGG ACCGGGGCAT GGGCAATACG GTGACGGTGG AAGACGCTGC CCGCCTGGAA
GGCATGTCTT TGTACCAGGG GGCCCTGGTG ACGCGGGAAA ACGCGTCCGT GACTTTTTCC
GGAACGGAGG CCAAGGCGTC CGGACAATGG AACATTGGCG CGGACACGGA GTTGGCGCTG
GAAAATTCCA CGCTGGACCT GACGGAGGCG GGTGTGGACG GCAACGTGAT TCTTTCCGGC
TCTTCCGGCA TTACCGGGGA TAAGGGAACG CTGCGGCAGA CGCTTCTGGA TGATGCCCGG
GTCACTTATA CGGACAGGAA TGTCAAGGCG GGGGAAATCC GGAGCGTAGG CAGCAATGTG
ACGATAACGC TGAATAATTC CTCCATGCAT TTTGACGGAG AAATTCCGGA GGTGGACCTG
GTGGTGCAGG GCAGCTGCGC ACTGGGCGGC AGCGGCGCCT TTTCAGGAAC GATTACTTAT
GCGCCGGGAG GCACCCTGAC TGTGGATGGG GATATTTCCC TGCCCAGTTC ATCCCTGACG
CTGGGGCGTC TCCATGTGGC TACGCTGGAC GGCGTGCCGC AAAACTCCGC CGTCCGGCCG
GAAACCCCTG CCCAGCAGGC AAGGGAATAT GTGGTTTTGT TTGATTCCTC CTTTGAGGAT
TTGATTTCCG AATGGCCGGG CAGGCATACG CTGGGCGTGC AGTTCCGGCC GGATATGGGC
GCCCCCGTCA CCGTAAAGGA ACTGCCTGAA GGGGCGGTTT CATGGAATTA TGATGCCGCA
TCCGGCTGGC TGACGCTGCA GACGGATGTA GCGGAAAAGC CGGACATGCC GGCGCATGTT
TCCGGAGCCA GGCCCAACAT TATTTTTGTC CTGGTGGATG ACATGGGCTG GGGGGATATG
AGCGTCAACT GGACGCAGCA GGATAAAAAC GGCAGGCAGG TGACCCGCAG GAATGAATTC
AAGACGCCTA CCCTGGACAC GATGGCGGCG GAGGGCATGC AACTGAGGCG CCATTACAGT
GCGGCTCCGG TATGCGCTCC GGCCCGCGCT TCCCTGCTGC TGGGCGTTCA CCAGGGGCAT
TCCCGGGTGG TGCGCAACAA TACGTTTGAT TACCCTATCG AGAATTCCCA TACGCTGGGA
ACGCTGCTGA AAAGCGCGGG TTACCATACG GCGGCCATCG GGAAATGGGG CGTGGGCGGA
GGCGGACAGA GCGGCAAGGC TCTGACCGCG GCTCCCCACA TGCGCGGGTT TGATTATTTT
TACGGAATCA TCAAGCATTT GGCGGGGCAT TACCATTATT TGACGGCGGG TTCCCAGGAT
ATTTACGAAT ATAATGACCA GGCCGCGGCT CCGGCGTGGG AGAGCGTCAG CGCGAAAGTT
CCGGGCACCG CCTATGATAC GGATTTGTTC GGGGCCCGCG CCAAACAGTG GATTGTGGAC
CACCATGAGA AAACTCCTTC CCGGCCTTTC TTCCTGTATC TGGCTTTCCC GGCTCCCCAC
GGCTGTCTGT CCGCTCCCGC CTGTGCCTAT CCGGCGGGGC TGGGAAGGGA CGGCGGCCTC
CAGTGGGAGA AAAAAGACGG CTATGAAGCC TGCAATACCG CTGCGGACGG ATGGTCGGGG
GTGGAGGGGG ATTTTTCTCC GGATACCTAT ATCTACCCGG AACATGAGGT TTTTGGAAGA
ACGGAATCCC GCCACGCCAC GATGATTCGC CGGGTGGATG ACGTCCTGAA GGACATGATC
CAGCTGCTTA AGGATTTAGG CATTGATGAC AATACGATGA TTGTCTTCAC GTCCGACAAC
GGCCCCCACA ATGAACCCGG TTCCGGCAAT TACGGCAACG GCGCGCAGGA TCCCGCGTAT
TTCATGAGCT ACGGGATGAT GGACGGCATT AAGCGCGATT GCTGGGAAGG CGGCATGCGT
GTTCCGGCCG TGGTACGCTG GCCCGGTGTG GTTCCCAGCG GAATCAGTTT GAATGCCAGC
CAGTTCCAGG ACTGGATGGC TACTTTTGCG GACGTGGCCG GAGTGCCCGT GCCGTCCCGT
TGTGACGGCG TTTCCCTGCT GCCTACGCTG GCAGGGGTGC CGGAACGCCA GAAAACGGGC
GTTATTTATG CGGAATATGC CTATAGCGGG AGCACTCCGA ATTACAGCGA TTTCCTCCAG
CAACACAGGG GGAGAGGGCG GCAGCAGCAG CAGATTATAT TTGTGGACGG CTTCAAGGGG
ATTCGGATGG GGAATGCCGC CCCCGCTACG GATTTTGAAA TCTACGATAC GGAAAAAGAC
CCGCAGGAAG CGGCGAATCT GGCGGCATCC CGTCCGGACC TGCAGGAGAA GATGAAGGCG
CAGGCGCTGC GCTCCCGCCG TTCCTCCCCC ATTGCCTCCA CCAGTTTTGA CACCAGTTAT
ATCGCGCCCG TTTCCGCCCC CGCTGGCCTG CGGGAGGGAA GGCTGCGCTG GCGCGCGTGG
AACCGCTCTT TTGACTGGGT GCCGGATTTC CGGCAGCTGG AGGAGGGGCC CTGTTCTACG
GGTGCGACGG ATGTTTCCGA CGTGCTCAGC GTCAAGGCGG GGCTTTCCGG TCAGAAGGGG
GTGGAATTGA CGGGCTATCT CAGGGTGCCT GTCACCGGGG AATATCAATT TTACCTTCAG
ACGGATTCCA ATGAAGGCTC CAGGGCTTTT GTCCATTTGC ACGATATGCA GCTCATTGAT
GCGGATTATG CCTATACTCC CGGAACGCAG GCTGCCTCCA ACGCGCGGGA AGGCGTTTCC
GGCGATGTGC AGCCCAATGC CGTTCAGACG GTGAAGCTTA CCGCGGGGCT GCATCCCATC
CGCATCGGCT ATGTGGGGAA GGCGGCTTCC TCCTCCCTTT CCCTGCAATG GGAGGGGCCG
GAAACCAGCG GCAGGGAACC CATTCCCGCC AGCGCGTTTT CCTATGAATA CGTGAATCCC
TTCAATCTGG AAAAGACGGA GGAAACGGTA GGATGCGCCG CCTCCGGTAC GGGGCTGACC
GTGCAGACGC ATCTTCCCTG GACAGTTTCC TGCGACCAGC CATGGGTTAC GGTTTCCCCC
GTTTCCGGTT CCGGAACCTC AGTGCTGGAT ATTGCCGTGG AAGCAAACGG GCTGCAAACG
GAGCGGGAAG CCGTTGTCAC CGTCGTATGC GGCGGGGAGC AGAGGACATT CACGCTGCGC
CAGGAAGCTG CTCCTGCCCC GGCAGGGTAC GACAAGTGGA AGCAGGACCA TTTTGCGGAT
GGGACGCCGG ATGAGCAAAT GGCCCCGGAT GCCTGTCCCG CCGGAGACGG AATTTCCAAT
CTGATGAAAT ATGCTACGGG CATGGATCCC AACCAGCCCT GCGGGAGCGT GACGAAGCTG
GCTGTCCGCG AGGAAGCGGG CGGGAAATAC CTCGTGCTTT CCTGGCCTGT GAATCCGGAG
GCGACGGATG TGACGTTCCA TGTGGAAAGT TCTTCCGACC TGGAGGAATG GGCTGACGAA
GGGGCCGTCA CTCCGGACGG GGCCTGCGGG GAATTCCGCG ATACGGTTGC GCTGGGAAAA
GGCTCTCCGG AGCGCCGGTT CCTGAGATTA AAGGTGACGA GATAG
 
Protein sequence
MFPHRFFSVL LFFLLFLAPA RSADVEYVWR GVDYQWGSLS NWSVGGIAAS SAPGAAASAY 
EHWMVTNGTD SAGRTDVGGL GAGGRYLKGV RIEGLNSQPE GKIPLFIKNT NKDVYLRVEE
GGITVENAGE GGYSADFGVA QLRVAADQEW HVAEGRSLYV GHDDDAPSGG LCSLTSEGDV
PRRVTVTGGG AVRIGEGMLL NNISGLIGFV LNAGKGIPTL DLADRGMGNT VTVEDAARLE
GMSLYQGALV TRENASVTFS GTEAKASGQW NIGADTELAL ENSTLDLTEA GVDGNVILSG
SSGITGDKGT LRQTLLDDAR VTYTDRNVKA GEIRSVGSNV TITLNNSSMH FDGEIPEVDL
VVQGSCALGG SGAFSGTITY APGGTLTVDG DISLPSSSLT LGRLHVATLD GVPQNSAVRP
ETPAQQAREY VVLFDSSFED LISEWPGRHT LGVQFRPDMG APVTVKELPE GAVSWNYDAA
SGWLTLQTDV AEKPDMPAHV SGARPNIIFV LVDDMGWGDM SVNWTQQDKN GRQVTRRNEF
KTPTLDTMAA EGMQLRRHYS AAPVCAPARA SLLLGVHQGH SRVVRNNTFD YPIENSHTLG
TLLKSAGYHT AAIGKWGVGG GGQSGKALTA APHMRGFDYF YGIIKHLAGH YHYLTAGSQD
IYEYNDQAAA PAWESVSAKV PGTAYDTDLF GARAKQWIVD HHEKTPSRPF FLYLAFPAPH
GCLSAPACAY PAGLGRDGGL QWEKKDGYEA CNTAADGWSG VEGDFSPDTY IYPEHEVFGR
TESRHATMIR RVDDVLKDMI QLLKDLGIDD NTMIVFTSDN GPHNEPGSGN YGNGAQDPAY
FMSYGMMDGI KRDCWEGGMR VPAVVRWPGV VPSGISLNAS QFQDWMATFA DVAGVPVPSR
CDGVSLLPTL AGVPERQKTG VIYAEYAYSG STPNYSDFLQ QHRGRGRQQQ QIIFVDGFKG
IRMGNAAPAT DFEIYDTEKD PQEAANLAAS RPDLQEKMKA QALRSRRSSP IASTSFDTSY
IAPVSAPAGL REGRLRWRAW NRSFDWVPDF RQLEEGPCST GATDVSDVLS VKAGLSGQKG
VELTGYLRVP VTGEYQFYLQ TDSNEGSRAF VHLHDMQLID ADYAYTPGTQ AASNAREGVS
GDVQPNAVQT VKLTAGLHPI RIGYVGKAAS SSLSLQWEGP ETSGREPIPA SAFSYEYVNP
FNLEKTEETV GCAASGTGLT VQTHLPWTVS CDQPWVTVSP VSGSGTSVLD IAVEANGLQT
EREAVVTVVC GGEQRTFTLR QEAAPAPAGY DKWKQDHFAD GTPDEQMAPD ACPAGDGISN
LMKYATGMDP NQPCGSVTKL AVREEAGGKY LVLSWPVNPE ATDVTFHVES SSDLEEWADE
GAVTPDGACG EFRDTVALGK GSPERRFLRL KVTR