Gene Amuc_0451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0451 
Symbol 
ID6275686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp536416 
End bp538575 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content56% 
IMG OID642612501 
Productsulfatase 
Protein accessionYP_001877070 
Protein GI187734958 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGG TATCTTTACT ATCCGTTCTT CTGACTTCCC TGGTTCCGTG CATGGGCGCT 
TCCGGGCAGG CAGCGAAGCC GGCTGCCCGG TCTGTCAAAA CTTCAAAACC CAATGTTATT
TTCATTCTGG TGGACGACAT GGGCTGGGGT GATCTGGATT CCAACTGGAG CCAGCAGAAG
CTGAATGGCC GGACGGTGGA GAGAAAGAAC GAGTTCAAGA CTCCTGCCCT GTCCGCTCTG
GCCCGGGAGG GGATTCAGCT GCGCCGTCAT TATTCCGCCG CTCCGGTTTG CGCTCCGGCG
CGCGCCTCCC TGCTGCTGGG GGTGCACCAG GGGAATTCCC GCGTGGTGAG AAACAACCGT
TTCGACCAGC CTATTGAAGA TTCCCATACG CTGGGTACCG TGATGCGTGA TGCCGGGTAT
GATACTGCCG CCATTGGCAA ATGGGGTGTA GGCGGCGGAG GCCAGAGCCA TGTGCCCATG
ACCGCCGGCC CCCATCAGCG GGGATTCAAT TATTTTTATG GTATCCTGGA CCATTTGGCA
GGGCATTTCC ATTATCCTTC CGAATCCCGT GACGTTTTCG AGTACAACGG TTACGCTTCC
AATCCCGAGT GGAAAAATAT CAAGGACCAG GTGCCCCAGA CGGCTTATTC CACGGATTTA
TTTGCCGCCT GCGCCAAGCA GTGGATTGTG GACCAGCGCA AGTCCGCCCG CAAGACCGGA
AAGCCGTTTT TCCTGTACCT GGCTTTTCCG GCGCCCCACG GCAATCTGGT GGTTCCGGGG
GTCCCCTATC CTTCCGGCGG CGGGTTGAAA GGAGGCCTGC AATGGGTAAA GAAGGAAGGC
ACAGAATCCG TGAATACGGC TTTTGACGCC AGGGCGGAAA AGAATAAGGA TACCTATATT
CATCCGGACA ATTCCCGTTT CCCGAATGAT GTGGCCAAAC GTCATTCCAC CATGATCCGC
CGCGTGGACG ACGCCGTAGC TGATTTGATC CGTCTGCTGA AAGATTTGAA GATTGACGAT
AATACGATGA TCGTCTTCAC GTCCGACAAT GGCCCCCACA ATGAGGGCGG TTCCGACTCC
AGGCACTGGC ACGGAGCGCA AAATCCGCAG TTTTTCAAGA GTTATGGCAT GATGGACGGC
ATTAAGCGCG ATTGCTGGGA AGGTGGCATG CGCGTGCCTA CGCTGGTACG CTGGCCCGCC
CGTATTCCCA AAGGGCAGGT CAGCCTGCAT CTTGGACAGT TCCATGACTG GCTGGCTACG
CTGGCGGATG TCGCCGGGGT GCCGGTTCCT GCCCGCAGTG ACGGCGTTTC CCTGCTGCCG
ACGCTGACCG GTCATGCGGA CCAGCAGAAG CCCGGCATTG TTTATGCCGA ATATAATTTC
GCCGGCAAAA CGCCGGAGTA TAAGGATTTC CTGGGCGAAC ACAAGGGAGC GCAGCGAGGC
CAGCAGCAGA TTGTCTTTGT GGATGGATTG AAGGGCCTGC GCATGGGGGT GAAGGATGCG
GACAAGGATT TCATGATTTT TGATACTTTG AATGATCCGC AGGAAAGCAA AGATCTCGCA
TCCTCCAAGC CGGAACTTCA GGCCCGCATG AAAGCCGCCG CTTTGTCCAA TCGCCGGGCT
TCCCTTCCTT CCAAAACAGT GTTTGATTCT GCGCTGGTGC CTGCCGTGGA TGTGAAGGGA
ACCGTTTCTC CCGGGTTGCA ATGGGCCTTG TATGAGGGGG ATTTTCCCTG GGTACCGGAT
TTTCGCCAGT GGAAGAAGCC TGCTTCCGCC CATGGCGTGA CGCCTTCCCC ATCCGTGAAA
ATGAACGGTC CGGCCAAGCG CGGTGTGGAG CTGACGGGAT ATGTGAAGAT TCCTGAGGAC
GGAGCATATA CGTTTTATCT GACCACGGAT GAAAATAAGG GCAGCAAGGC TTTTGTCCGT
CTGCACGGCA TGGAACTGAT TGACGCGGAC AAAACTTATG AGCCGGGGGC CGAGGTCTCC
TCCGATTTGG GGGACCGGAA GAATCCCGTT TATTTGAAAG CCGGACTTCA TCCCATCCGC
ATCGGCTATG TGGGGAACTC CGGTACTGCC TCCAAACTGG TTTTGAAGTG GGAAGGCCCC
GGTTTGTCCA AGCAGGAGAT TCCCGCCTCC GCCTTCAGTC ACGGGAAGGA ATCCAGGTAA
 
Protein sequence
MKLVSLLSVL LTSLVPCMGA SGQAAKPAAR SVKTSKPNVI FILVDDMGWG DLDSNWSQQK 
LNGRTVERKN EFKTPALSAL AREGIQLRRH YSAAPVCAPA RASLLLGVHQ GNSRVVRNNR
FDQPIEDSHT LGTVMRDAGY DTAAIGKWGV GGGGQSHVPM TAGPHQRGFN YFYGILDHLA
GHFHYPSESR DVFEYNGYAS NPEWKNIKDQ VPQTAYSTDL FAACAKQWIV DQRKSARKTG
KPFFLYLAFP APHGNLVVPG VPYPSGGGLK GGLQWVKKEG TESVNTAFDA RAEKNKDTYI
HPDNSRFPND VAKRHSTMIR RVDDAVADLI RLLKDLKIDD NTMIVFTSDN GPHNEGGSDS
RHWHGAQNPQ FFKSYGMMDG IKRDCWEGGM RVPTLVRWPA RIPKGQVSLH LGQFHDWLAT
LADVAGVPVP ARSDGVSLLP TLTGHADQQK PGIVYAEYNF AGKTPEYKDF LGEHKGAQRG
QQQIVFVDGL KGLRMGVKDA DKDFMIFDTL NDPQESKDLA SSKPELQARM KAAALSNRRA
SLPSKTVFDS ALVPAVDVKG TVSPGLQWAL YEGDFPWVPD FRQWKKPASA HGVTPSPSVK
MNGPAKRGVE LTGYVKIPED GAYTFYLTTD ENKGSKAFVR LHGMELIDAD KTYEPGAEVS
SDLGDRKNPV YLKAGLHPIR IGYVGNSGTA SKLVLKWEGP GLSKQEIPAS AFSHGKESR