Gene Information |
Locus tag | Amuc_0451 |
Symbol | |
ID | 6275686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 536416 |
End bp | 538575 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612501 |
Product | sulfatase |
Protein accession | YP_001877070 |
Protein GI | 187734958 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
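The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(536416, 538575)
print(length, protein_length_aa(length))  # 2160 719
```

The result matches the Gene Length (2160 bp) and Protein Length (719 aa) rows.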
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGG TATCTTTACT ATCCGTTCTT CTGACTTCCC TGGTTCCGTG CATGGGCGCT TCCGGGCAGG CAGCGAAGCC GGCTGCCCGG TCTGTCAAAA CTTCAAAACC CAATGTTATT TTCATTCTGG TGGACGACAT GGGCTGGGGT GATCTGGATT CCAACTGGAG CCAGCAGAAG CTGAATGGCC GGACGGTGGA GAGAAAGAAC GAGTTCAAGA CTCCTGCCCT GTCCGCTCTG GCCCGGGAGG GGATTCAGCT GCGCCGTCAT TATTCCGCCG CTCCGGTTTG CGCTCCGGCG CGCGCCTCCC TGCTGCTGGG GGTGCACCAG GGGAATTCCC GCGTGGTGAG AAACAACCGT TTCGACCAGC CTATTGAAGA TTCCCATACG CTGGGTACCG TGATGCGTGA TGCCGGGTAT GATACTGCCG CCATTGGCAA ATGGGGTGTA GGCGGCGGAG GCCAGAGCCA TGTGCCCATG ACCGCCGGCC CCCATCAGCG GGGATTCAAT TATTTTTATG GTATCCTGGA CCATTTGGCA GGGCATTTCC ATTATCCTTC CGAATCCCGT GACGTTTTCG AGTACAACGG TTACGCTTCC AATCCCGAGT GGAAAAATAT CAAGGACCAG GTGCCCCAGA CGGCTTATTC CACGGATTTA TTTGCCGCCT GCGCCAAGCA GTGGATTGTG GACCAGCGCA AGTCCGCCCG CAAGACCGGA AAGCCGTTTT TCCTGTACCT GGCTTTTCCG GCGCCCCACG GCAATCTGGT GGTTCCGGGG GTCCCCTATC CTTCCGGCGG CGGGTTGAAA GGAGGCCTGC AATGGGTAAA GAAGGAAGGC ACAGAATCCG TGAATACGGC TTTTGACGCC AGGGCGGAAA AGAATAAGGA TACCTATATT CATCCGGACA ATTCCCGTTT CCCGAATGAT GTGGCCAAAC GTCATTCCAC CATGATCCGC CGCGTGGACG ACGCCGTAGC TGATTTGATC CGTCTGCTGA AAGATTTGAA GATTGACGAT AATACGATGA TCGTCTTCAC GTCCGACAAT GGCCCCCACA ATGAGGGCGG TTCCGACTCC AGGCACTGGC ACGGAGCGCA AAATCCGCAG TTTTTCAAGA GTTATGGCAT GATGGACGGC ATTAAGCGCG ATTGCTGGGA AGGTGGCATG CGCGTGCCTA CGCTGGTACG CTGGCCCGCC CGTATTCCCA AAGGGCAGGT CAGCCTGCAT CTTGGACAGT TCCATGACTG GCTGGCTACG CTGGCGGATG TCGCCGGGGT GCCGGTTCCT GCCCGCAGTG ACGGCGTTTC CCTGCTGCCG ACGCTGACCG GTCATGCGGA CCAGCAGAAG CCCGGCATTG TTTATGCCGA ATATAATTTC GCCGGCAAAA CGCCGGAGTA TAAGGATTTC CTGGGCGAAC ACAAGGGAGC GCAGCGAGGC CAGCAGCAGA TTGTCTTTGT GGATGGATTG AAGGGCCTGC GCATGGGGGT GAAGGATGCG GACAAGGATT TCATGATTTT TGATACTTTG AATGATCCGC AGGAAAGCAA AGATCTCGCA TCCTCCAAGC CGGAACTTCA GGCCCGCATG AAAGCCGCCG CTTTGTCCAA TCGCCGGGCT TCCCTTCCTT CCAAAACAGT GTTTGATTCT GCGCTGGTGC CTGCCGTGGA TGTGAAGGGA ACCGTTTCTC CCGGGTTGCA ATGGGCCTTG TATGAGGGGG ATTTTCCCTG GGTACCGGAT TTTCGCCAGT GGAAGAAGCC TGCTTCCGCC CATGGCGTGA CGCCTTCCCC ATCCGTGAAA 
ATGAACGGTC CGGCCAAGCG CGGTGTGGAG CTGACGGGAT ATGTGAAGAT TCCTGAGGAC GGAGCATATA CGTTTTATCT GACCACGGAT GAAAATAAGG GCAGCAAGGC TTTTGTCCGT CTGCACGGCA TGGAACTGAT TGACGCGGAC AAAACTTATG AGCCGGGGGC CGAGGTCTCC TCCGATTTGG GGGACCGGAA GAATCCCGTT TATTTGAAAG CCGGACTTCA TCCCATCCGC ATCGGCTATG TGGGGAACTC CGGTACTGCC TCCAAACTGG TTTTGAAGTG GGAAGGCCCC GGTTTGTCCA AGCAGGAGAT TCCCGCCTCC GCCTTCAGTC ACGGGAAGGA ATCCAGGTAA
|
Protein sequence | MKLVSLLSVL LTSLVPCMGA SGQAAKPAAR SVKTSKPNVI FILVDDMGWG DLDSNWSQQK LNGRTVERKN EFKTPALSAL AREGIQLRRH YSAAPVCAPA RASLLLGVHQ GNSRVVRNNR FDQPIEDSHT LGTVMRDAGY DTAAIGKWGV GGGGQSHVPM TAGPHQRGFN YFYGILDHLA GHFHYPSESR DVFEYNGYAS NPEWKNIKDQ VPQTAYSTDL FAACAKQWIV DQRKSARKTG KPFFLYLAFP APHGNLVVPG VPYPSGGGLK GGLQWVKKEG TESVNTAFDA RAEKNKDTYI HPDNSRFPND VAKRHSTMIR RVDDAVADLI RLLKDLKIDD NTMIVFTSDN GPHNEGGSDS RHWHGAQNPQ FFKSYGMMDG IKRDCWEGGM RVPTLVRWPA RIPKGQVSLH LGQFHDWLAT LADVAGVPVP ARSDGVSLLP TLTGHADQQK PGIVYAEYNF AGKTPEYKDF LGEHKGAQRG QQQIVFVDGL KGLRMGVKDA DKDFMIFDTL NDPQESKDLA SSKPELQARM KAAALSNRRA SLPSKTVFDS ALVPAVDVKG TVSPGLQWAL YEGDFPWVPD FRQWKKPASA HGVTPSPSVK MNGPAKRGVE LTGYVKIPED GAYTFYLTTD ENKGSKAFVR LHGMELIDAD KTYEPGAEVS SDLGDRKNPV YLKAGLHPIR IGYVGNSGTA SKLVLKWEGP GLSKQEIPAS AFSHGKESR
|
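The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any computation. The following is a minimal sketch of that step, with a GC-content helper and a stop-codon check; the function names are hypothetical, and the demo uses only the first two blocks and the last block of the gene sequence rather than the full 2160 bp.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases are TAA, TAG, or TGA (stops in translation table 11)."""
    return seq[-3:] in {"TAA", "TAG", "TGA"}


# First two blocks of the gene sequence (demo only, not the whole gene).
head = clean_sequence("ATGAAGCTGG TATCTTTACT")
print(len(head), gc_percent(head))  # 20 35.0

# Last block of the gene sequence.
print(ends_with_stop(clean_sequence("ATCCAGGTAA")))  # True
```

Running `gc_percent` on the complete cleaned gene sequence should give a value comparable to the GC content row, which the record reports rounded to a whole percent.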