Gene Information |
Locus tag | Amuc_0953 |
Symbol | |
ID | 6274210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1134988 |
End bp | 1139232 |
Gene Length | 4245 bp |
Protein Length | 1414 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613007 |
Product | sulfatase |
Protein accession | YP_001877566 |
Protein GI | 187735454 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
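The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, and the function name `cds_lengths` is hypothetical, not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a stop
    codon, so it does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
print(cds_lengths(1134988, 1139232))  # (4245, 1414)
```

The result matches the Gene Length (4245 bp) and Protein Length (1414 aa) rows.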
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.462358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCCGC ACCGTTTTTT TTCCGTTCTC CTCTTTTTTC TTCTGTTCCT TGCTCCAGCC AGGTCTGCTG ATGTTGAATA TGTCTGGAGG GGTGTCGATT ACCAGTGGGG TTCCCTGTCC AACTGGAGTG TCGGAGGCAT TGCCGCCTCT TCCGCGCCAG GGGCCGCCGC TTCCGCCTAT GAACATTGGA TGGTAACCAA TGGAACGGAT TCCGCGGGAA GGACGGATGT CGGCGGATTG GGCGCGGGGG GCCGCTATTT GAAGGGCGTC AGGATTGAGG GGCTTAACAG CCAGCCGGAA GGCAAAATCC CATTGTTCAT TAAAAACACG AACAAGGATG TATATTTGCG CGTGGAGGAG GGCGGCATTA CCGTGGAAAA TGCCGGAGAA GGCGGTTATT CTGCGGATTT CGGGGTCGCC CAGCTGCGCG TAGCCGCGGA CCAGGAGTGG CATGTGGCTG AAGGGCGCAG CCTGTATGTG GGGCATGACG ATGACGCTCC GTCCGGGGGA TTGTGTTCCC TGACTTCGGA AGGCGATGTT CCGCGGCGCG TGACAGTGAC GGGCGGAGGC GCCGTGCGCA TCGGGGAAGG GATGCTGCTG AACAATATTT CCGGCCTCAT CGGTTTTGTG CTGAACGCCG GAAAGGGAAT TCCGACGCTG GATCTGGCGG ACCGGGGCAT GGGCAATACG GTGACGGTGG AAGACGCTGC CCGCCTGGAA GGCATGTCTT TGTACCAGGG GGCCCTGGTG ACGCGGGAAA ACGCGTCCGT GACTTTTTCC GGAACGGAGG CCAAGGCGTC CGGACAATGG AACATTGGCG CGGACACGGA GTTGGCGCTG GAAAATTCCA CGCTGGACCT GACGGAGGCG GGTGTGGACG GCAACGTGAT TCTTTCCGGC TCTTCCGGCA TTACCGGGGA TAAGGGAACG CTGCGGCAGA CGCTTCTGGA TGATGCCCGG GTCACTTATA CGGACAGGAA TGTCAAGGCG GGGGAAATCC GGAGCGTAGG CAGCAATGTG ACGATAACGC TGAATAATTC CTCCATGCAT TTTGACGGAG AAATTCCGGA GGTGGACCTG GTGGTGCAGG GCAGCTGCGC ACTGGGCGGC AGCGGCGCCT TTTCAGGAAC GATTACTTAT GCGCCGGGAG GCACCCTGAC TGTGGATGGG GATATTTCCC TGCCCAGTTC ATCCCTGACG CTGGGGCGTC TCCATGTGGC TACGCTGGAC GGCGTGCCGC AAAACTCCGC CGTCCGGCCG GAAACCCCTG CCCAGCAGGC AAGGGAATAT GTGGTTTTGT TTGATTCCTC CTTTGAGGAT TTGATTTCCG AATGGCCGGG CAGGCATACG CTGGGCGTGC AGTTCCGGCC GGATATGGGC GCCCCCGTCA CCGTAAAGGA ACTGCCTGAA GGGGCGGTTT CATGGAATTA TGATGCCGCA TCCGGCTGGC TGACGCTGCA GACGGATGTA GCGGAAAAGC CGGACATGCC GGCGCATGTT TCCGGAGCCA GGCCCAACAT TATTTTTGTC CTGGTGGATG ACATGGGCTG GGGGGATATG AGCGTCAACT GGACGCAGCA GGATAAAAAC GGCAGGCAGG TGACCCGCAG GAATGAATTC AAGACGCCTA CCCTGGACAC GATGGCGGCG GAGGGCATGC AACTGAGGCG CCATTACAGT GCGGCTCCGG TATGCGCTCC GGCCCGCGCT TCCCTGCTGC TGGGCGTTCA CCAGGGGCAT TCCCGGGTGG TGCGCAACAA TACGTTTGAT TACCCTATCG AGAATTCCCA TACGCTGGGA 
ACGCTGCTGA AAAGCGCGGG TTACCATACG GCGGCCATCG GGAAATGGGG CGTGGGCGGA GGCGGACAGA GCGGCAAGGC TCTGACCGCG GCTCCCCACA TGCGCGGGTT TGATTATTTT TACGGAATCA TCAAGCATTT GGCGGGGCAT TACCATTATT TGACGGCGGG TTCCCAGGAT ATTTACGAAT ATAATGACCA GGCCGCGGCT CCGGCGTGGG AGAGCGTCAG CGCGAAAGTT CCGGGCACCG CCTATGATAC GGATTTGTTC GGGGCCCGCG CCAAACAGTG GATTGTGGAC CACCATGAGA AAACTCCTTC CCGGCCTTTC TTCCTGTATC TGGCTTTCCC GGCTCCCCAC GGCTGTCTGT CCGCTCCCGC CTGTGCCTAT CCGGCGGGGC TGGGAAGGGA CGGCGGCCTC CAGTGGGAGA AAAAAGACGG CTATGAAGCC TGCAATACCG CTGCGGACGG ATGGTCGGGG GTGGAGGGGG ATTTTTCTCC GGATACCTAT ATCTACCCGG AACATGAGGT TTTTGGAAGA ACGGAATCCC GCCACGCCAC GATGATTCGC CGGGTGGATG ACGTCCTGAA GGACATGATC CAGCTGCTTA AGGATTTAGG CATTGATGAC AATACGATGA TTGTCTTCAC GTCCGACAAC GGCCCCCACA ATGAACCCGG TTCCGGCAAT TACGGCAACG GCGCGCAGGA TCCCGCGTAT TTCATGAGCT ACGGGATGAT GGACGGCATT AAGCGCGATT GCTGGGAAGG CGGCATGCGT GTTCCGGCCG TGGTACGCTG GCCCGGTGTG GTTCCCAGCG GAATCAGTTT GAATGCCAGC CAGTTCCAGG ACTGGATGGC TACTTTTGCG GACGTGGCCG GAGTGCCCGT GCCGTCCCGT TGTGACGGCG TTTCCCTGCT GCCTACGCTG GCAGGGGTGC CGGAACGCCA GAAAACGGGC GTTATTTATG CGGAATATGC CTATAGCGGG AGCACTCCGA ATTACAGCGA TTTCCTCCAG CAACACAGGG GGAGAGGGCG GCAGCAGCAG CAGATTATAT TTGTGGACGG CTTCAAGGGG ATTCGGATGG GGAATGCCGC CCCCGCTACG GATTTTGAAA TCTACGATAC GGAAAAAGAC CCGCAGGAAG CGGCGAATCT GGCGGCATCC CGTCCGGACC TGCAGGAGAA GATGAAGGCG CAGGCGCTGC GCTCCCGCCG TTCCTCCCCC ATTGCCTCCA CCAGTTTTGA CACCAGTTAT ATCGCGCCCG TTTCCGCCCC CGCTGGCCTG CGGGAGGGAA GGCTGCGCTG GCGCGCGTGG AACCGCTCTT TTGACTGGGT GCCGGATTTC CGGCAGCTGG AGGAGGGGCC CTGTTCTACG GGTGCGACGG ATGTTTCCGA CGTGCTCAGC GTCAAGGCGG GGCTTTCCGG TCAGAAGGGG GTGGAATTGA CGGGCTATCT CAGGGTGCCT GTCACCGGGG AATATCAATT TTACCTTCAG ACGGATTCCA ATGAAGGCTC CAGGGCTTTT GTCCATTTGC ACGATATGCA GCTCATTGAT GCGGATTATG CCTATACTCC CGGAACGCAG GCTGCCTCCA ACGCGCGGGA AGGCGTTTCC GGCGATGTGC AGCCCAATGC CGTTCAGACG GTGAAGCTTA CCGCGGGGCT GCATCCCATC CGCATCGGCT ATGTGGGGAA GGCGGCTTCC TCCTCCCTTT CCCTGCAATG GGAGGGGCCG GAAACCAGCG GCAGGGAACC CATTCCCGCC AGCGCGTTTT CCTATGAATA CGTGAATCCC TTCAATCTGG 
AAAAGACGGA GGAAACGGTA GGATGCGCCG CCTCCGGTAC GGGGCTGACC GTGCAGACGC ATCTTCCCTG GACAGTTTCC TGCGACCAGC CATGGGTTAC GGTTTCCCCC GTTTCCGGTT CCGGAACCTC AGTGCTGGAT ATTGCCGTGG AAGCAAACGG GCTGCAAACG GAGCGGGAAG CCGTTGTCAC CGTCGTATGC GGCGGGGAGC AGAGGACATT CACGCTGCGC CAGGAAGCTG CTCCTGCCCC GGCAGGGTAC GACAAGTGGA AGCAGGACCA TTTTGCGGAT GGGACGCCGG ATGAGCAAAT GGCCCCGGAT GCCTGTCCCG CCGGAGACGG AATTTCCAAT CTGATGAAAT ATGCTACGGG CATGGATCCC AACCAGCCCT GCGGGAGCGT GACGAAGCTG GCTGTCCGCG AGGAAGCGGG CGGGAAATAC CTCGTGCTTT CCTGGCCTGT GAATCCGGAG GCGACGGATG TGACGTTCCA TGTGGAAAGT TCTTCCGACC TGGAGGAATG GGCTGACGAA GGGGCCGTCA CTCCGGACGG GGCCTGCGGG GAATTCCGCG ATACGGTTGC GCTGGGAAAA GGCTCTCCGG AGCGCCGGTT CCTGAGATTA AAGGTGACGA GATAG
|
Protein sequence | MFPHRFFSVL LFFLLFLAPA RSADVEYVWR GVDYQWGSLS NWSVGGIAAS SAPGAAASAY EHWMVTNGTD SAGRTDVGGL GAGGRYLKGV RIEGLNSQPE GKIPLFIKNT NKDVYLRVEE GGITVENAGE GGYSADFGVA QLRVAADQEW HVAEGRSLYV GHDDDAPSGG LCSLTSEGDV PRRVTVTGGG AVRIGEGMLL NNISGLIGFV LNAGKGIPTL DLADRGMGNT VTVEDAARLE GMSLYQGALV TRENASVTFS GTEAKASGQW NIGADTELAL ENSTLDLTEA GVDGNVILSG SSGITGDKGT LRQTLLDDAR VTYTDRNVKA GEIRSVGSNV TITLNNSSMH FDGEIPEVDL VVQGSCALGG SGAFSGTITY APGGTLTVDG DISLPSSSLT LGRLHVATLD GVPQNSAVRP ETPAQQAREY VVLFDSSFED LISEWPGRHT LGVQFRPDMG APVTVKELPE GAVSWNYDAA SGWLTLQTDV AEKPDMPAHV SGARPNIIFV LVDDMGWGDM SVNWTQQDKN GRQVTRRNEF KTPTLDTMAA EGMQLRRHYS AAPVCAPARA SLLLGVHQGH SRVVRNNTFD YPIENSHTLG TLLKSAGYHT AAIGKWGVGG GGQSGKALTA APHMRGFDYF YGIIKHLAGH YHYLTAGSQD IYEYNDQAAA PAWESVSAKV PGTAYDTDLF GARAKQWIVD HHEKTPSRPF FLYLAFPAPH GCLSAPACAY PAGLGRDGGL QWEKKDGYEA CNTAADGWSG VEGDFSPDTY IYPEHEVFGR TESRHATMIR RVDDVLKDMI QLLKDLGIDD NTMIVFTSDN GPHNEPGSGN YGNGAQDPAY FMSYGMMDGI KRDCWEGGMR VPAVVRWPGV VPSGISLNAS QFQDWMATFA DVAGVPVPSR CDGVSLLPTL AGVPERQKTG VIYAEYAYSG STPNYSDFLQ QHRGRGRQQQ QIIFVDGFKG IRMGNAAPAT DFEIYDTEKD PQEAANLAAS RPDLQEKMKA QALRSRRSSP IASTSFDTSY IAPVSAPAGL REGRLRWRAW NRSFDWVPDF RQLEEGPCST GATDVSDVLS VKAGLSGQKG VELTGYLRVP VTGEYQFYLQ TDSNEGSRAF VHLHDMQLID ADYAYTPGTQ AASNAREGVS GDVQPNAVQT VKLTAGLHPI RIGYVGKAAS SSLSLQWEGP ETSGREPIPA SAFSYEYVNP FNLEKTEETV GCAASGTGLT VQTHLPWTVS CDQPWVTVSP VSGSGTSVLD IAVEANGLQT EREAVVTVVC GGEQRTFTLR QEAAPAPAGY DKWKQDHFAD GTPDEQMAPD ACPAGDGISN LMKYATGMDP NQPCGSVTKL AVREEAGGKY LVLSWPVNPE ATDVTFHVES SSDLEEWADE GAVTPDGACG EFRDTVALGK GSPERRFLRL KVTR
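The sequences above are printed in space-separated groups of ten characters, so the spaces need to be removed before any computation. A minimal sketch, assuming the sequence has been copied into a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(grouped):
    """Remove the spaces and line breaks between 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two groups of the gene sequence in this record.
prefix = clean_sequence("ATGTTCCCGC ACCGTTTTTT")
print(len(prefix))            # 20
print(prefix[:3])             # ATG, the start codon
print(gc_fraction(prefix))    # 0.45 for this 20-base prefix only
```

Applying the same two helpers to the full gene sequence gives a value to compare against the GC content row; the figure for the short prefix is not expected to match it.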