Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0491 |
Symbol | |
ID | 6274733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 580886 |
End bp | 582388 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612541 |
Product | sulfatase |
Protein accession | YP_001877110 |
Protein GI | 187734998 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.244971 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACAT TTTACAATAT CACTCTCTCT ACCGCCTTTT CCCTGCTGAT GCTGTCCGTG GCGGAAGCGG CACGGCCCAA TGTGGTTCTC ATCAATGCGG ATGACCTTGG CTGGGCGGAA GTGGGCTGCT ACGGCCAGAA AAAAATTAAA ACCCCGAACA TTGACAAGCT GGCCTCCGAA GGACAGCGAT GGGTTTATTT CTATTCCGGA GCTCCGGTTT GTTCCCCTTC CCGCAATGTG CTGATGACGG GCAAGCATAC GGGCAACTGC GACGTACAGG ATTTGAAACG CGTGGACGCG GGCGAAAACT GGCGCGACCT CAAAGGAGAC TGGCCCATCA GAACGGAAAC CTACACTCTA CCGGAAGCCA TGAAAAAAGC CGGTTACGCC ACAGCGGTGT TCGGTAAATG GGGTATTGGG GATTTCGGTT CCACCGGAGC GCCGGACAAA CACGGCGTGG ACAGGTTCTA TGGCTACACG GACCAGAAAG CCTGCCACAC CTACTATCCT CCATACCTCT GGAATGACGG AAAGAAGGAA GTTCTCAACA CTTCCCTGAC AGCCGCCACT ATCGGACACG GTTCCCAGCC CAAAGGGGAA GTTCTGGCGG ACACCTACCG CGCGGAACAA CACAGTTCCG ATCTTATTGC GGATAAAATG CTGGAATTTG TGAAGGAAAA GGCCCATGGC AAACAACCGT TTTTCCTGTA TTACGCCCCG CTGGAACCCC ATGTGGCCAT GCAGCCTCTT CAGGAATGGA TTGACCGCTA TCCCCGCGAA TGGGACAAAT CCCCCTACCG CGGCAACCGG GGCTATCTGC CCCATCCCCG CCCCCGGGCC GCCTATGCAG GCATGATTTC CCAGATGGAC CACAACGTAG GACGCCTGCT GGACACGCTG AAAGCCTGTG GCCTGGACAA AAATACCATC GTCATTTTTA CCAGCGACAA CGGCACCACG CATGATGCAG GGGGGGTGGA CCACCGGTTC TTCAACTCCG TAGCCGATCT CAAAGGTTTG AAAGGGCAGC TTTATGAAGG CGGTATACGT GTCCCCGGCA TTATCCGCTG GCCTGGGAAA ATAGCCCCGG GAAAAACCAT CACCCAGCCG GCCTTCCATG CGGACGTGAT GCCTACACTG TGCGCTCTGA CAGGAGCGGA TGCAGGTTCT CCGCTGGGAA CGGACCTCTC CCCTGTCCTT CTGGGCAAAA AATCCGCTCT GCATGACAGG AAGCCCCTGG TCTGGGCAGG GGGAGGCTAC GGCGGCCAGG TAGCCGTGCG TTTCGACTCC AAGAAAGTCA TCCGCCGCAA CCTGTTTCCC GGTAAAAAAC CGGACAACTG GGAAGTGTAC GATATCGTGA AAGACCCCGC AGAGAAAAAT AATATCGCCG CAGAAAACCG TGACCTTATC AACAGAGCCA TCGCCATTCT GGACAGGGAA TATCAACCCG CGCCCGGCTT CCAGGCCCTG CGTTACAAGG CCCCGGAACA GGTAGCCGAA TAA
|
Protein sequence | MVTFYNITLS TAFSLLMLSV AEAARPNVVL INADDLGWAE VGCYGQKKIK TPNIDKLASE GQRWVYFYSG APVCSPSRNV LMTGKHTGNC DVQDLKRVDA GENWRDLKGD WPIRTETYTL PEAMKKAGYA TAVFGKWGIG DFGSTGAPDK HGVDRFYGYT DQKACHTYYP PYLWNDGKKE VLNTSLTAAT IGHGSQPKGE VLADTYRAEQ HSSDLIADKM LEFVKEKAHG KQPFFLYYAP LEPHVAMQPL QEWIDRYPRE WDKSPYRGNR GYLPHPRPRA AYAGMISQMD HNVGRLLDTL KACGLDKNTI VIFTSDNGTT HDAGGVDHRF FNSVADLKGL KGQLYEGGIR VPGIIRWPGK IAPGKTITQP AFHADVMPTL CALTGADAGS PLGTDLSPVL LGKKSALHDR KPLVWAGGGY GGQVAVRFDS KKVIRRNLFP GKKPDNWEVY DIVKDPAEKN NIAAENRDLI NRAIAILDRE YQPAPGFQAL RYKAPEQVAE
|
| |