Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0121 |
Symbol | |
ID | 6274910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 149378 |
End bp | 150958 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612166 |
Product | sulfatase |
Protein accession | YP_001876747 |
Protein GI | 187734635 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAT CCGCATTACA CCTGATGCTT GCCATGGCGG CGGCAGCGGC CTGCCCGGCC GCAGTCACGC CAACGGTCAA GCCACCCAAG GCCATCGTCA TGATTTACGC TGACGACCTC GGCTATGGAG ACGTAGGCTG CTATGGAGCC AAGGGAATTC CCACCCCCGC CATCGACAAG CTTGCCGAGC AGGGCTGCCG CTTCACGGAC GCCTATTCCA CCACATCCGT CTGCACCCCT TCCCGCTATG CCCTGTTCAC AGGGGAATAT CCGTGGCGCA AGGAAGGCAC GGGCATCCTG CCGGGAGATG CCGCCCTCAT TATCGATACC AAGAAGCCCA CCCTGCCCAA GATGCTCCAA TCCCACGGCT ACAAAACCTA CATGATAGGC AAATGGCACC TGGGCCTGGG GGAAAAGGGG AAGAAGATTG ACTGGAACAA ACATATCTCC CCCAGCCCGA ACGAAATCGG ATTTGACGAG AGCTTTATCT TCGCCGCTAC GGGCGACCGC GTTCCCTGCG TGATTCTGGA AAACGGCAAT GTCCGCAACC TGGACCCGAA CGACCCCATT GAAGTATCCT ACAAGCACAA CTTCCCCGGG CTTCCCAATG GCAAGGATAA TAAGGACCAG CTCAAACTCA TGTGGAGCCA CGGACACAAC CAGGCTATTA TCAATGGGAT CGGACGCATC GGGTTCATGA AGGGCGGCAG AAGCGCCTTG TGGAAGGATG AGGAAAACGC CGATATCATT ACGGATAAGG CCATTGAATA CATTCAAAAA AGCGCCAAAG CCAAGGAACC GTTTTTCCTG ATGTTCGCCA CGCATGACAT CCACGTGCCG CGCTGCCCGG AAAAACGCTT TGTGGGCAAG AGCCGGCACG GCGTGCGCGG TGACGTGACC GTGGAACTGG ATGACTGTGT GCGCCGCATT ACAGAGGCTC TGCAACAGGC CGGTCTGGAA AAAGACGCCC TGGTGATCTT CTCCAGCGAC AACGGTCCCG TGCTGGATGA CGGCTACAGG GATTTCGCCG TCCGGGACAA CGCCACCCAT TCCCCCGCCG GCCCCTTCCG CGCAGGCAAA TACAGCATTC TGGAAGGAGG TTCCCGCATT CCGTTTATCG TCAAATGGCC CGGCGTGATC AAACCCGGAA CCACGAGCAA AGCCCTGCTC AATCAAATGG ATTTGGGGGC CTCCCTGGAA CAGCTGCTGG CCCCCGGCAA GGCCAATTCC TTCCGCGACT CTGAAAACGT GATGCCCGCC CTTCTGGGCA AATCCGCCAA GGGGCGTGAC TACCATGTCA TCAACAGCAC CGGCAAGGCA TTGGCGATTC GCCACGGCAA ATGGAAATTC ATTCCCGCCG GCGTGGCCAT TCGCGACGGC ATCAACGGAG CCTCCGCAAA AATGAGCAAG TCCCCGGAAG GAGGAAGCCT CTTTGACCTG GAAAAAGACC CGAAGGAACT TGACAACGTA GCCTCCCAGC ATCCGGACAT TTGCGAACAG ATGAAAGCCA AGCTTGAGGA AATCCGCCAG AGGCCCGAAA CCAAGGCTGA CCAGGAGGAC CTGCTTCCCT TGGACGACTA A
|
Protein sequence | MNKSALHLML AMAAAAACPA AVTPTVKPPK AIVMIYADDL GYGDVGCYGA KGIPTPAIDK LAEQGCRFTD AYSTTSVCTP SRYALFTGEY PWRKEGTGIL PGDAALIIDT KKPTLPKMLQ SHGYKTYMIG KWHLGLGEKG KKIDWNKHIS PSPNEIGFDE SFIFAATGDR VPCVILENGN VRNLDPNDPI EVSYKHNFPG LPNGKDNKDQ LKLMWSHGHN QAIINGIGRI GFMKGGRSAL WKDEENADII TDKAIEYIQK SAKAKEPFFL MFATHDIHVP RCPEKRFVGK SRHGVRGDVT VELDDCVRRI TEALQQAGLE KDALVIFSSD NGPVLDDGYR DFAVRDNATH SPAGPFRAGK YSILEGGSRI PFIVKWPGVI KPGTTSKALL NQMDLGASLE QLLAPGKANS FRDSENVMPA LLGKSAKGRD YHVINSTGKA LAIRHGKWKF IPAGVAIRDG INGASAKMSK SPEGGSLFDL EKDPKELDNV ASQHPDICEQ MKAKLEEIRQ RPETKADQED LLPLDD
|
| |