Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1755 |
Symbol | |
ID | 6275045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2136730 |
End bp | 2138418 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613818 |
Product | sulfatase |
Protein accession | YP_001878354 |
Protein GI | 187736242 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTT CCCGTATTGC AGCCTTTCTC ATCCCGTTCC TGCTGGGAAG CGCCTGCTCT GCTTCCGCAG CTTCCGTAAA AGCATCCCGC GCGGCAGAAC CGAAACGCCC CAACATCATC CTCTTTCTGG TGGACGACAT GGGCTGGCAG GATACATCCC TGCCCTTTTG GCGGGAAGAG GACGGCACTC CCAAACCTAC TTTCCTGAAC AAGCGCTACC GCACGCCCAA CATGGAGGCT CTGGGCAAGC AGGGCATGGT CTTCACGAAC GCCTACGCGC AGCCTATCTG TTCCCCCAGC CGATGCAGCC TCATGTCCGG CATGAACTCC GCCCGCCACC GCGTCACGAA CTGGACGCTC CTGCGTGACC AAACCACGGA CGCAGGCCAT CAAGCGCTCA AGGCTCCCGC AGACTGGAGC GTCAACGGCA TCCAGCCGGC AGGAACCAGG GCATCCGGAA CCACCCATCT TCCCCTGACG GAAGAAAAAA TCCAGTACAG GATGGAAAAA CCCTTCACCC AGGTGCTTGG GCTGCCCGCC CTGCTCAAAA AACAGGGGTA TACCACCATC CACTGCGGAA AAGCCCATTT CGGCTCCAAA AATACGCCCG GAGCCAATCC CAGGCTCTTC GGCTTTGACT ACAATATCGC CGGCACGGAA ATAGGCGGCC CGGCGGACTA CCGGGGTTCC CGGAAATACG GCACGGGGAA TTTCCATGTC CGCGGACTGG ATGAAAATAA TTACTATGAA AACGACACAT TCCTGACGGA AGCCCTGACG CAGGAGGCAC TGAAACGCCT GGACGCCATC CGGAAAAACC CCAGGGAGGC GGACAAGCCT TTCTACCTGT ACATGTCCCA CTACGCCATC CATGCCCCCT TCGACATGCG CGGTTATGAC AAGAGGTTTG CGGAAGAGTA CTCCAACCCC AATGACGGCC ACAAATGGTC CGACAACGAA AAACGCTATT CCGCCCTGAT CCAGGGGATG GACAAGAGCC TGGGCGACAT CCGGGAATAC CTGAAAAAAA ACAATCTGGA TAAAAATACC GTCATCATCT TCATGGCGGA CAACGGGGGC CTTGCCATCA GCGGGCGCAT GGGCAACAAG GAATCCAATT ACCCGCTTAG CTTCGGCAAG GGTTCCAACC GGGAAGGCGG CATCCGGGAA CCGATGATCG TCTACTGGCC CGGGGTCACG AAGGCGGAAA GCGTCTGCAC TACTCCCGTC ATCATTGAAG ACTTTTTCCC CACCATTCTG GAAATAGCAG GAGCAAAAAA AATTCAGGCG CCTCAGGTTG TGGACGGCAA AAGCTTTGTT GCCCTGCTGA AAGGCGGCAG CATGAATCCC AACCGCTCCC TGCTCTTCCA CACGCCCAAC GTATGGGGGG AAGGGAACGG AAACAACTCC CTTTATTCCC CCAGCACGGC CATGCGCCAG GGGGACTGGA AGCTTATCTA CTGGCACCCT GACCAGAAGT TCGAACTCTT CAACCTCAAG GAGGACATCA GCGAAGAGCA CAACCTGGCG GAACAGCAGC CGGAACGCGT TAAGGTCATG GCCAGGACCA TGACCACCCT GCTCAAGGAG CGCAAGGCCC AAATGCCCAC CTACAAGAAG AATAATCCCG CCGGAGCCCG TGAAGGCGCT CCCGTACCGT GGCCGGACCA GGCGGCGGCC AGGCTGTAA
|
Protein sequence | MNISRIAAFL IPFLLGSACS ASAASVKASR AAEPKRPNII LFLVDDMGWQ DTSLPFWREE DGTPKPTFLN KRYRTPNMEA LGKQGMVFTN AYAQPICSPS RCSLMSGMNS ARHRVTNWTL LRDQTTDAGH QALKAPADWS VNGIQPAGTR ASGTTHLPLT EEKIQYRMEK PFTQVLGLPA LLKKQGYTTI HCGKAHFGSK NTPGANPRLF GFDYNIAGTE IGGPADYRGS RKYGTGNFHV RGLDENNYYE NDTFLTEALT QEALKRLDAI RKNPREADKP FYLYMSHYAI HAPFDMRGYD KRFAEEYSNP NDGHKWSDNE KRYSALIQGM DKSLGDIREY LKKNNLDKNT VIIFMADNGG LAISGRMGNK ESNYPLSFGK GSNREGGIRE PMIVYWPGVT KAESVCTTPV IIEDFFPTIL EIAGAKKIQA PQVVDGKSFV ALLKGGSMNP NRSLLFHTPN VWGEGNGNNS LYSPSTAMRQ GDWKLIYWHP DQKFELFNLK EDISEEHNLA EQQPERVKVM ARTMTTLLKE RKAQMPTYKK NNPAGAREGA PVPWPDQAAA RL
|
| |