Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1118 |
Symbol | |
ID | 6273952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1336709 |
End bp | 1338595 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613169 |
Product | sulfatase |
Protein accession | YP_001877725 |
Protein GI | 187735613 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGTT ATTATTGGGG GAGGTTTTGG CCGTTCCTCC TGTTTGCGTT CGGCATTGAG GCGGTGGAAA ATCTGTTTAC GGTATTTTTC GAATACCGCA ACATGGATTT CGGGTTGCTT CCCCTCCTGA AAACGGCGTA CGTTTTTGTG ACGGAGTTTG CCGTCACCAT GTGCTACTGG CTTATTCCCT ATGCCGTTTA TTTGTGGATT CTGCCGCGCG GGAAAGCAGG AGGGAAGGCG GACAGGTGGC TCACCTCCGC ATGGTTTTTC CTGTTTGTGC TCGCCAATCT GTTTGAAGAT GTGGCGGAAG CCTTTTTCTG GAACGAGTTT GAAGCCAGCT TCAATTTCAT TGCGGTGGAT TACCTGGTTT ACACCAAGGA GGTTATCGGG AATATTTACG AGTCCTATCC CATCATTCCT ATTCTGGGCG GCATTCTGGC GGCGTCCGTT CTGGCCGCCT GGGGAATGAA GAGGTTCCTG CTTCCCAGGA ACGGGGCGGT TCCCGCCGGA TGGAAACGGG GCTGTGTGGT GCTGTTCCTG CTGGCCTGCG TCACGGGGGG ATATTGGCTG GTGGATATCA AGGATGCGGA TGCCGTGAAC AACCGTTATA ATTCGGAAAT GGCCAAGGAT GGCCTTTACA GCCTGTTCAG CGCCTTTCTC AAGAATGAAC TGGATTACCG CGCTTATTAC CGGACGCTGC CGGATGCGGA AGCGGCGGCG TTTCTGGCCC GGGAGTTCAC GGCGGATGAC ACGTCCGTGC CGGAGGCTTC GTCCGGCAGC GTAAAGAGGC AGGTGCGTCC TTCCGAGGGG GCTATCCGCC CGAATGTCGT GGTTGTGGTC ATGGAGAGCA TGGGAGCGGA ATTTTTGAAC GAGTGCCGGG AAGACGGGGC TGACGTCACT CCGTGCCTGA GCCGTCTGGG AAAGGAAGGC ATTTTTTTCC CGAATACTTA TGCCACGGGC ACCCGTTCCG TACGCGGTCT GGAAGCAATC AGCACATCCC TGCCGCCGCT TCCCGGCATG TCCATCCTTC GCCAGGAAGG AAACGAGCAT TTGCAGACCA TAGGTTCCAT ATTCAGGGAC AAGGGATATG ATCTCAAATG GATTTACGGC GGCTACGGGT ATTTTGACAA CATGAATTAT TTTTTCGGGA ACAACGGGTT TCAGGTTCTG GACCGTAATT CCATGGCTGA TTCCGAGGTG ACCCATTCCA CCATTTGGGG CGTTTGCGAT GAAGATTTGT TCCGCCGCGC CGTACGGGAG GCGGATGAAT CCTGCGGACG CGGCAAGCCG TTTTTGCAGG TGGTGTTTAC CACGTCCAAC CACCGCCCCT ACACGTATCC GGAAGGGCGC ATTGACATTC CTTCCCACAC GGGGCGCATG GGGGCCGTGA AATACGCGGA TTATGCGGTA GGCGCCTTTG TGGAGGAGGC CAGAACCAAA CCCTGGTTTG ACAACACGCT GTTCGTGTTC GTAGGAGACC ACGGCGCCGG GAGCGCGGGA AAGCAGGCCC TCAATCCGGA AACGCACCGC ATTTTTTCCA TTTTCTACGC TCCGGCTCTG CTGAAACCGG AACGGCGGGA CACTCCCGTG AGCCAGATTG ACGTGCTGCC CACCCTGCTG GGGCTGTTGA ACTGGCCGTA TGATGCGGCC TTTTATGGGA AGGATGCCTT GAAGCCTTCC TATCAATCCC GGTATTTTGT GAGCAATTAC CAATATATCG GCTATTTGAA GGGGAAAGAC ATGGTGGTGC TCAAACCCCA GCGCGGAGTG GAATTTTTCC GGGACGGAGA GGCCGTTGAG CCGGACGGGC GGATGAAAGA GCTGGAAAGG GAAGCGGTTT ATTACTATCA GCACGCTTCC GGCTGGCGCA CCAGTTTGAA AGAATAA
|
Protein sequence | MIRYYWGRFW PFLLFAFGIE AVENLFTVFF EYRNMDFGLL PLLKTAYVFV TEFAVTMCYW LIPYAVYLWI LPRGKAGGKA DRWLTSAWFF LFVLANLFED VAEAFFWNEF EASFNFIAVD YLVYTKEVIG NIYESYPIIP ILGGILAASV LAAWGMKRFL LPRNGAVPAG WKRGCVVLFL LACVTGGYWL VDIKDADAVN NRYNSEMAKD GLYSLFSAFL KNELDYRAYY RTLPDAEAAA FLAREFTADD TSVPEASSGS VKRQVRPSEG AIRPNVVVVV MESMGAEFLN ECREDGADVT PCLSRLGKEG IFFPNTYATG TRSVRGLEAI STSLPPLPGM SILRQEGNEH LQTIGSIFRD KGYDLKWIYG GYGYFDNMNY FFGNNGFQVL DRNSMADSEV THSTIWGVCD EDLFRRAVRE ADESCGRGKP FLQVVFTTSN HRPYTYPEGR IDIPSHTGRM GAVKYADYAV GAFVEEARTK PWFDNTLFVF VGDHGAGSAG KQALNPETHR IFSIFYAPAL LKPERRDTPV SQIDVLPTLL GLLNWPYDAA FYGKDALKPS YQSRYFVSNY QYIGYLKGKD MVVLKPQRGV EFFRDGEAVE PDGRMKELER EAVYYYQHAS GWRTSLKE
|
| |