Gene Information |
Locus tag | Amuc_2022 |
Symbol | |
ID | 6275625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2455475 |
End bp | 2456950 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642614082 |
Product | Alkyl sulfatase and related hydrolase-like protein |
Protein accession | YP_001878613 |
Protein GI | 187736501 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
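The start and end coordinates, gene length, and protein length in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, although it matches the reported gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2455475
end_bp = 2456950

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1476 0 491
```

Under these assumptions the result agrees with the reported 1476 bp gene length and 491 aa protein length.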
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0010234 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACAA AAATGAACTA TTTGACAGCC ACGCTTCTGG CAATATTTGC ATGGGGCGGT ACAGGCGTCG GTATGGCAAA ACAATCCAAA AAGGCGCTAA GGCATCCCGT TGAAGACAAC AAAAAATCTC CCGGAGTTTA CGATGCGAAT CAAAATGACC GTCAAGGCAT TCAAGATGCG GAACAAAGCC CCGGGGACAA ACCGGTTACG TGGAGCATTT CCCGTCCCAA TGCCAAAAAC GCGCTGTCTG AAGTCACAAA AGGAATTTAC CAGATTCACG GCGGAGGTTT TCCGAACATA ACTATTATTG AAGGACGGGA AGGCATCATG ATCATTGCCC CTTTCGTCCC GAAGGAAACC ATGGCCGAGA GTCTTGACCT CTATTACCGG AAGGCGGGAA AACGGCCAAT CAAGGCTGTT GTAGATGCAC ATCCACATAC CAACTATTTT GCCAGTACCA AAAGAACGGC ATCCGGGCTG GACATAGACG GCATAGAAAT GGAATTCATG GCGGTTCCCG GAATGGGGGC CTCTTCTGCC GCACTGATGT ATTTTCCCCA ATTCAAGGCG CTTTTTTATG GAGAGGACAC GGCAAGCGCC ATACATGATA TCTGCACTCT GGGAGTATCC AAAATAAGGG ATGCAAAAAA CAGGTGGAAA GCCCTTGATC AAGCCATTCA GCGTTATGAA GACAAAATAG AGATTTTGTT CTCACAACAC CATTGGCCCA GAATAGGAAA AGAAAATATC AAACGGTTCC TGGCCAGGGA ATGCCGCAAC TGCAAATACA TGCATGACCG GATATTGAAC CTGATTAGCA AAGGGTATTC CCCCGCAGAA ATTGAGGGAA TAATCAATCC GATTCCGGAA AGCGGCAGAA TCAGGAATAG CCCTCCGGAA ATCACCGGGC AAAAGGACTA TCGTCAAGAC GCCGGGACAT TAAGGAACGT TATGTCTGCC CATCTAAAAA GAGGAAACAT AAGGAATCTG GTAAGAAATA TGTTGAAACA GTTCGGTTAT CAAACGGAAC CCGTATTCCG GAACGATGAA CTTCTCGTCA ATGCCGGAGA ACCGGGAATC GGGCTGCTTA AAAACTCCCG CGAACTTACG TTGACGGACA CTTTATATGC CCTGACTCCG GAACTGCTGT TCGACTATTT GGGCGTCAGC CTGAATAGTG AGAAATCCAA AGGGAAAAAA CTGGCTTTCA ACTGGATTGC CCAAAACGGA AGATCATACG GCTTCTGGAT TGAAAACGAA GTGCTGATGT ACCGCGAAGG AAAACTGGTC AAACATCCCG ACGCAGTCAT CACCGGAGAC AGGCTCCACT TCGCCCTGGT CGCCATGCGG GCAATGCCCT TAAAGACAGC TCTGGACAAA GGCATGATTA AAATTGAAGG CAATACGGAT AAATTCAGGG AATTGCTCGG ATGCATGGAT AAGTTCCATG GAAATTTCCA TGCCATAACA CCCTGA
|
Protein sequence | MKTKMNYLTA TLLAIFAWGG TGVGMAKQSK KALRHPVEDN KKSPGVYDAN QNDRQGIQDA EQSPGDKPVT WSISRPNAKN ALSEVTKGIY QIHGGGFPNI TIIEGREGIM IIAPFVPKET MAESLDLYYR KAGKRPIKAV VDAHPHTNYF ASTKRTASGL DIDGIEMEFM AVPGMGASSA ALMYFPQFKA LFYGEDTASA IHDICTLGVS KIRDAKNRWK ALDQAIQRYE DKIEILFSQH HWPRIGKENI KRFLARECRN CKYMHDRILN LISKGYSPAE IEGIINPIPE SGRIRNSPPE ITGQKDYRQD AGTLRNVMSA HLKRGNIRNL VRNMLKQFGY QTEPVFRNDE LLVNAGEPGI GLLKNSRELT LTDTLYALTP ELLFDYLGVS LNSEKSKGKK LAFNWIAQNG RSYGFWIENE VLMYREGKLV KHPDAVITGD RLHFALVAMR AMPLKTALDK GMIKIEGNTD KFRELLGCMD KFHGNFHAIT P
|
| |
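The relationship between the gene sequence and the protein sequence can be illustrated by translating a short excerpt. The following is a hedged Python sketch, not the pipeline that produced this record: it builds the codon table by hand, strips the spaces that group the sequence into blocks of ten, and translates the first three blocks and the final block of the gene sequence. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in its permitted start codons, which this sketch does not model. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 shares these
# amino acid assignments with the standard code; alternative start codons
# are not modelled here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the gene sequence in this record.
first_blocks = "ATGAAAACAA AAATGAACTA TTTGACAGCC"
last_block = "CCCTGA"

print(translate(first_blocks))  # MKTKMNYLTA
print(translate(last_block))    # P*
```

The first ten residues match the start of the listed protein sequence, and the final block yields the terminal proline followed by the TGA stop codon.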