Gene Amuc_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2022 
Symbol 
ID6275625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2455475 
End bp2456950 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content47% 
IMG OID642614082 
ProductAlkyl sulfatase and related hydrolase-like protein 
Protein accessionYP_001878613 
Protein GI187736501 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0010234 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACAA AAATGAACTA TTTGACAGCC ACGCTTCTGG CAATATTTGC ATGGGGCGGT 
ACAGGCGTCG GTATGGCAAA ACAATCCAAA AAGGCGCTAA GGCATCCCGT TGAAGACAAC
AAAAAATCTC CCGGAGTTTA CGATGCGAAT CAAAATGACC GTCAAGGCAT TCAAGATGCG
GAACAAAGCC CCGGGGACAA ACCGGTTACG TGGAGCATTT CCCGTCCCAA TGCCAAAAAC
GCGCTGTCTG AAGTCACAAA AGGAATTTAC CAGATTCACG GCGGAGGTTT TCCGAACATA
ACTATTATTG AAGGACGGGA AGGCATCATG ATCATTGCCC CTTTCGTCCC GAAGGAAACC
ATGGCCGAGA GTCTTGACCT CTATTACCGG AAGGCGGGAA AACGGCCAAT CAAGGCTGTT
GTAGATGCAC ATCCACATAC CAACTATTTT GCCAGTACCA AAAGAACGGC ATCCGGGCTG
GACATAGACG GCATAGAAAT GGAATTCATG GCGGTTCCCG GAATGGGGGC CTCTTCTGCC
GCACTGATGT ATTTTCCCCA ATTCAAGGCG CTTTTTTATG GAGAGGACAC GGCAAGCGCC
ATACATGATA TCTGCACTCT GGGAGTATCC AAAATAAGGG ATGCAAAAAA CAGGTGGAAA
GCCCTTGATC AAGCCATTCA GCGTTATGAA GACAAAATAG AGATTTTGTT CTCACAACAC
CATTGGCCCA GAATAGGAAA AGAAAATATC AAACGGTTCC TGGCCAGGGA ATGCCGCAAC
TGCAAATACA TGCATGACCG GATATTGAAC CTGATTAGCA AAGGGTATTC CCCCGCAGAA
ATTGAGGGAA TAATCAATCC GATTCCGGAA AGCGGCAGAA TCAGGAATAG CCCTCCGGAA
ATCACCGGGC AAAAGGACTA TCGTCAAGAC GCCGGGACAT TAAGGAACGT TATGTCTGCC
CATCTAAAAA GAGGAAACAT AAGGAATCTG GTAAGAAATA TGTTGAAACA GTTCGGTTAT
CAAACGGAAC CCGTATTCCG GAACGATGAA CTTCTCGTCA ATGCCGGAGA ACCGGGAATC
GGGCTGCTTA AAAACTCCCG CGAACTTACG TTGACGGACA CTTTATATGC CCTGACTCCG
GAACTGCTGT TCGACTATTT GGGCGTCAGC CTGAATAGTG AGAAATCCAA AGGGAAAAAA
CTGGCTTTCA ACTGGATTGC CCAAAACGGA AGATCATACG GCTTCTGGAT TGAAAACGAA
GTGCTGATGT ACCGCGAAGG AAAACTGGTC AAACATCCCG ACGCAGTCAT CACCGGAGAC
AGGCTCCACT TCGCCCTGGT CGCCATGCGG GCAATGCCCT TAAAGACAGC TCTGGACAAA
GGCATGATTA AAATTGAAGG CAATACGGAT AAATTCAGGG AATTGCTCGG ATGCATGGAT
AAGTTCCATG GAAATTTCCA TGCCATAACA CCCTGA
 
Protein sequence
MKTKMNYLTA TLLAIFAWGG TGVGMAKQSK KALRHPVEDN KKSPGVYDAN QNDRQGIQDA 
EQSPGDKPVT WSISRPNAKN ALSEVTKGIY QIHGGGFPNI TIIEGREGIM IIAPFVPKET
MAESLDLYYR KAGKRPIKAV VDAHPHTNYF ASTKRTASGL DIDGIEMEFM AVPGMGASSA
ALMYFPQFKA LFYGEDTASA IHDICTLGVS KIRDAKNRWK ALDQAIQRYE DKIEILFSQH
HWPRIGKENI KRFLARECRN CKYMHDRILN LISKGYSPAE IEGIINPIPE SGRIRNSPPE
ITGQKDYRQD AGTLRNVMSA HLKRGNIRNL VRNMLKQFGY QTEPVFRNDE LLVNAGEPGI
GLLKNSRELT LTDTLYALTP ELLFDYLGVS LNSEKSKGKK LAFNWIAQNG RSYGFWIENE
VLMYREGKLV KHPDAVITGD RLHFALVAMR AMPLKTALDK GMIKIEGNTD KFRELLGCMD
KFHGNFHAIT P