Gene Amuc_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0491 
Symbol 
ID6274733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp580886 
End bp582388 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content56% 
IMG OID642612541 
Productsulfatase 
Protein accessionYP_001877110 
Protein GI187734998 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.244971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACAT TTTACAATAT CACTCTCTCT ACCGCCTTTT CCCTGCTGAT GCTGTCCGTG 
GCGGAAGCGG CACGGCCCAA TGTGGTTCTC ATCAATGCGG ATGACCTTGG CTGGGCGGAA
GTGGGCTGCT ACGGCCAGAA AAAAATTAAA ACCCCGAACA TTGACAAGCT GGCCTCCGAA
GGACAGCGAT GGGTTTATTT CTATTCCGGA GCTCCGGTTT GTTCCCCTTC CCGCAATGTG
CTGATGACGG GCAAGCATAC GGGCAACTGC GACGTACAGG ATTTGAAACG CGTGGACGCG
GGCGAAAACT GGCGCGACCT CAAAGGAGAC TGGCCCATCA GAACGGAAAC CTACACTCTA
CCGGAAGCCA TGAAAAAAGC CGGTTACGCC ACAGCGGTGT TCGGTAAATG GGGTATTGGG
GATTTCGGTT CCACCGGAGC GCCGGACAAA CACGGCGTGG ACAGGTTCTA TGGCTACACG
GACCAGAAAG CCTGCCACAC CTACTATCCT CCATACCTCT GGAATGACGG AAAGAAGGAA
GTTCTCAACA CTTCCCTGAC AGCCGCCACT ATCGGACACG GTTCCCAGCC CAAAGGGGAA
GTTCTGGCGG ACACCTACCG CGCGGAACAA CACAGTTCCG ATCTTATTGC GGATAAAATG
CTGGAATTTG TGAAGGAAAA GGCCCATGGC AAACAACCGT TTTTCCTGTA TTACGCCCCG
CTGGAACCCC ATGTGGCCAT GCAGCCTCTT CAGGAATGGA TTGACCGCTA TCCCCGCGAA
TGGGACAAAT CCCCCTACCG CGGCAACCGG GGCTATCTGC CCCATCCCCG CCCCCGGGCC
GCCTATGCAG GCATGATTTC CCAGATGGAC CACAACGTAG GACGCCTGCT GGACACGCTG
AAAGCCTGTG GCCTGGACAA AAATACCATC GTCATTTTTA CCAGCGACAA CGGCACCACG
CATGATGCAG GGGGGGTGGA CCACCGGTTC TTCAACTCCG TAGCCGATCT CAAAGGTTTG
AAAGGGCAGC TTTATGAAGG CGGTATACGT GTCCCCGGCA TTATCCGCTG GCCTGGGAAA
ATAGCCCCGG GAAAAACCAT CACCCAGCCG GCCTTCCATG CGGACGTGAT GCCTACACTG
TGCGCTCTGA CAGGAGCGGA TGCAGGTTCT CCGCTGGGAA CGGACCTCTC CCCTGTCCTT
CTGGGCAAAA AATCCGCTCT GCATGACAGG AAGCCCCTGG TCTGGGCAGG GGGAGGCTAC
GGCGGCCAGG TAGCCGTGCG TTTCGACTCC AAGAAAGTCA TCCGCCGCAA CCTGTTTCCC
GGTAAAAAAC CGGACAACTG GGAAGTGTAC GATATCGTGA AAGACCCCGC AGAGAAAAAT
AATATCGCCG CAGAAAACCG TGACCTTATC AACAGAGCCA TCGCCATTCT GGACAGGGAA
TATCAACCCG CGCCCGGCTT CCAGGCCCTG CGTTACAAGG CCCCGGAACA GGTAGCCGAA
TAA
 
Protein sequence
MVTFYNITLS TAFSLLMLSV AEAARPNVVL INADDLGWAE VGCYGQKKIK TPNIDKLASE 
GQRWVYFYSG APVCSPSRNV LMTGKHTGNC DVQDLKRVDA GENWRDLKGD WPIRTETYTL
PEAMKKAGYA TAVFGKWGIG DFGSTGAPDK HGVDRFYGYT DQKACHTYYP PYLWNDGKKE
VLNTSLTAAT IGHGSQPKGE VLADTYRAEQ HSSDLIADKM LEFVKEKAHG KQPFFLYYAP
LEPHVAMQPL QEWIDRYPRE WDKSPYRGNR GYLPHPRPRA AYAGMISQMD HNVGRLLDTL
KACGLDKNTI VIFTSDNGTT HDAGGVDHRF FNSVADLKGL KGQLYEGGIR VPGIIRWPGK
IAPGKTITQP AFHADVMPTL CALTGADAGS PLGTDLSPVL LGKKSALHDR KPLVWAGGGY
GGQVAVRFDS KKVIRRNLFP GKKPDNWEVY DIVKDPAEKN NIAAENRDLI NRAIAILDRE
YQPAPGFQAL RYKAPEQVAE