Gene Amuc_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0121 
Symbol 
ID6274910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp149378 
End bp150958 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content58% 
IMG OID642612166 
Productsulfatase 
Protein accessionYP_001876747 
Protein GI187734635 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT CCGCATTACA CCTGATGCTT GCCATGGCGG CGGCAGCGGC CTGCCCGGCC 
GCAGTCACGC CAACGGTCAA GCCACCCAAG GCCATCGTCA TGATTTACGC TGACGACCTC
GGCTATGGAG ACGTAGGCTG CTATGGAGCC AAGGGAATTC CCACCCCCGC CATCGACAAG
CTTGCCGAGC AGGGCTGCCG CTTCACGGAC GCCTATTCCA CCACATCCGT CTGCACCCCT
TCCCGCTATG CCCTGTTCAC AGGGGAATAT CCGTGGCGCA AGGAAGGCAC GGGCATCCTG
CCGGGAGATG CCGCCCTCAT TATCGATACC AAGAAGCCCA CCCTGCCCAA GATGCTCCAA
TCCCACGGCT ACAAAACCTA CATGATAGGC AAATGGCACC TGGGCCTGGG GGAAAAGGGG
AAGAAGATTG ACTGGAACAA ACATATCTCC CCCAGCCCGA ACGAAATCGG ATTTGACGAG
AGCTTTATCT TCGCCGCTAC GGGCGACCGC GTTCCCTGCG TGATTCTGGA AAACGGCAAT
GTCCGCAACC TGGACCCGAA CGACCCCATT GAAGTATCCT ACAAGCACAA CTTCCCCGGG
CTTCCCAATG GCAAGGATAA TAAGGACCAG CTCAAACTCA TGTGGAGCCA CGGACACAAC
CAGGCTATTA TCAATGGGAT CGGACGCATC GGGTTCATGA AGGGCGGCAG AAGCGCCTTG
TGGAAGGATG AGGAAAACGC CGATATCATT ACGGATAAGG CCATTGAATA CATTCAAAAA
AGCGCCAAAG CCAAGGAACC GTTTTTCCTG ATGTTCGCCA CGCATGACAT CCACGTGCCG
CGCTGCCCGG AAAAACGCTT TGTGGGCAAG AGCCGGCACG GCGTGCGCGG TGACGTGACC
GTGGAACTGG ATGACTGTGT GCGCCGCATT ACAGAGGCTC TGCAACAGGC CGGTCTGGAA
AAAGACGCCC TGGTGATCTT CTCCAGCGAC AACGGTCCCG TGCTGGATGA CGGCTACAGG
GATTTCGCCG TCCGGGACAA CGCCACCCAT TCCCCCGCCG GCCCCTTCCG CGCAGGCAAA
TACAGCATTC TGGAAGGAGG TTCCCGCATT CCGTTTATCG TCAAATGGCC CGGCGTGATC
AAACCCGGAA CCACGAGCAA AGCCCTGCTC AATCAAATGG ATTTGGGGGC CTCCCTGGAA
CAGCTGCTGG CCCCCGGCAA GGCCAATTCC TTCCGCGACT CTGAAAACGT GATGCCCGCC
CTTCTGGGCA AATCCGCCAA GGGGCGTGAC TACCATGTCA TCAACAGCAC CGGCAAGGCA
TTGGCGATTC GCCACGGCAA ATGGAAATTC ATTCCCGCCG GCGTGGCCAT TCGCGACGGC
ATCAACGGAG CCTCCGCAAA AATGAGCAAG TCCCCGGAAG GAGGAAGCCT CTTTGACCTG
GAAAAAGACC CGAAGGAACT TGACAACGTA GCCTCCCAGC ATCCGGACAT TTGCGAACAG
ATGAAAGCCA AGCTTGAGGA AATCCGCCAG AGGCCCGAAA CCAAGGCTGA CCAGGAGGAC
CTGCTTCCCT TGGACGACTA A
 
Protein sequence
MNKSALHLML AMAAAAACPA AVTPTVKPPK AIVMIYADDL GYGDVGCYGA KGIPTPAIDK 
LAEQGCRFTD AYSTTSVCTP SRYALFTGEY PWRKEGTGIL PGDAALIIDT KKPTLPKMLQ
SHGYKTYMIG KWHLGLGEKG KKIDWNKHIS PSPNEIGFDE SFIFAATGDR VPCVILENGN
VRNLDPNDPI EVSYKHNFPG LPNGKDNKDQ LKLMWSHGHN QAIINGIGRI GFMKGGRSAL
WKDEENADII TDKAIEYIQK SAKAKEPFFL MFATHDIHVP RCPEKRFVGK SRHGVRGDVT
VELDDCVRRI TEALQQAGLE KDALVIFSSD NGPVLDDGYR DFAVRDNATH SPAGPFRAGK
YSILEGGSRI PFIVKWPGVI KPGTTSKALL NQMDLGASLE QLLAPGKANS FRDSENVMPA
LLGKSAKGRD YHVINSTGKA LAIRHGKWKF IPAGVAIRDG INGASAKMSK SPEGGSLFDL
EKDPKELDNV ASQHPDICEQ MKAKLEEIRQ RPETKADQED LLPLDD