Gene Amuc_0565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0565 
Symbol 
ID6275592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp664189 
End bp665817 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content57% 
IMG OID642612614 
Productsulfatase 
Protein accessionYP_001877183 
Protein GI187735071 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.776698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATTCC ATATTCCTTT CATGTGCTCT ACCCTGCGTT CCTCCATCAT GTCCAGCATA 
GCCTTCTGGT TGCTTACCCT TGTCCCTTCA CTGGCGGACA GGCCTAATAT CGTCCTTATC
CTGGCTGATG ACATGGGATG GTCCGACCCG GGCTGCTACG GTTCGGAAAT ACCCACTCCA
GCCCTGGATA CCCTGGCCAG ACAGGGGATG CTGGCAACCC GGCTCTACAC CGCTTCCCGC
TGTTCACCCT CCCGGGCCTC CATCATGACC GGCTGCGAAC CTCACAAGGT GGACGTAGGG
CTCCTGGATG ACGACAGCGG GCGCCCCGGC TACCGCGGAC GCCTGAACCC GGGTATCCCC
ACCCTGCCGG AACTTCTGAA AAAAGCGGGA TACCGTACCT ATCTTTCCGG GAAATGGCAT
CTTGGAAAAG TTCGGGGATC CTACCCATGG GACCGCGGCT TCGACCGTTC CCGCGGTTTG
CTGGGTGGAG CGGCAGATTA CTACAGGCCC ATGCCGGACA GCCCCTTCGG TGAAAACGGG
AAACTGCTCC GTCCGGAGGA TCTGCCGGAG GATTTCTACA TGACGGACGA TATCACCAAA
ACGGCTCTGG CCTATATTGG CGATGCCGCC AAAAGCAGGC AGCCCTTCTT CCTTTACGTG
GCTTATACGG CTCCGCATAC ACCCCTCCAG GCCCCCCGGA GGGAAATAGA AAAAATGCTC
CCGTTCTACA ACGGCAAATC CCCCCATGCC ATTGCTTCCA AAAGACTGGA AAAACAAAAG
CTGCTGGGAA TCGTCCCTCC TGCCGCCAAA CTGGGCATGG CCGGCAAATT CAATCCAGAA
GGCTATGAAA AAACTTCCGC AAAGCGGAAG GATTATATTG CCGAATGCAT GGCCACCTAC
GCCGCCCAAA TCGTTATCAT GGACCGCGGC ATAGGCCGCA TTCTCGCGTC CCTAGAACGT
CACCGCCTCA GTGACAATAC CATCGTCATG TTTTTATCAG ACAATGGCGC AACAGCGGAA
ATGCCCCAGA ACAATAAAAA CAAGAAGACT ACCCTCCCCA CAGGCCCGCT GGGAGAAGTC
GGATGCAGGG ACGGATACGG CCCCATGTGG GCGGCTGTGT CCAATACCCC TTACCGCCAG
TATAAAATAG AAACCTTTGA CGGAGGGCTG TCCGCCCCCT TCATCATTCG CTACCCTTCC
AAAATACGTC CGGGATCGCG CTACCACTCG CCTTTCCTGC TTCAGGACAT CGCCCCAACC
TGCCTTGCGT GGGCCGCTCT CCCAATTCCG GCCCATATGG ACGGCAAGCC GCTCAACACC
TACTGGAATA ATCCTCCGGA ACTTCCTCCG TCCAAGGTGT GGGACTTCAT TCCCAATACC
TGCCCTCCCC GCACTATCTT CTGGGAACAT CAGAGGAACC GGGCGGCCCT GACAAGTCAA
TTCAAGCTGG TGGCCCCCAA CCGCGGCCCC TGGCAGGTGT ACGACATCAG GGACAGGACG
GAACAGAACA ATCTGGCATC CAGGCACCAG ACGCTTGTAG AACAATTGTC CGCACAGTAC
AGGAAATGGG CGGCGGAAAA CCATGCCGAA CGACACAGCC CGGCAGAAAA ACGGGCATAC
GCGCCCTAA
 
Protein sequence
MIFHIPFMCS TLRSSIMSSI AFWLLTLVPS LADRPNIVLI LADDMGWSDP GCYGSEIPTP 
ALDTLARQGM LATRLYTASR CSPSRASIMT GCEPHKVDVG LLDDDSGRPG YRGRLNPGIP
TLPELLKKAG YRTYLSGKWH LGKVRGSYPW DRGFDRSRGL LGGAADYYRP MPDSPFGENG
KLLRPEDLPE DFYMTDDITK TALAYIGDAA KSRQPFFLYV AYTAPHTPLQ APRREIEKML
PFYNGKSPHA IASKRLEKQK LLGIVPPAAK LGMAGKFNPE GYEKTSAKRK DYIAECMATY
AAQIVIMDRG IGRILASLER HRLSDNTIVM FLSDNGATAE MPQNNKNKKT TLPTGPLGEV
GCRDGYGPMW AAVSNTPYRQ YKIETFDGGL SAPFIIRYPS KIRPGSRYHS PFLLQDIAPT
CLAWAALPIP AHMDGKPLNT YWNNPPELPP SKVWDFIPNT CPPRTIFWEH QRNRAALTSQ
FKLVAPNRGP WQVYDIRDRT EQNNLASRHQ TLVEQLSAQY RKWAAENHAE RHSPAEKRAY
AP