Gene Amuc_1182 details


Gene Information

Locus tag: Amuc_1182
Symbol:
ID: 6273823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1416784
End bp: 1418181
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 55%
IMG OID: 642613233
Product: sulfatase
Protein accession: YP_001877788
Protein GI: 187735676
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
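
The coordinate and length fields above are related by simple arithmetic. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1398 bp) and that the CDS ends in a single stop codon; the function names are illustrative only:

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1416784, 1418181)
print(length, protein_length_aa(length))  # 1398 465
```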


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAGG AAGTGCGCCT TTTGTTAATT CTGTGCGGTT TGTTTTGCGG AACGGCAGTT 
GCGCAGCCCC GCCTTTCCTC ACCGCCGAAC ATGATTGTCA TTCTGGCGGA TGACCTGGGT
TATGGGGATT TGGGCTGTAC AGGTTCCAAG CAGATAAAAA CGCCTTCCCT TGACCGGCTG
GCAAGGGAGG GGGTGTTCTG CTCCCGGGCG TATGTGACGG CGCCGATGTG TTCTCCCTCC
CGCATGGGAC TGCTGACGGG GCGTTTCCCC AAGCGTTACG GCATCACGAC GAATCCCAAT
ATCCAGATGG ATTATCTTCC GGAGTCCCAC TACGGGCTGC CGCAGACGGA GAAATTGATT
CCGGAGTATT TGGCTCCCTG TGGGTACCGG AGCGCGGTAT TCGGCAAGTG GCATCTTGGC
CACACGAAGG GATATACGCC GCCGGAGCGC GGTTTTACGC ATTGGTGGGG GTTCCTGGGC
GGTTCCCGGC ATTATTTTCC CGTGAAGAAA GAGGCTGAAG GCCTGAATCC CTCCATGATT
GTATCCAATT TTACGGATAA GACGGACATC ACCTATTTGA CGGACGATAT TACAGACCGG
GCGGTTGAAT TTCTGCAGGA AGCCGGAAAG GATAAGAAAC CGTTTTTCAT GTTCGTTTCC
TACAACGCCC CCCATTGGCC CAATGAAGCC AAACCGGAGG ATATCGCCAA ATTCAGGAAC
GTGCAGAACG GGGAACGACG GGTTTATTGC GCCATGGTAT ATGCGATGGA CAGGGGAATA
GGCCGCATTC TGGATGCCTT GAAAGCAGAC GGTTTGGAAA AGGATACCAT CGTCGTCTTC
CTGTCGGACA ATGGAGGCGC TCCGGAAGCT TCTTCCTGCA ACGCCCCTTT CCGGGGTGCC
AAGAGGCAGC ATTTTGAAGG AGGTGTTCGC GTACCTTTTA TTATCAGATA TCCGGCGGAC
AAGCGTTTGG TTCCCGGAAG CGTTTGCAGA CAGCCCGTTT CCTCCGTGGA TTTGCTTCCC
GCTTTGCTGA AGGCGAATGG CCGTCACATT CCCAGGAAGC TGGACGGCAT GGATATTCTG
GAGCTGGTGG GGAACAAGGG AGCTCCCGTT CCGCGCACCT TTTTCTGGTG CACGGATTAC
ACGTCCGCTG TGCTGACCGG TGATATGAAG TACCTGCTGG TCCCGGATCG CGCTCCGCAG
TTTTATAATG TGGCGGACGA TCCCCAGGAA CAGAGGGATT TGTATTTTTC CAGGCACCAG
GATGCGGATC TCCTCGCTAA AAAGCTGGGA ACATACCTGA CTACGACGCC TGCATGCCGT
TTTCCCGACA GTATCAGCTG GTCCGCCAAA TTGATGAGGG AGTATGACAA GACTGCGCCG
GACAGGCAGC CTGAGTAA
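
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. The sketch below shows one way the GC content field (reported as 55% for this gene) could be recomputed from the sequence; the helper names are hypothetical and the example call uses only the first line of the sequence:

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly space-separated) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Paste the full gene sequence here to check the record's GC content value.
first_line = "ATGATGAAGG AAGTGCGCCT TTTGTTAATT CTGTGCGGTT TGTTTTGCGG AACGGCAGTT"
print(f"{gc_percent(first_line):.1f}%")
```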
 
Protein sequence
MMKEVRLLLI LCGLFCGTAV AQPRLSSPPN MIVILADDLG YGDLGCTGSK QIKTPSLDRL 
AREGVFCSRA YVTAPMCSPS RMGLLTGRFP KRYGITTNPN IQMDYLPESH YGLPQTEKLI
PEYLAPCGYR SAVFGKWHLG HTKGYTPPER GFTHWWGFLG GSRHYFPVKK EAEGLNPSMI
VSNFTDKTDI TYLTDDITDR AVEFLQEAGK DKKPFFMFVS YNAPHWPNEA KPEDIAKFRN
VQNGERRVYC AMVYAMDRGI GRILDALKAD GLEKDTIVVF LSDNGGAPEA SSCNAPFRGA
KRQHFEGGVR VPFIIRYPAD KRLVPGSVCR QPVSSVDLLP ALLKANGRHI PRKLDGMDIL
ELVGNKGAPV PRTFFWCTDY TSAVLTGDMK YLLVPDRAPQ FYNVADDPQE QRDLYFSRHQ
DADLLAKKLG TYLTTTPACR FPDSISWSAK LMREYDKTAP DRQPE
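
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. A minimal sketch, assuming the gene is given 5' to 3' in its coding orientation and starts with ATG (as it does here): translation table 11 uses the same codon assignments as the standard code and differs only in which codons may act as start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGATGAAGG AAGTGCGCCT TTTGTTAATT"))  # MMKEVRLLLI
```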