Gene Amuc_0864 details


Gene Information

Locus tag: Amuc_0864
Symbol:
ID: 6274304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1032239
End bp: 1033957
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 59%
IMG OID: 642612919
Product: sulfatase
Protein accession: YP_001877478
Protein GI: 187735366
COG category: [R] General function prediction only
COG ID: [COG2194] Predicted membrane-associated, metal-dependent hydrolase
TIGRFAM ID:
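
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
START_BP = 1032239
END_BP = 1033957


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1719
print(protein_length(length_bp))  # 572
```

Under these assumptions both values agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields.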


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.218457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCTC CCCATGCATC CGCCGGTTCC GCAGAACGGT ATCCGTCCGC ATACGGCCTG 
TCGGCCGCTG TCTGCCAGGC TCTGCGCGCC GCGTTCATTC CCCGCCGCAT GCGCTGCCTG
TGGGCAGCCG GGCTTGCGGC GGCGCTGGCG GCCAATCCCT ATGTGCTCAA CGATGGCCAA
AGCCTGCTGA TGAGCAGCAT GGGCATCGCC TGCTCCGCAT CCCTGCTCTG CGGCACCATT
CTTCTCTGCC TGAAATACCG CTTCACCGCC TATGTCCTTC TTCCCGCCAT CATTCTGTTC
AATGCGGGCC TGTACATGAT GCAGATGCGG TACGGATTGG TCCTGAACCT TTCCGTGCTT
TCCAGCATGG CGGAGACCAA TTTCCAGGAA GCCTGCGCCT TCTGCACCCC CGCTTCCATT
GCAGGAACCC TGCTTCTGGC CGGGTTTATC TACCTGGCCA TCTACTGGAG CCGCCGTTCC
CTGCGTCAAA AAGCCACGTG GGGTGCACTG GCATGCATTT GGGGCCTGTA CGCGGCCCTT
CTGCTGCTGA GCATTCCGGC AGCCTCCTAC AGCTTGGAAC CCCTTTATCT TTACTACACG
TCTGACAAGG CCAAAGGCTG GCCTCTCGTG GATATTGCAA TGACGTGCAA ACTGGCGGAC
GAATACATCA CCCAGGATGC GGGGCGTTTC AACACATTGC GGAATCTTCC GTCCTGCGCG
GAGCCCCTTT CCCAGTGCGA AGCCCCAGAC GACCTGGCCG TGGTTTTCCA CATGGGGGAG
AGTGTCCGGG GGGACCATCT TCCCCTGAAC GGTTACCATC GGAACACAAT GCCCCGGCTT
TCCAAAGAAC CCAACGTCGT TTCCTTCCCG CATGCCACTT CCTTTGGCAT CGTGACCAGA
ATTTCCGCCA TCGGCATGTT TACGGATGCG GAACTGTGCC GCCGCACTCC CGGTCACTCC
TCCTTCATTG ACCTGTTCAA CAAACACGGA TTCCGCACCG TCCGTATCAT GGACCTGAGC
GGAGATTCCA TTCATGATTA TTCCCTGGGC ATCCTGACAC GGAACTGCCG TGAACGGAGA
CAGACGCCGC TCCAGCACCA GACGCCCGGA ATGATGCAGG AACGAACTTC CCTGGTCATG
GAGGAATCCC TGAAAAACTT CGGCCGCAAC AGGCAGCTTT ATATCATTTA CAATAACGGG
AGCCATATGG CGTTCAGCTA CCCCGCGCAG GCGGAATGTT TTACCCCGGC ATCCTGCAAT
ATGGACGACC CCAAGGCCCG TCTGGAAGAA ACCGTCAATG CCTATGACAA TTCCATCGTC
GACCTGGATG CCTCCATTCA CCGCATGATT GCACTGTTGA AGAACAGGCC CGCCATTTAT
TTTTACTGCT CCGACCACGG CGTAGCGCTG GGAGAGGAAG GAAAAATGTT CCAGGGCCAT
ATCCTGCCGC CTGTTTACCG GCCTGCCATG TTCATCTGGT ATTCGGACAC CTTCGCCTCA
CGCTATCCGG ACATGGTGCG CGCCCTGAAA GCCAACCGGC TGAAAGCCGT CTCCCACGAC
CACATCTTCC ATACCCTTCT TTCCCTGGCT TCCATCCGGT CGGAAATCGT CAGGAACGAC
CTGAATCTGG CTTCTCCGGA CGCGCGGGAA ACTCCGGCCC CCCTCCAGCC GGAAACGCTG
GCGGAATGGC TGCCCATTCC CGCACCGCCG CAGCCGTAA
 
Protein sequence
MIPPHASAGS AERYPSAYGL SAAVCQALRA AFIPRRMRCL WAAGLAAALA ANPYVLNDGQ 
SLLMSSMGIA CSASLLCGTI LLCLKYRFTA YVLLPAIILF NAGLYMMQMR YGLVLNLSVL
SSMAETNFQE ACAFCTPASI AGTLLLAGFI YLAIYWSRRS LRQKATWGAL ACIWGLYAAL
LLLSIPAASY SLEPLYLYYT SDKAKGWPLV DIAMTCKLAD EYITQDAGRF NTLRNLPSCA
EPLSQCEAPD DLAVVFHMGE SVRGDHLPLN GYHRNTMPRL SKEPNVVSFP HATSFGIVTR
ISAIGMFTDA ELCRRTPGHS SFIDLFNKHG FRTVRIMDLS GDSIHDYSLG ILTRNCRERR
QTPLQHQTPG MMQERTSLVM EESLKNFGRN RQLYIIYNNG SHMAFSYPAQ AECFTPASCN
MDDPKARLEE TVNAYDNSIV DLDASIHRMI ALLKNRPAIY FYCSDHGVAL GEEGKMFQGH
ILPPVYRPAM FIWYSDTFAS RYPDMVRALK ANRLKAVSHD HIFHTLLSLA SIRSEIVRND
LNLASPDARE TPAPLQPETL AEWLPIPAPP QP
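
The gene and protein sequences are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how the blocks could be joined, translated, and checked for GC content using only the Python standard library. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (the two differ only in permitted start codons; this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence as displayed in the record.
first_line = (
    "ATGATCCCTC CCCATGCATC CGCCGGTTCC GCAGAACGGT ATCCGTCCGC ATACGGCCTG"
)
print(translate(clean_sequence(first_line)))  # MIPPHASAGSAERYPSAYGL
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.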