Gene Amuc_1655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1655 
Symbol 
ID6275705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2000025 
End bp2001689 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content56% 
IMG OID642613714 
Productsulfatase 
Protein accessionYP_001878255 
Protein GI187736143 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.789399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0842498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG TTTGCGGAGT ATTGTTCCTG TTTGCCGGGG CATCCGCTCT GGGCAATGGT 
TCCAACACTG TTTCCGGAAA GAAGCCCAAT ATCATCGTTT TCCTGGTAGA TGACATGGGA
TGGCAGGATA CTTCCCACCC GTTCTGGTCC GACAGCCAGG GCAACCCGAA GAAAACCTTT
TTGAACAGGC GCTACCGGAC GCCGAATATG GAGAAGCTGG CTTCCCAGGG CATGACGTTT
ACGGATGCAT ATGCCCATCC CCTTTGCACT CCTTCCCGGG TGAGCCTGAT GTCCGGCATG
AATCCGGCGC GGCACCGCGT GACCTGCTGG GTACGGGAAC AGAACGGAAC GACGGACGCC
AACAGCAGGA GCCTCCTGCC TCCGGACTGG GCGTTGAACG GCCTTCAGCC TATAGGCACT
CCCGCCAGGG GAACGACAAA ACGCCCCATT TCCGGGGAGG ATATGCGCTA TCACATGACG
CGTCCTTTTG CCACGGCGGC TACACTGCCG GAGATGCTGA AGAAGTGCGG TTATGTTACC
GTCCATTGCG GGAAGGCCCA TTTCGGCACG CAGGGAACTC CGGGTTCCAA TCCGTTGAAT
ATGGGATTTG ATTATAATAT CGCAGGTACG GAGATCGGCC ATCCGGCGGA TTACCGCGGT
TCCAGGCATT ACGGAAAGGG GTTTAACCAT GTGCGCGGAC TGGATGAGAA TAATTATTAC
CAGGACGATG TATTTTTGAC GGAGGCCCTG ACGCGGGAGG CCATTAAACG CCTGGAAGCC
ATCAGGACCA ATCCCAGGGA GGCTGGCAAG CCCTTTTATC TGTACATGGC CCATTACGCT
TTGCATTCTC CGCTGGATGA GCGCGCTTAC GACAAGAGGT TTGCGGATGC CTACAAAAAC
CCGGAGGACG GCCACAAGTG GTCCCGGACG GAGAAACATT ATTCCGGGCT GATCGAGGGG
ATGGACAAAA GCCTGGGGGA TATCATGAAG TATCTCAGGG AACATCATCT GGAAAAAAAT
ACCGTACTGG TGTTTATGTC GGATAACGGA GGCCTGGCCA TCTCCGGCAG ACTGGGCAAT
GAAGAGGCCA ATTACCCTCT TTCCTTCGGC AAGGGGTCAT GCATGGAAGG CGGTATCCGG
GAGCCTATGA TTGTTTCCTG GCCGGGCGTG ACGAAGGGCG GTTCAAGGTG TGCCGTTCCG
GTGGTTATTG ACGATTTTTT CCCAACTCTT CTGGATATCG CCGGATGCCG GAACGTAGAA
GTTCCGCAGA AGCTTGACGG CTTAAGCCTG GTTCCTTTGC TCAAAGGCGG CCGGTTTCCT
GAAGACCGCC CCCTTTTGTT CCACCAGCCG AATAATTGGG GGGAAGGCAG CCGGCAGGCG
CCTCAGTATA CTTCTTCCAC CGCATTGCGC CAGGGGGATT GGAAATTGAT TTACCGCCAC
CTGACCCAGA GCTTTGAGCT GTACCATTTA AGGAAAGATA TCGGCGAGAA GGAAAACCTG
GCTTCCAGGG AGCCGCGGAA AACCAGGGAA ATGGCTGTTG TCATGGGCAG GTTGCTCCGG
GAGAGGAAAG CGCAGATGCC AACCTATAAG AAGGACAATG AGCTGGGCGC TCCTGCCGGG
AGTTCCGTTC CGTGGCCCGA CCAGGTGAAG GGGAATGGTT TTTGA
 
Protein sequence
MSAVCGVLFL FAGASALGNG SNTVSGKKPN IIVFLVDDMG WQDTSHPFWS DSQGNPKKTF 
LNRRYRTPNM EKLASQGMTF TDAYAHPLCT PSRVSLMSGM NPARHRVTCW VREQNGTTDA
NSRSLLPPDW ALNGLQPIGT PARGTTKRPI SGEDMRYHMT RPFATAATLP EMLKKCGYVT
VHCGKAHFGT QGTPGSNPLN MGFDYNIAGT EIGHPADYRG SRHYGKGFNH VRGLDENNYY
QDDVFLTEAL TREAIKRLEA IRTNPREAGK PFYLYMAHYA LHSPLDERAY DKRFADAYKN
PEDGHKWSRT EKHYSGLIEG MDKSLGDIMK YLREHHLEKN TVLVFMSDNG GLAISGRLGN
EEANYPLSFG KGSCMEGGIR EPMIVSWPGV TKGGSRCAVP VVIDDFFPTL LDIAGCRNVE
VPQKLDGLSL VPLLKGGRFP EDRPLLFHQP NNWGEGSRQA PQYTSSTALR QGDWKLIYRH
LTQSFELYHL RKDIGEKENL ASREPRKTRE MAVVMGRLLR ERKAQMPTYK KDNELGAPAG
SSVPWPDQVK GNGF