Gene Amuc_1074 details


Gene Information

Locus tag: Amuc_1074
Symbol:
ID: 6274034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1282440
End bp: 1284098
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 56%
IMG OID: 642613125
Product: sulfatase
Protein accession: YP_001877681
Protein GI: 187735569
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
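
The length fields above are consistent with the coordinates: with 1-based inclusive coordinates, End bp - Start bp + 1 = 1284098 - 1282440 + 1 = 1659 bp, and 1659 / 3 = 553 codons, one of which is the stop codon, leaving 552 aa. A minimal Python sketch of that check follows. The function names are hypothetical, and the code assumes inclusive coordinates and an unspliced CDS that ends in a stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1282440, 1284098)
print(length)                     # 1659
print(protein_length_aa(length))  # 552
```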


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.0111181
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACCTT TGAAAACCAT CATCGCCGGA ACTCTGGCCC TGCTGGCGGC AGCTCCCCTC 
TCAGCTCAAA CCAAGGCTGA GGAAAATAAA AAACCGAATA TCCTCTTTAT CATTACGGAC
GACCACGCCT ACCAGACGCT GGGCACCGGC AATAATGATT CCCCCGTGGC CCTGCCCAAT
TTCAACAAAC TGGGACGCCA AGGCATGGTT TTTGACCGCA GCTACTGCGC CAACTCCCTG
TGCGGCCCCT CCCGCGCCTG CATCCTGACC GGCAGGCATT CCCACATGAA CGGTTTTGTC
TTCAACGGAC AAAGACCGCT GGACGGCTCC CAGCCCACTT ACCCGAAAAT GCTGCAGAAG
GCCGGATACC AGACGGGCCT TTTCGGCAAA TGGCATCTGG AATCGGACCC CACCGGGTTC
GACACGTGGG AAATCTTCCC CGGCCAGGGC AGCTACTACA ATCCGGACTT TATCAGCCTC
AAGCCGGACG GCAAACGCCA GACAAAGCGT TTTCCCGGAT ATGCCACGGA CGTGGTCACG
GACAAATCCA TCCAGTGGCT GGGAAACCGG GACAAGAACA AACCTTTCCT GCTCGTTGTG
GGCCACAAGG CTCCCCACCG CGCCTGGTGC CCTGCTCTGC GCCACCTGGG CAAGGTGGAC
ACTTCCAGCA TGACGCCGCC CGCCAACTTC CATGACGACT ATGCCAACCG TCCGGAATTC
CTGAAGAAAA ACCAGCAGAC AGTCGCCAAT CACATGGCGA TTTATTCCGA CCTCAAAGTG
CTTAAGGACC AGGTTCCGGA AGAAATGCGC AAAAGCATCG TTTCCCCCGG TTACGGCTGG
GACCTGGGCG AGTTGAACCG CATGACTCCG GAAGAAAAGA AAACCTGGAC GGACTATTAC
GCCAAGCGCA CCAAATCCCT GGTGGACGGC ATGAAATCCG GAAAACTGAA GGACCCGAAA
GCGTTTGCGG AATGGAAGTG GCATGCCTAC ATGGAGGATT ATCTGGGATG CCTTCTGTCC
GTGGACGACA GCATCGGCCG CCTTATGGAA TATCTGGACA AAGAGGGGAT TGCGAAAGAC
ACGCTGGTCA TCTACTGCGG AGACCAGGGG TTCTACATGG GAGAACACGG CATGTACGAC
AAGCGCTGGA TTTTTGAAGA ATCCCTCCGC ATGCCCCTCA TCATGAGATG GCCCGGCAAA
ATTCCCGCGG GCATCCGCAA CAACACCATG GTGCAGAATA TCGACTACGC TCCCACCATC
GTTTCCGCGG CAGGGGCGGA CACCCCGGAA AACATGAATA CCTTCCAGGG CGTATCCCTG
CTTCCCACCG CTTTCACGGG CAAAACTCCC GACAACTGGA GGGATGCCAT TTACTACTGT
TTTTACGAAA ATCCCGGCGA ACACAACGCC CCGCGCCACG ACGGCATCCG GACGGACCGC
TACACGCTTT CCTACATCTG GACCAGCGAC GAATGGATGC TCTTTGACAT GAAAAAGGAT
CCCATGCAAA TGAAAAACGT CATTGACGAT CCTGCCTACA AGACTACGGT GGAACAGCTC
AAGAAGCGTT ACCACGAACT GCGCAAAACC TATAAAGTTC CGGAAAACAG CCCCGGAGGC
AAAGGAACGC CTATCCCCAA ATTCGACGCT TCCTGGTAA
 
Protein sequence
MRPLKTIIAG TLALLAAAPL SAQTKAEENK KPNILFIITD DHAYQTLGTG NNDSPVALPN 
FNKLGRQGMV FDRSYCANSL CGPSRACILT GRHSHMNGFV FNGQRPLDGS QPTYPKMLQK
AGYQTGLFGK WHLESDPTGF DTWEIFPGQG SYYNPDFISL KPDGKRQTKR FPGYATDVVT
DKSIQWLGNR DKNKPFLLVV GHKAPHRAWC PALRHLGKVD TSSMTPPANF HDDYANRPEF
LKKNQQTVAN HMAIYSDLKV LKDQVPEEMR KSIVSPGYGW DLGELNRMTP EEKKTWTDYY
AKRTKSLVDG MKSGKLKDPK AFAEWKWHAY MEDYLGCLLS VDDSIGRLME YLDKEGIAKD
TLVIYCGDQG FYMGEHGMYD KRWIFEESLR MPLIMRWPGK IPAGIRNNTM VQNIDYAPTI
VSAAGADTPE NMNTFQGVSL LPTAFTGKTP DNWRDAIYYC FYENPGEHNA PRHDGIRTDR
YTLSYIWTSD EWMLFDMKKD PMQMKNVIDD PAYKTTVEQL KKRYHELRKT YKVPENSPGG
KGTPIPKFDA SW
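
The protein sequence can be re-derived from the gene sequence using translation table 11, and the GC content recomputed from the same string. The sketch below is a minimal Python illustration, not the tool that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the NCBI ordering of bases (TCAG) and the amino-acid string for table 11. Table 11 also permits alternative start codons that are read as M at the first position; the sketch does not handle them, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest) paired with table 11 residues.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-character block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence shown above.
print(translate("ATGAGACCTT TGAAAACCAT"))  # MRPLKT
print(translate("TTCGACGCT TCCTGGTAA"))    # FDASW
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above once its spaces are removed with `clean`, and `gc_fraction` on the same input can be compared with the reported GC content.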