Gene Amuc_1755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1755 
Symbol 
ID6275045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2136730 
End bp2138418 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content58% 
IMG OID642613818 
Productsulfatase 
Protein accessionYP_001878354 
Protein GI187736242 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTT CCCGTATTGC AGCCTTTCTC ATCCCGTTCC TGCTGGGAAG CGCCTGCTCT 
GCTTCCGCAG CTTCCGTAAA AGCATCCCGC GCGGCAGAAC CGAAACGCCC CAACATCATC
CTCTTTCTGG TGGACGACAT GGGCTGGCAG GATACATCCC TGCCCTTTTG GCGGGAAGAG
GACGGCACTC CCAAACCTAC TTTCCTGAAC AAGCGCTACC GCACGCCCAA CATGGAGGCT
CTGGGCAAGC AGGGCATGGT CTTCACGAAC GCCTACGCGC AGCCTATCTG TTCCCCCAGC
CGATGCAGCC TCATGTCCGG CATGAACTCC GCCCGCCACC GCGTCACGAA CTGGACGCTC
CTGCGTGACC AAACCACGGA CGCAGGCCAT CAAGCGCTCA AGGCTCCCGC AGACTGGAGC
GTCAACGGCA TCCAGCCGGC AGGAACCAGG GCATCCGGAA CCACCCATCT TCCCCTGACG
GAAGAAAAAA TCCAGTACAG GATGGAAAAA CCCTTCACCC AGGTGCTTGG GCTGCCCGCC
CTGCTCAAAA AACAGGGGTA TACCACCATC CACTGCGGAA AAGCCCATTT CGGCTCCAAA
AATACGCCCG GAGCCAATCC CAGGCTCTTC GGCTTTGACT ACAATATCGC CGGCACGGAA
ATAGGCGGCC CGGCGGACTA CCGGGGTTCC CGGAAATACG GCACGGGGAA TTTCCATGTC
CGCGGACTGG ATGAAAATAA TTACTATGAA AACGACACAT TCCTGACGGA AGCCCTGACG
CAGGAGGCAC TGAAACGCCT GGACGCCATC CGGAAAAACC CCAGGGAGGC GGACAAGCCT
TTCTACCTGT ACATGTCCCA CTACGCCATC CATGCCCCCT TCGACATGCG CGGTTATGAC
AAGAGGTTTG CGGAAGAGTA CTCCAACCCC AATGACGGCC ACAAATGGTC CGACAACGAA
AAACGCTATT CCGCCCTGAT CCAGGGGATG GACAAGAGCC TGGGCGACAT CCGGGAATAC
CTGAAAAAAA ACAATCTGGA TAAAAATACC GTCATCATCT TCATGGCGGA CAACGGGGGC
CTTGCCATCA GCGGGCGCAT GGGCAACAAG GAATCCAATT ACCCGCTTAG CTTCGGCAAG
GGTTCCAACC GGGAAGGCGG CATCCGGGAA CCGATGATCG TCTACTGGCC CGGGGTCACG
AAGGCGGAAA GCGTCTGCAC TACTCCCGTC ATCATTGAAG ACTTTTTCCC CACCATTCTG
GAAATAGCAG GAGCAAAAAA AATTCAGGCG CCTCAGGTTG TGGACGGCAA AAGCTTTGTT
GCCCTGCTGA AAGGCGGCAG CATGAATCCC AACCGCTCCC TGCTCTTCCA CACGCCCAAC
GTATGGGGGG AAGGGAACGG AAACAACTCC CTTTATTCCC CCAGCACGGC CATGCGCCAG
GGGGACTGGA AGCTTATCTA CTGGCACCCT GACCAGAAGT TCGAACTCTT CAACCTCAAG
GAGGACATCA GCGAAGAGCA CAACCTGGCG GAACAGCAGC CGGAACGCGT TAAGGTCATG
GCCAGGACCA TGACCACCCT GCTCAAGGAG CGCAAGGCCC AAATGCCCAC CTACAAGAAG
AATAATCCCG CCGGAGCCCG TGAAGGCGCT CCCGTACCGT GGCCGGACCA GGCGGCGGCC
AGGCTGTAA
 
Protein sequence
MNISRIAAFL IPFLLGSACS ASAASVKASR AAEPKRPNII LFLVDDMGWQ DTSLPFWREE 
DGTPKPTFLN KRYRTPNMEA LGKQGMVFTN AYAQPICSPS RCSLMSGMNS ARHRVTNWTL
LRDQTTDAGH QALKAPADWS VNGIQPAGTR ASGTTHLPLT EEKIQYRMEK PFTQVLGLPA
LLKKQGYTTI HCGKAHFGSK NTPGANPRLF GFDYNIAGTE IGGPADYRGS RKYGTGNFHV
RGLDENNYYE NDTFLTEALT QEALKRLDAI RKNPREADKP FYLYMSHYAI HAPFDMRGYD
KRFAEEYSNP NDGHKWSDNE KRYSALIQGM DKSLGDIREY LKKNNLDKNT VIIFMADNGG
LAISGRMGNK ESNYPLSFGK GSNREGGIRE PMIVYWPGVT KAESVCTTPV IIEDFFPTIL
EIAGAKKIQA PQVVDGKSFV ALLKGGSMNP NRSLLFHTPN VWGEGNGNNS LYSPSTAMRQ
GDWKLIYWHP DQKFELFNLK EDISEEHNLA EQQPERVKVM ARTMTTLLKE RKAQMPTYKK
NNPAGAREGA PVPWPDQAAA RL