Gene Amuc_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1118 
Symbol 
ID6273952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1336709 
End bp1338595 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content55% 
IMG OID642613169 
Productsulfatase 
Protein accessionYP_001877725 
Protein GI187735613 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGTT ATTATTGGGG GAGGTTTTGG CCGTTCCTCC TGTTTGCGTT CGGCATTGAG 
GCGGTGGAAA ATCTGTTTAC GGTATTTTTC GAATACCGCA ACATGGATTT CGGGTTGCTT
CCCCTCCTGA AAACGGCGTA CGTTTTTGTG ACGGAGTTTG CCGTCACCAT GTGCTACTGG
CTTATTCCCT ATGCCGTTTA TTTGTGGATT CTGCCGCGCG GGAAAGCAGG AGGGAAGGCG
GACAGGTGGC TCACCTCCGC ATGGTTTTTC CTGTTTGTGC TCGCCAATCT GTTTGAAGAT
GTGGCGGAAG CCTTTTTCTG GAACGAGTTT GAAGCCAGCT TCAATTTCAT TGCGGTGGAT
TACCTGGTTT ACACCAAGGA GGTTATCGGG AATATTTACG AGTCCTATCC CATCATTCCT
ATTCTGGGCG GCATTCTGGC GGCGTCCGTT CTGGCCGCCT GGGGAATGAA GAGGTTCCTG
CTTCCCAGGA ACGGGGCGGT TCCCGCCGGA TGGAAACGGG GCTGTGTGGT GCTGTTCCTG
CTGGCCTGCG TCACGGGGGG ATATTGGCTG GTGGATATCA AGGATGCGGA TGCCGTGAAC
AACCGTTATA ATTCGGAAAT GGCCAAGGAT GGCCTTTACA GCCTGTTCAG CGCCTTTCTC
AAGAATGAAC TGGATTACCG CGCTTATTAC CGGACGCTGC CGGATGCGGA AGCGGCGGCG
TTTCTGGCCC GGGAGTTCAC GGCGGATGAC ACGTCCGTGC CGGAGGCTTC GTCCGGCAGC
GTAAAGAGGC AGGTGCGTCC TTCCGAGGGG GCTATCCGCC CGAATGTCGT GGTTGTGGTC
ATGGAGAGCA TGGGAGCGGA ATTTTTGAAC GAGTGCCGGG AAGACGGGGC TGACGTCACT
CCGTGCCTGA GCCGTCTGGG AAAGGAAGGC ATTTTTTTCC CGAATACTTA TGCCACGGGC
ACCCGTTCCG TACGCGGTCT GGAAGCAATC AGCACATCCC TGCCGCCGCT TCCCGGCATG
TCCATCCTTC GCCAGGAAGG AAACGAGCAT TTGCAGACCA TAGGTTCCAT ATTCAGGGAC
AAGGGATATG ATCTCAAATG GATTTACGGC GGCTACGGGT ATTTTGACAA CATGAATTAT
TTTTTCGGGA ACAACGGGTT TCAGGTTCTG GACCGTAATT CCATGGCTGA TTCCGAGGTG
ACCCATTCCA CCATTTGGGG CGTTTGCGAT GAAGATTTGT TCCGCCGCGC CGTACGGGAG
GCGGATGAAT CCTGCGGACG CGGCAAGCCG TTTTTGCAGG TGGTGTTTAC CACGTCCAAC
CACCGCCCCT ACACGTATCC GGAAGGGCGC ATTGACATTC CTTCCCACAC GGGGCGCATG
GGGGCCGTGA AATACGCGGA TTATGCGGTA GGCGCCTTTG TGGAGGAGGC CAGAACCAAA
CCCTGGTTTG ACAACACGCT GTTCGTGTTC GTAGGAGACC ACGGCGCCGG GAGCGCGGGA
AAGCAGGCCC TCAATCCGGA AACGCACCGC ATTTTTTCCA TTTTCTACGC TCCGGCTCTG
CTGAAACCGG AACGGCGGGA CACTCCCGTG AGCCAGATTG ACGTGCTGCC CACCCTGCTG
GGGCTGTTGA ACTGGCCGTA TGATGCGGCC TTTTATGGGA AGGATGCCTT GAAGCCTTCC
TATCAATCCC GGTATTTTGT GAGCAATTAC CAATATATCG GCTATTTGAA GGGGAAAGAC
ATGGTGGTGC TCAAACCCCA GCGCGGAGTG GAATTTTTCC GGGACGGAGA GGCCGTTGAG
CCGGACGGGC GGATGAAAGA GCTGGAAAGG GAAGCGGTTT ATTACTATCA GCACGCTTCC
GGCTGGCGCA CCAGTTTGAA AGAATAA
 
Protein sequence
MIRYYWGRFW PFLLFAFGIE AVENLFTVFF EYRNMDFGLL PLLKTAYVFV TEFAVTMCYW 
LIPYAVYLWI LPRGKAGGKA DRWLTSAWFF LFVLANLFED VAEAFFWNEF EASFNFIAVD
YLVYTKEVIG NIYESYPIIP ILGGILAASV LAAWGMKRFL LPRNGAVPAG WKRGCVVLFL
LACVTGGYWL VDIKDADAVN NRYNSEMAKD GLYSLFSAFL KNELDYRAYY RTLPDAEAAA
FLAREFTADD TSVPEASSGS VKRQVRPSEG AIRPNVVVVV MESMGAEFLN ECREDGADVT
PCLSRLGKEG IFFPNTYATG TRSVRGLEAI STSLPPLPGM SILRQEGNEH LQTIGSIFRD
KGYDLKWIYG GYGYFDNMNY FFGNNGFQVL DRNSMADSEV THSTIWGVCD EDLFRRAVRE
ADESCGRGKP FLQVVFTTSN HRPYTYPEGR IDIPSHTGRM GAVKYADYAV GAFVEEARTK
PWFDNTLFVF VGDHGAGSAG KQALNPETHR IFSIFYAPAL LKPERRDTPV SQIDVLPTLL
GLLNWPYDAA FYGKDALKPS YQSRYFVSNY QYIGYLKGKD MVVLKPQRGV EFFRDGEAVE
PDGRMKELER EAVYYYQHAS GWRTSLKE