Gene Amuc_0161 details


Gene Information

Locus tag: Amuc_0161
Symbol:
ID: 6274966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 199559
End bp: 200563
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 55%
IMG OID: 642612206
Product: Mammalian cell entry related domain protein
Protein accession: YP_001876786
Protein GI: 187734674
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
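
The coordinates and lengths in this record are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 199559, End bp 200563.
length = gene_length(199559, 200563)
print(length, protein_length(length))
```

With the values from this record, the sketch prints 1005 and 334, matching the listed gene and protein lengths.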


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.458265
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGA AACTTAAACC GGAAACCTGG GTAGGCATCT TTCTGATCGC CGGGATCCTG 
ATGATTATCG GAGTCATCCT GGGATTCGGC AACATCAAAA CATCCAAGGA GCAAACCTAC
CCCATCAACA TCATTTTCAA AGATGCGGCG GGGCTCATCA AGGATTCCCA GATACGCCTG
GGCGGCGTCA CGGTGGGGAA AGTCACCAAA GCCCCTGAAC TCCTGCCTTC CGGCAATGAA
GTCATGCTGG AAGCCAGCAT TCAGAGCGAC GTGAAAATCC AGCAAGGGTC CGTCTTCCGG
GTGGACATGC AGAACATCCT GGGCGACAAA TACATCGATA TCGTCCCTCC GGCCCAGCCC
ACGCATGAAT ACATCCTTCC CCATGCCACC ATCATCGGTC AGCCGGAAAG CGATTTCAGC
AAAATCAAAA ACAATGCCGT GGGCGCCACA GAGGAAATCC TGAAAATCCT CAAGCAAATC
GAGAAAAATT CGGACAACAT TGACGACGCC ATCCTCAATA TCGGAGAGGC GGCCAAAGGG
CTGGCCCAGA CGACCAGGCT GATCAACGAG GGCATCCTGA ACCGGGAGAA CATCCAGAAC
CTCGGCAGCG TACTGTCTCA AATAAACCGG GCGGGAGAAC AGCTCCCCGG CCTCATGGAA
GAAACGCGCT CCTCGGTCGC GGCCATGAAA GACACCGTCC GGGATGCCCG CAAACTCATT
GCCGGAGCTG AAGAAAAGCT CAACACTCTG GACCCCGCCG TCAAGGCAAT CCCCCCTACG
CTGGCCGCTC TGAGGAAAGC ATCTGAACAA ATCTCCTCCT TCACGGCTGA TGCACGTAAA
AACCAGGGCT TTCTTGGGCT GCTTATGTAC GATGCTCGCT TCCGGGCCAA CGCTCAGGAA
TTCATCCGCA ATCTGAGGGA TTACGGCATT CTGAGGTACC GGAATCCTAA CGAACCCCAA
GTCAAACCGG ACCCCCGCGG CGGTTTTTCA GGAAGCCGCC GCTGA
 
Protein sequence
MKQKLKPETW VGIFLIAGIL MIIGVILGFG NIKTSKEQTY PINIIFKDAA GLIKDSQIRL 
GGVTVGKVTK APELLPSGNE VMLEASIQSD VKIQQGSVFR VDMQNILGDK YIDIVPPAQP
THEYILPHAT IIGQPESDFS KIKNNAVGAT EEILKILKQI EKNSDNIDDA ILNIGEAAKG
LAQTTRLINE GILNRENIQN LGSVLSQINR AGEQLPGLME ETRSSVAAMK DTVRDARKLI
AGAEEKLNTL DPAVKAIPPT LAALRKASEQ ISSFTADARK NQGFLGLLMY DARFRANAQE
FIRNLRDYGI LRYRNPNEPQ VKPDPRGGFS GSRR
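
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two are understood to differ only in which start codons are permitted). The helper name `translate` and the table-building approach are illustrative assumptions, not something provided by the source record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces and line breaks allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAACAGA AACTTAAACC GGAAACCTGG"))
```

For the first 30 bases of the gene sequence this prints MKQKLKPETW, which matches the first ten residues of the listed protein sequence. Passing the full gene sequence, with its spaces and line breaks, works the same way.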