Gene Amuc_0084 details

Gene Information

Locus tag: Amuc_0084
Symbol:
ID: 6275024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 110100
End bp: 111743
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 51%
IMG OID: 642612127
Product: hypothetical protein
Protein accession: YP_001876710
Protein GI: 187734598
COG category:
COG ID:
TIGRFAM ID:
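
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(110100, 111743)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```

Both values agree with the Gene Length and Protein Length fields of this record.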


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0897347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATT ATATACCAAT CAACAAGGCA GTTAATCCCG TCTACATCTA TCAAGGGAAT 
GAGCCTCTTA TCAGCAAGAC GCTCGGAGTT GACGCGATGA CGTTCAACTT TGACGGTTCG
GCGTGCGAAC ACGCCACTTG CACCGGAGCG CCGGAAATGT TCATCAAGGT CGAGGAAGAC
GGCCTGTACT ATTTCGGCGT GGAAGCCGAC GACACAGGCA GCCTGACGAT TGCCGGTGAG
CAAGCTATAA AAAAGGATGG GATACCACCC AACGGCAAAC TGAATATTGA AACCGATTCC
AGGTACCTGA AAGCTGGCTA TTACAAGGTG GCCTTGTCTT ACACGAATAA TGCCTACACC
CCCGTCAGCA ACAATGCCAT TGCATTCAAC GTCACGATGG ACAGGGAACC CATTCAGGCA
GGGAAGTATG AAGGCAACTC AACGATGAAG CGCGAATTCT CGTCGTCCCC CAAAATCAAA
TTGTGGACGA TTGAGAAGGA ATCCGCTATC ACCTGTGAGC CATCCAAGGA AGTGGGACTG
GAATTTGAGA AACCGGACCC CGTCTATTTG GAAGGGAATG GTGACTGCAA TGTAAGTAGC
TTGAGACCTA AGATTTGCGT AACGGTTTGC AAAGATGAAT CGGAAAACGT ATGGAGGTGT
CGTGTTGTTT CCGTTTCTGC CGGAGCCAAG CTAACTATGT ATGAAGGCAT TTACGTGAAT
CCCTATGTTG ATCCCCCTCT CAATGAAGAA GAAGCCACCG AAGCGGTCAA CGAAATGAAC
GGTTATCAAT CTCGTGCAGA AGTTGGAACA TGGCACACGC CGCAGGCTTC CCTCGCCCAC
GAAGAACACC ACCGCCGTCA ATGGGAGGAT GCCTACAAGT TCTACTGGAA GGACTCCAAA
ATACAGGAAA TGCTTGAAAA TCAGACTATC TCCTGCGACA AAGAACCGAA CATGGATGAA
GCCGTGAAGT TCATGCAGGC TCTTGCCAAT GAAATGGCAT TGGACCTTTG GAAAGAAACG
TCAGACTATG TACTGGCGTT GCCGGATGAT GCCAATGACA GACCCTATTG CGCGGGACAG
GAAGTCCTGA ACGAGGCAAC CGCCTATGTC ATTCGTCTGG CCGACGCGAT GGGGTGGAAC
AATGTGCCCA GGTTTATCAC GAAACCTGGA ACGATCGAGC CTCCCTGTTT CATGCCTCCG
GTCAGCGAAG GTAAAACCCG CAGCATGGCA ATTGCAGAGG AATCGACACC TCTTCAGATC
TCCATTGTGG ACACATCCCA ATTCACGGAG GGCAAAATCA CAGTCCGCTT CTGCAACGAA
GGAAACGAGC CTGTCCGCAT TCCCGATGAA ATCAACGACG AAACGTCCGA TTTCTTTTTC
GTGACGGTGT TGAGGACGCA ACAGGGGAAA ATGCGCGTTC TGAACAGGGA AATTGGCACC
ATGACCTTCC AGCGCCCCTT GAACTACCGG GAGCTTGCAC CCGGTCAGGA ATACAGCGTC
ACGATTCCGG CGTGTCTGGA TGAAGTCGAT CTGGAAGGCT GGAAACAATG TTCTTGTGAA
CTGGAAACGC GCTACTATAA TCAGCAGGGT AAGGATTGTT TCCTGGGGGT TCTCCGGGCA
ACGGCCAGGT TCACGCTGCA ATGA
 
Protein sequence
MNNYIPINKA VNPVYIYQGN EPLISKTLGV DAMTFNFDGS ACEHATCTGA PEMFIKVEED 
GLYYFGVEAD DTGSLTIAGE QAIKKDGIPP NGKLNIETDS RYLKAGYYKV ALSYTNNAYT
PVSNNAIAFN VTMDREPIQA GKYEGNSTMK REFSSSPKIK LWTIEKESAI TCEPSKEVGL
EFEKPDPVYL EGNGDCNVSS LRPKICVTVC KDESENVWRC RVVSVSAGAK LTMYEGIYVN
PYVDPPLNEE EATEAVNEMN GYQSRAEVGT WHTPQASLAH EEHHRRQWED AYKFYWKDSK
IQEMLENQTI SCDKEPNMDE AVKFMQALAN EMALDLWKET SDYVLALPDD ANDRPYCAGQ
EVLNEATAYV IRLADAMGWN NVPRFITKPG TIEPPCFMPP VSEGKTRSMA IAEESTPLQI
SIVDTSQFTE GKITVRFCNE GNEPVRIPDE INDETSDFFF VTVLRTQQGK MRVLNREIGT
MTFQRPLNYR ELAPGQEYSV TIPACLDEVD LEGWKQCSCE LETRYYNQQG KDCFLGVLRA
TARFTLQ
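
The relationship between the gene sequence and the protein sequence above can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as table 1 and differs only in which codons may act as starts), strips the spaces and line breaks used in the listing, and translates up to the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the standard NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and the final line of the gene sequence.
print(translate("ATGAACAATT ATATACCAAT CAACAAGGCA"))  # MNNYIPINKA
print(translate("ACGGCCAGGT TCACGCTGCA ATGA"))        # TARFTLQ
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison against the listed protein sequence and the GC content field.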