Gene Amuc_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1823 
Symbol 
ID6275738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2213437 
End bp2215074 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content52% 
IMG OID642613887 
Producthypothetical protein 
Protein accessionYP_001878422 
Protein GI187736310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0024569 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT ATATACCACT GAACAAGGCG GTCGGTCCGA AGCTTATCTG CGACGGGGAG 
CAGGCTCTCA TCCCGCAGAA ACTCGGAGTT GACGCGATGG AGTTCAACTT TGACGGTTCG
GCGTGCGAAC ACGCCACTTG CGCCGGAGCG CCGGAAATGT TCATCAAGGT CGAGGAAGAC
GGCCTGTATT ATTTCGGCGT GGAAGCCGAC GACACAGGCA GCATCTCCAT TGCCAACGAG
AAACTTTGTG AGAAGGATGG AACGAAGCCC AACGGCAAGC TGAATCTGGC AACGGGTGCC
AGGTACCTGA AAGCTGGCTA TTACAAGGTG GCCTTGTCTT ACACGAATAA TGCCTACACC
CCCGTCAGCA ACAACGCTGC CGCGTTCAAT GTGACGATGG ACAGGGAACC CATTCAGGAA
GGGAAGTACG AAGGCAACTC GACGATGAAA CGCGAATTCT CACCGTCCCC CAAAATCAAA
TTGTGGACGA TTGAAAAGGA ATCCACCATT ACCTGTGAAG AGTCCAGGAA AGTGGAGGTG
ACGCTCGAAG AACCTGCTCC TGTCGTTGAA GAAAGCGAAG GCATATGCGA AACGTATATC
ATTCCCCCCC AGATTTTTGT AACGGCTTGC AAGGATGAAG TGCTCGACGA GTGGAGACTG
CGCGTGCAGC AAGTTTCCGC CGGTTCCAGG ATTCTCTTGC GCACGGGAGG ATACAGGGAT
GCCCTTAAAA ATCCCCCTGT CACCGAAGAA GAAGCTATAG ATGCTGTCAA TGAAATGAAC
CGTTACCAGT CTCATAGACT CGGCGCATGG CATACTCAGG AGGCATCCCT CGCCCACGAG
GAACACCACC GCCGAGAATT TAATGACGCC TTCCAATTCT ACTGGGACAA TTTGAGAATA
CAGGATACGC TTGAAATGAA GCATGAATCC TGTGAAAAAG TTCCGAAAAT GGAAGAGTTT
CTGGAAAATA TGCAGCCATT CGTTACCGAG TTGAGGAATA TGTATTCTGA CGCTGTCTAT
AACTATGTCT TGGTATTGCC GGATGATGCC AATGATCGTC CCTACTGTGC CGGACAAAAA
GTTCTGAACG AGGCGACCCG GAAGATTATT GCCCAGGCAA AGACCAATGG ATGGAGCAGG
GTACCTGATG AAGTCACGGA ACCGGGAACG ATTGAACCGC CCTGTTTCCT TCCGCCCGTC
AACGGCATGT ATGCCCGCAG CGTGGCTGCT GCGGGGGAAC AGGCTCCCTT GACCCTCTCC
ATTGCGGATA CCTCCCGATT CAGGGAGGGC AGAATCACAG TCCGCTTCCG CAACGAAGGA
AACGAGCCTG TCCGGATTCC GGACGAAATC AACGACGAAA CAGCCGATTA TTTCTTCCTG
ACGGTGTTGA GGACGGAACG GGGGAACTTC CGCGTCCTGA ACAGGAAAGT CGGGAAAATA
ACCTTCAATC GTTCCTTGAA CTACCGGGAA CTGGCCCCCG GCCAGGAATA CAGCGTCACG
ATTCCGGTAT GCCTGGATGA AGTCGATCTG GAAGGCTGGA AACAATGTTC TTGTGAACTG
GAAACGCGCT ACTATAATCA GCAGGGTAAG GATTGTTTCC TGGGGAAGCT CCGGGCGACG
GCCAAACTCG CGCTGTGA
 
Protein sequence
MNNYIPLNKA VGPKLICDGE QALIPQKLGV DAMEFNFDGS ACEHATCAGA PEMFIKVEED 
GLYYFGVEAD DTGSISIANE KLCEKDGTKP NGKLNLATGA RYLKAGYYKV ALSYTNNAYT
PVSNNAAAFN VTMDREPIQE GKYEGNSTMK REFSPSPKIK LWTIEKESTI TCEESRKVEV
TLEEPAPVVE ESEGICETYI IPPQIFVTAC KDEVLDEWRL RVQQVSAGSR ILLRTGGYRD
ALKNPPVTEE EAIDAVNEMN RYQSHRLGAW HTQEASLAHE EHHRREFNDA FQFYWDNLRI
QDTLEMKHES CEKVPKMEEF LENMQPFVTE LRNMYSDAVY NYVLVLPDDA NDRPYCAGQK
VLNEATRKII AQAKTNGWSR VPDEVTEPGT IEPPCFLPPV NGMYARSVAA AGEQAPLTLS
IADTSRFREG RITVRFRNEG NEPVRIPDEI NDETADYFFL TVLRTERGNF RVLNRKVGKI
TFNRSLNYRE LAPGQEYSVT IPVCLDEVDL EGWKQCSCEL ETRYYNQQGK DCFLGKLRAT
AKLAL