Gene Amuc_2100 details


Gene Information

Locus tag: Amuc_2100
Symbol:
ID: 6274555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2553762
End bp: 2554910
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 57%
IMG OID: 642614162
Product: aminoglycoside phosphotransferase
Protein accession: YP_001878690
Protein GI: 187736578
COG category:
COG ID:
TIGRFAM ID:
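
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts the stop codon. Both are assumptions, not something the record states. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Amuc_2100.
length = gene_length_bp(2553762, 2554910)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Under these assumptions the result matches the listed gene length of 1149 bp and protein length of 382 aa.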


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.624576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGT CCGCCGTTTT CCACACTTCC TCCCTTCAGA AGCAAGTGGC CGCCCTGAGC 
GACGCATTCG CCATCGCAGG AGAATTCCTC CATTGCGACG TCATTAACAG CGGCCATATC
AACATGACCT TCCGGGCCAC CTACAGGAAG CCGGACGGCA CCACCCGCCG CTACATCTTC
CAGCGCGTGA ACGATGCCGT GTTCCCATGC CCCAGGGATG TCATGCACAA CGTGGAAAAG
GTGACCAACC ATATCCGCTG GAAAATGTTC CGGGTGCTGA AAACGCCCTT CCGCCAGACG
CTGAACCTGT ACTCCGCGCG GGGCGGCCGC AAATACCTGG AAATTCCCGG TTCCGGCTTC
TGGCGCTGCT ACAACTGCAT AGAAAACACC CACACGTTCG ACGTAGCGGA CCATCCCCGC
CAGGCTTACG AAGCAGCCCG CGCTTTCGGC GCTTTCCAAC AGCTCCTGTG TGACATGAAT
CCGGAGGACA TCCATGAAAC CATTCCGTTC TTCCACCATA CCCGCAGGCG CTTTGACCAT
TTGGAAAAAG CCGCGGCAGC AGACTCCCAC GGACGTCTGA ATACCTGCCG CAAGGAGCTG
GACTTCATCC GCCGCCGTGA ACGTTATGTG GACGTGCTGC TGGATCTCCA GGAACGGGGG
GAGCTCCCCG TCAGAATCGT CCACAACGAC ACGAAAATCA ACAACGTGAT GCTGGACAGG
GAGACGGACA AGGCTGTCTG CGTCATTGAC CTGGACACCG TCATGCCCGG GAGCGTCCTG
TACGACTTCG GAGACATGGT GCGCACCATG ACCTCCCCTG CGGCGGAAGA TGAAGAAAAT
CTGGATAAAA CCTTCCTGCG CATGCCCATG TTCGAGGCCG TCGTCAAGGG ATACCTGGAG
GCCTCCAGAG AATTCATCAC GCCGCAGGAA GTCTCCAAAC TCGCTTTTTC CGGTCTGCTT
ATCACGCTGG AAACGGGAAT CCGCTTCCTG ACGGACTACC TGGAAGGGGA CGTTTATTTC
AAAACGAAAA AAGAACGGCA CAATCTGCAC CGTGCCCGCA CCCAGCTCAG GCTGGTGGAA
AGCATGGAAG AGCAAATGCC TGAAATGGAA GAATGCGTCC GGAAATGCTT CCAGACTGTT
AACGGCTGA
 
Protein sequence
MPLSAVFHTS SLQKQVAALS DAFAIAGEFL HCDVINSGHI NMTFRATYRK PDGTTRRYIF 
QRVNDAVFPC PRDVMHNVEK VTNHIRWKMF RVLKTPFRQT LNLYSARGGR KYLEIPGSGF
WRCYNCIENT HTFDVADHPR QAYEAARAFG AFQQLLCDMN PEDIHETIPF FHHTRRRFDH
LEKAAAADSH GRLNTCRKEL DFIRRRERYV DVLLDLQERG ELPVRIVHND TKINNVMLDR
ETDKAVCVID LDTVMPGSVL YDFGDMVRTM TSPAAEDEEN LDKTFLRMPM FEAVVKGYLE
ASREFITPQE VSKLAFSGLL ITLETGIRFL TDYLEGDVYF KTKKERHNLH RARTQLRLVE
SMEEQMPEME ECVRKCFQTV NG
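
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content without third-party libraries. The helper names (clean, translate, gc_fraction) are hypothetical. The codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two differ only in permitted start codons). Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCCTGT CCGCCGTTTT CCACACTTCC TCCCTTCAGA AGCAAGTGGC CGCCCTGAGC"
last_line = "AACGGCTGA"

print(translate(first_line))  # MPLSAVFHTSSLQKQVAALS
print(translate(last_line))   # NG
```

The two translated fragments agree with the start (MPLSAVFHTS SLQKQVAALS) and end (NG) of the listed protein sequence. Passing the full gene sequence to gc_fraction would give a value to compare against the 57% GC content in the record.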