Gene Amuc_2112 details


Gene Information

Locus tag: Amuc_2112
Symbol:
ID: 6275496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2571886
End bp: 2574795
Gene Length: 2910 bp
Protein Length: 969 aa
Translation table: 11
GC content: 58%
IMG OID: 642614174
Product: hypothetical protein
Protein accession: YP_001878702
Protein GI: 187736590
COG category:
COG ID:
TIGRFAM ID:
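
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2571886, 2574795)
    print(length, "bp")                    # value to compare with "Gene Length"
    print(protein_length(length), "aa")    # value to compare with "Protein Length"
```

Under these assumptions the coordinates give 2910 bp and 969 aa, matching the Gene Length and Protein Length fields.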


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.610034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.144044
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCC CTCCTCCCCC TGCCCTGCCG CCCCATTTCA ACATAGGCGG CTACAAAATC 
ACGGCGCTTG TAGAAAGCGG TCCGGACTAT CATCTGTACC AGGCTCTTTC CCCGGAAGGC
CATGCCGTCC TGATACGCGA ATTCTGCCCC CGGGGCCTCG TTACGCGTGA CCTGGCCAGC
GGAGAACTGG CTGTCTCTCC GGAAAATGAA TCCCAATTCG CCCAGGCCCG GGAAGCTTTT
GAAACCCAAT ACGCAGCCAA TGCAGAAGGC AAGCTGAGGG GATTCGGCAC CGTGCTCTTC
CTTTACCCGC TTTCTCCGGC GCAGCCCCAG CCGGCTGCAG CGCATGCCCA GGCTCTCCGC
CCCGCAAAAA AACCGCAGCA ACCGCAACTG CGGAAACCCG TAGTCGGCGC CGCCATCCCC
GGAACGCCGC TGCCGCGGGT AAAGCACTCC GGAGGGTTTC CGGTCATCCC GGTCATTGTG
ACCGGGATGC TCGCCCTCTT CGGATTTCTG GGCTACCAGA TACTCAAGGA TAAAGAAGAA
CCCGTCGCTA AAGCCGTTAC CGTGCCCGTA CCGGCTCCGC CCAAACCCAA ACCCAAGCCC
GCTCCGCCAA AACCGGAGCC CGTGGTGGTC ACTCCTGAAC CGGAACCCGT GGTGGTCGCT
CCCGAACCGG AACCGGAGCC TGAACCGGAG CCGCCTGCTC CCGACCTTTC CCCCTCCCCG
GAAGTCATTG CCATGGAAAA GGCTTTACGG GAGGAAGCTA TCGCGTCCAA AGGGAAATTT
TCTGAAAAAC TGTTGAATAA ATATCCCCAT TACGCGGAAG CCTACGTACG GGATTACGTG
AAAAAACGCG GAGGCAGTTT TTCTCCGGAT TTTGAAAAAT GGCTGAAAAA CACGAAAAAC
AATCGTGAAG TCTTCGCCAT GTTCTTCCCG CCGGACCCCA GCGTCGCCAC CAATGTGGCT
TTCATGATTG ATGAACTGGG GCTGGAAACA ACGGAAAAAT ATGACCAGCT CGTTCTGGCG
TTCGCAGTAG GACGCCGCGA ATTCGGCATG GGAGCCTTCG ACCTTACCCA TCAGGGCCGT
TACGTTGATG CCCTGGGCAA GCTGAATGAT CTGAGATCCT CCGGAGTCAT GCCTCCGCCA
GCGGACCTGT ACTGCGCCGG GAAACCGCCC GTCAACTGGT ATGGGAACGC CCCCCGCACC
GTGGATGAGG AATGCTACAA AAAGGTGGAA GCCTATCTGG ACCGCAAAAA AATTACCCCC
AAGCAGGCCT GGCTCAGGAA ATACCCCACC GTCTCGGAAA TTGGGGACTC CGCCATTACG
GAAGACAACC TGGCCGGCTT TCTGCACGAA TACATGTACC GCCACGGCCA GCTGAGGCGC
AAGAGGGATC CCTTCCCCAC TCCGGTGGAA TTCTTTTCCT ACCTGGTGGA TAAATATGAA
CATTGCGGTG ATTTGCGCGA CGTGGACCGC AAGCGTGTGG AATGGACCGG CGTCTCCCTG
GAAGGAACGC CCTGGCCGGC CATGATGGCC CTTTCGGAAA CGCGCCCTCT GCGTGAGTGC
GACAGCGTGT GGGAACGCTA CATGGGCCAG CGGGGCCCGA CCCGCCTGTG GCTGTACGGC
CCCTACCGGG CGGATGACGA TAAGGAACCG CCCATCCTGT TCAGCTTTGA CCCGGATCCG
GAATGGTCCA GGGAATCCAA TGAACGCAAG CTCCATGAAG GCGGCGTGTG CGGCACCATG
TCCCTTATCT CCCGCAATTC CCAGATCGCG CGCGGCATTC CCGCCGCCCC CGCCGGGCAG
CCTGGCCACG GCAACCTGAT GACCACCCAT TTCACCGGCA ACGGCTGCTG GCTGAGCGTG
GGACAAAGTG TGGACACCCT GAAAGCCACC ACGGGATTCT GGTACTTCCG GGATTCCAAC
GCACCGCGCA CTGGCAATGC GGAATACCAG TCTGGACTGG CCCTGTCCAT GAATATCGAC
TATGAAAAGT TCATCGACAG CCGATTCGCC ATGAACATTT ACAAACTGGC GGCCACCGGC
TCCTCCACGG AAGAGACGGC CGACCCTTCC GCCACGCTCC CCAAGGAATT CACGCAGACC
GCCATGAGGA CCGTGCTCAA GGCCAACCCG TTCTACACGG AAGCCTGGTA CACGCTCTTC
AAGCAGGAAC CCCAGGACCT CATGGGAGCC ACCAAGATGG TGGATGAAGT GAGGGAGGCC
CTGCCGGACG GCATGGGCAT CAGAAAACTC TGGAAAACGC GCAAATACGT TTCCTCCGTA
GGCCGCGGCG ACAAAAACGG CAAGGACATG CTGGCCAACC ATGCCAGGGA ATACGTCAAT
GTGCTCTGCT CCGTGATTCT GGAAAATGCC CTGAAACAGG AATATGACTA TAAGACCTTC
CAGTGGGCCG AACTCATGTC CTGGCTCAAG TCTGAATCCA AACGCAACTC CTACCCGGAG
CCTCAGGCCG CCTATCAGAT AGCGTATGCC AAGGCCCAGG GCACGGACAG GCTCAAAAGA
ACCGTAGACA GGGGATTCAA GAAAGCCCTC AATTTCTACC GGGACGACAG CAACGCCCTG
AAGGAACCCA AGGATGTGGA TCAGGAGGAA ATGTCCTTTT CCCTGGCCGC CCTGTGCCAG
GCCCTGCCCA AGGAAGAACT GATCCCCTGG ATGAAAAACA TGCTGGACAC CTGTCCGGAC
GGGTTCAAAT ACAAGCCCAA AAATAAGAAG GAAACGAAAA TCCACCCCTT CTACGACGCC
CTGACGAAAA ACTACATGTC TCTGGCTGAC GGTTCGGAAA AATCCCGCGT CAAGTCGGAA
ATGAAAGAGG CTTCCGACAG AATTCTGGAG CTTTCCCAGG ACAAGAAAGG CGATGGGAAT
GGGTCCTCCG GACGCCGCCG GAGACGCTGA
 
Protein sequence
MTIPPPPALP PHFNIGGYKI TALVESGPDY HLYQALSPEG HAVLIREFCP RGLVTRDLAS 
GELAVSPENE SQFAQAREAF ETQYAANAEG KLRGFGTVLF LYPLSPAQPQ PAAAHAQALR
PAKKPQQPQL RKPVVGAAIP GTPLPRVKHS GGFPVIPVIV TGMLALFGFL GYQILKDKEE
PVAKAVTVPV PAPPKPKPKP APPKPEPVVV TPEPEPVVVA PEPEPEPEPE PPAPDLSPSP
EVIAMEKALR EEAIASKGKF SEKLLNKYPH YAEAYVRDYV KKRGGSFSPD FEKWLKNTKN
NREVFAMFFP PDPSVATNVA FMIDELGLET TEKYDQLVLA FAVGRREFGM GAFDLTHQGR
YVDALGKLND LRSSGVMPPP ADLYCAGKPP VNWYGNAPRT VDEECYKKVE AYLDRKKITP
KQAWLRKYPT VSEIGDSAIT EDNLAGFLHE YMYRHGQLRR KRDPFPTPVE FFSYLVDKYE
HCGDLRDVDR KRVEWTGVSL EGTPWPAMMA LSETRPLREC DSVWERYMGQ RGPTRLWLYG
PYRADDDKEP PILFSFDPDP EWSRESNERK LHEGGVCGTM SLISRNSQIA RGIPAAPAGQ
PGHGNLMTTH FTGNGCWLSV GQSVDTLKAT TGFWYFRDSN APRTGNAEYQ SGLALSMNID
YEKFIDSRFA MNIYKLAATG SSTEETADPS ATLPKEFTQT AMRTVLKANP FYTEAWYTLF
KQEPQDLMGA TKMVDEVREA LPDGMGIRKL WKTRKYVSSV GRGDKNGKDM LANHAREYVN
VLCSVILENA LKQEYDYKTF QWAELMSWLK SESKRNSYPE PQAAYQIAYA KAQGTDRLKR
TVDRGFKKAL NFYRDDSNAL KEPKDVDQEE MSFSLAALCQ ALPKEELIPW MKNMLDTCPD
GFKYKPKNKK ETKIHPFYDA LTKNYMSLAD GSEKSRVKSE MKEASDRILE LSQDKKGDGN
GSSGRRRRR
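
The sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be stripped of whitespace, checked for GC content, and translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). All function names here are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G); translation table 11
# uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "ATGACAATCC CTCCTCCCCC TGCCCTGCCG CCCCATTTCA ACATAGGCGG CTACAAAATC"
    dna = clean(first_line)
    print(translate(dna))
    print(round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above once its spaces are removed, and `gc_fraction` can be compared against the reported GC content.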