Gene Information |
Locus tag | Amuc_2112 |
Symbol | |
ID | 6275496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2571886 |
End bp | 2574795 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642614174 |
Product | hypothetical protein |
Protein accession | YP_001878702 |
Protein GI | 187736590 |
COG category | |
COG ID | |
TIGRFAM ID | |
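
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check and not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2571886
end_bp = 2574795
reported_gene_length = 2910      # bp
reported_protein_length = 969    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
if gene_length % 3 != 0:
    raise ValueError("gene length is not a multiple of 3")

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions, the computed lengths agree with the reported Gene Length and Protein Length fields.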
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.610034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.144044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATCC CTCCTCCCCC TGCCCTGCCG CCCCATTTCA ACATAGGCGG CTACAAAATC ACGGCGCTTG TAGAAAGCGG TCCGGACTAT CATCTGTACC AGGCTCTTTC CCCGGAAGGC CATGCCGTCC TGATACGCGA ATTCTGCCCC CGGGGCCTCG TTACGCGTGA CCTGGCCAGC GGAGAACTGG CTGTCTCTCC GGAAAATGAA TCCCAATTCG CCCAGGCCCG GGAAGCTTTT GAAACCCAAT ACGCAGCCAA TGCAGAAGGC AAGCTGAGGG GATTCGGCAC CGTGCTCTTC CTTTACCCGC TTTCTCCGGC GCAGCCCCAG CCGGCTGCAG CGCATGCCCA GGCTCTCCGC CCCGCAAAAA AACCGCAGCA ACCGCAACTG CGGAAACCCG TAGTCGGCGC CGCCATCCCC GGAACGCCGC TGCCGCGGGT AAAGCACTCC GGAGGGTTTC CGGTCATCCC GGTCATTGTG ACCGGGATGC TCGCCCTCTT CGGATTTCTG GGCTACCAGA TACTCAAGGA TAAAGAAGAA CCCGTCGCTA AAGCCGTTAC CGTGCCCGTA CCGGCTCCGC CCAAACCCAA ACCCAAGCCC GCTCCGCCAA AACCGGAGCC CGTGGTGGTC ACTCCTGAAC CGGAACCCGT GGTGGTCGCT CCCGAACCGG AACCGGAGCC TGAACCGGAG CCGCCTGCTC CCGACCTTTC CCCCTCCCCG GAAGTCATTG CCATGGAAAA GGCTTTACGG GAGGAAGCTA TCGCGTCCAA AGGGAAATTT TCTGAAAAAC TGTTGAATAA ATATCCCCAT TACGCGGAAG CCTACGTACG GGATTACGTG AAAAAACGCG GAGGCAGTTT TTCTCCGGAT TTTGAAAAAT GGCTGAAAAA CACGAAAAAC AATCGTGAAG TCTTCGCCAT GTTCTTCCCG CCGGACCCCA GCGTCGCCAC CAATGTGGCT TTCATGATTG ATGAACTGGG GCTGGAAACA ACGGAAAAAT ATGACCAGCT CGTTCTGGCG TTCGCAGTAG GACGCCGCGA ATTCGGCATG GGAGCCTTCG ACCTTACCCA TCAGGGCCGT TACGTTGATG CCCTGGGCAA GCTGAATGAT CTGAGATCCT CCGGAGTCAT GCCTCCGCCA GCGGACCTGT ACTGCGCCGG GAAACCGCCC GTCAACTGGT ATGGGAACGC CCCCCGCACC GTGGATGAGG AATGCTACAA AAAGGTGGAA GCCTATCTGG ACCGCAAAAA AATTACCCCC AAGCAGGCCT GGCTCAGGAA ATACCCCACC GTCTCGGAAA TTGGGGACTC CGCCATTACG GAAGACAACC TGGCCGGCTT TCTGCACGAA TACATGTACC GCCACGGCCA GCTGAGGCGC AAGAGGGATC CCTTCCCCAC TCCGGTGGAA TTCTTTTCCT ACCTGGTGGA TAAATATGAA CATTGCGGTG ATTTGCGCGA CGTGGACCGC AAGCGTGTGG AATGGACCGG CGTCTCCCTG GAAGGAACGC CCTGGCCGGC CATGATGGCC CTTTCGGAAA CGCGCCCTCT GCGTGAGTGC GACAGCGTGT GGGAACGCTA CATGGGCCAG CGGGGCCCGA CCCGCCTGTG GCTGTACGGC CCCTACCGGG CGGATGACGA TAAGGAACCG CCCATCCTGT TCAGCTTTGA CCCGGATCCG GAATGGTCCA GGGAATCCAA TGAACGCAAG CTCCATGAAG GCGGCGTGTG CGGCACCATG TCCCTTATCT CCCGCAATTC CCAGATCGCG CGCGGCATTC CCGCCGCCCC CGCCGGGCAG 
CCTGGCCACG GCAACCTGAT GACCACCCAT TTCACCGGCA ACGGCTGCTG GCTGAGCGTG GGACAAAGTG TGGACACCCT GAAAGCCACC ACGGGATTCT GGTACTTCCG GGATTCCAAC GCACCGCGCA CTGGCAATGC GGAATACCAG TCTGGACTGG CCCTGTCCAT GAATATCGAC TATGAAAAGT TCATCGACAG CCGATTCGCC ATGAACATTT ACAAACTGGC GGCCACCGGC TCCTCCACGG AAGAGACGGC CGACCCTTCC GCCACGCTCC CCAAGGAATT CACGCAGACC GCCATGAGGA CCGTGCTCAA GGCCAACCCG TTCTACACGG AAGCCTGGTA CACGCTCTTC AAGCAGGAAC CCCAGGACCT CATGGGAGCC ACCAAGATGG TGGATGAAGT GAGGGAGGCC CTGCCGGACG GCATGGGCAT CAGAAAACTC TGGAAAACGC GCAAATACGT TTCCTCCGTA GGCCGCGGCG ACAAAAACGG CAAGGACATG CTGGCCAACC ATGCCAGGGA ATACGTCAAT GTGCTCTGCT CCGTGATTCT GGAAAATGCC CTGAAACAGG AATATGACTA TAAGACCTTC CAGTGGGCCG AACTCATGTC CTGGCTCAAG TCTGAATCCA AACGCAACTC CTACCCGGAG CCTCAGGCCG CCTATCAGAT AGCGTATGCC AAGGCCCAGG GCACGGACAG GCTCAAAAGA ACCGTAGACA GGGGATTCAA GAAAGCCCTC AATTTCTACC GGGACGACAG CAACGCCCTG AAGGAACCCA AGGATGTGGA TCAGGAGGAA ATGTCCTTTT CCCTGGCCGC CCTGTGCCAG GCCCTGCCCA AGGAAGAACT GATCCCCTGG ATGAAAAACA TGCTGGACAC CTGTCCGGAC GGGTTCAAAT ACAAGCCCAA AAATAAGAAG GAAACGAAAA TCCACCCCTT CTACGACGCC CTGACGAAAA ACTACATGTC TCTGGCTGAC GGTTCGGAAA AATCCCGCGT CAAGTCGGAA ATGAAAGAGG CTTCCGACAG AATTCTGGAG CTTTCCCAGG ACAAGAAAGG CGATGGGAAT GGGTCCTCCG GACGCCGCCG GAGACGCTGA
|
Protein sequence | MTIPPPPALP PHFNIGGYKI TALVESGPDY HLYQALSPEG HAVLIREFCP RGLVTRDLAS GELAVSPENE SQFAQAREAF ETQYAANAEG KLRGFGTVLF LYPLSPAQPQ PAAAHAQALR PAKKPQQPQL RKPVVGAAIP GTPLPRVKHS GGFPVIPVIV TGMLALFGFL GYQILKDKEE PVAKAVTVPV PAPPKPKPKP APPKPEPVVV TPEPEPVVVA PEPEPEPEPE PPAPDLSPSP EVIAMEKALR EEAIASKGKF SEKLLNKYPH YAEAYVRDYV KKRGGSFSPD FEKWLKNTKN NREVFAMFFP PDPSVATNVA FMIDELGLET TEKYDQLVLA FAVGRREFGM GAFDLTHQGR YVDALGKLND LRSSGVMPPP ADLYCAGKPP VNWYGNAPRT VDEECYKKVE AYLDRKKITP KQAWLRKYPT VSEIGDSAIT EDNLAGFLHE YMYRHGQLRR KRDPFPTPVE FFSYLVDKYE HCGDLRDVDR KRVEWTGVSL EGTPWPAMMA LSETRPLREC DSVWERYMGQ RGPTRLWLYG PYRADDDKEP PILFSFDPDP EWSRESNERK LHEGGVCGTM SLISRNSQIA RGIPAAPAGQ PGHGNLMTTH FTGNGCWLSV GQSVDTLKAT TGFWYFRDSN APRTGNAEYQ SGLALSMNID YEKFIDSRFA MNIYKLAATG SSTEETADPS ATLPKEFTQT AMRTVLKANP FYTEAWYTLF KQEPQDLMGA TKMVDEVREA LPDGMGIRKL WKTRKYVSSV GRGDKNGKDM LANHAREYVN VLCSVILENA LKQEYDYKTF QWAELMSWLK SESKRNSYPE PQAAYQIAYA KAQGTDRLKR TVDRGFKKAL NFYRDDSNAL KEPKDVDQEE MSFSLAALCQ ALPKEELIPW MKNMLDTCPD GFKYKPKNKK ETKIHPFYDA LTKNYMSLAD GSEKSRVKSE MKEASDRILE LSQDKKGDGN GSSGRRRRR
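
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way such a string might be normalized and its GC percentage computed, for comparison with the GC content field. It is an illustration only: the function names `normalize` and `gc_percent` are hypothetical, and the sample input is just the first two blocks of the gene sequence, not the full gene.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = normalize(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence, used only as sample input.
sample = "ATGACAATCC CTCCTCCCCC"
print(len(normalize(sample)), gc_percent(sample))
```

Passing the full gene sequence, with its blocks and line break left in place, would give a figure to compare with the reported GC content.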