Gene Information |
Locus tag | Amuc_0687 |
Symbol | |
ID | 6273928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 809148 |
End bp | 812282 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612739 |
Product | outer membrane autotransporter barrel domain protein |
Protein accession | YP_001877305 |
Protein GI | 187735193 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCTGG CGGCTGTTAT CGCCTGCCTT GGCAGTTTCT CCGTTGCTAC GGCAGCTGAC TATGCAGTCA ACAGTGCCGA TGAATTCGTT ACGGCCTGGA ACCAGGCTGC CGCTTCCAAT GAAGCTTCAA CCATCACGAT TACCGTACCT TCCGGCTCCG ATAATATTAC GCTGACCCAA GAGCAACGAG CCCAGTTGAA CGCCATCTCC GGAACGGGCA ACGTCACCAT TCAAATGACG GACGCCAGCA ACAAACTGGT TAATTTCAAT TACGATCTGG TTAACAACCA AGTCAGTTTC AACGACGTCA CCTTGAGTGA ACCTCTGGGA AGCGACAATG ACATTGTGGT GACGAATGCT TCAAATACCA CAACTATTGA TGGCAGGAAT ATTGCCATCG AAGGTACGAA TGATGCTGCC AATCCCAAGA CGGTGGGAGC CACCGTCACC TCCACGAACG GACAGGTAGG CATCGGCGAC AATGTTTCCA TGGAAAAAGC CGTTACCGTC ACCGGCCAGG CAGCTCAAAC CGTAGATCCG GCAACGCCGC CGACCGGCCA AGCCTATACC AAGACGACGT ACAATACTTA CAATAATTCC AACGAACAGG CCGTGGTCCT GGGCAACAAC GTCACGATGC AGGATACCGT AACGGCGACA GGCCAAATCG TAAGCGATCC GGCATCCAAG GTTGCGCTGA ACGGAGATGT GACCTCCACA GCGGGAATCG GAACCGTAAC TACGGAACAG TTTGACGCCT CCAACGCCCA GACCGGCAAA ACCGTCATCG ACTCTATGGA TGACAGCGTC TCCGGCGGCA TCATTCTTGG AGAAACCACA GCCGGGGCCG TCACCCTCAA AACTGAGGGC GGCAATATTT CCCTGGGAGA CAATTCCGTC CTGGATGGCA CTACTGTTTC CGCAGAATCC GCCGACCTGA ACAAAACCGT TTACAGCAAT GACGGCGGCA GCTGGAATAC CGTCGCAAGC ACTGAAAAGC TGGGCACCAT CGAAGGCAGC GTCACGCTGG GTGAAAATAC TACCGTCAAG GGAAATTCCA CCCTGACAGC GGATGACAAT ATCGCCATCG GCAACAACAG CGTCATCACC GGCAATACCG CCGCTGACGG CGTGATCTCC GCAGGTGGCC AGATTTCCAT CGGGGACGGT ACCCAGGTCT TGGATAACAC CGCCACCAGC GACGGCAAAG CCGCCATCAA CCTGGCGGAC GGGCAAACCC TGTACATCGG CTCCGGAGCA ATCCTTTCCG GCAATACTTC CAACGGCGTT TCCGGTTCCG TGCAGGCTGG CCAAAACACC CAGATCAACG TTTATACGGA CGCCACCGCC TATACCTTCA TCAATGACGG CATCTCCACC ACGGCCGCCG CTTCCACGGC TGACGCCGCT CCGCTGGCTG CTGACCAGAC CGTCATGACG AAGACCGGAG CGGGAACTCT CGTTTACGGC GGCACAGGAA CCACCGATAC GTTCGGAGGA ACCTACCAGC AGCTTGAAGG CAACCTGATC ATCGGCCATG CCACCTTCGG CGCCGTTGAC TCCGCCACCG GAGAAGTCGG TCCGCTCCAG TCCATTGACG GCGCCGTAAT GGGGACGGAC GATACTGTTT ACGACATTCG GACGGGACAG GTGACCCTGG CGAAGAATTC CACCATGAAG GGAGCTTCCG CCACTTTCGG AGGCGATTCT ACCTTGCTGC TGAGCGACGG GTCCGTACTG GACTTCGGCA CTCCCGCCAC GTTCCAGGAT AATTCCCGCG TAGGCATCCA GGTATCCGAC 
GCCAGCGGCA ATCCCGTTCC GCTCGCCCAA CTCCGCAAGG GGACGGAAAG CGTGACGGTC ACGCTGAACG GCACGGACAT TTCCGGACGC CTGCTGAACA ACGTATTTCT GAGCACCACC ATGGCTCCGG GAACGGCGGA AGGCACCACC ACCATCACCC AGGATATGAA GGGCATTGAC GGCCCCATGT CCGGCTACAA CGGCAACGTT TACACCGTAG CGGCCGCTCT GGAAAACAAC CGTCTGAATG TGGCCGCCGG TTCTCCCGCC GCCCAGTTCT ATGAAAACCT GTTCCGGGCC ACGAGCGCGG ATGAAGCGGC CCGCATCATC CAGTCCGTCA GCGGTGAACA CGTGGTGAAC TTCACCTGGG CAGCCAGCCG CACCGTGCGG AACTTCGCCG ACCTGGGGCG CATTCAATCC GCCGCCTCCA TGGCACGCCA GACGGAAGAC ACCGTTGAAG TAGTGGACGC CAAGGGTTCT CCGATCGCCC GCAAGACAAT CGCCAGAGGC AACGGCAATA TCTGGGTGGG CGGCATGGGA ATCTGGGATG ACCAAGATGC CCGCGACGGT GTTTCCGGGT ATAAGTATAA TGCCGGCGGT TATGCCGTAG GCATTGACTA CAAAGCCGCT CAAGGTTCCC TGATCGGCAT CGCCGCCGGT CAAAGCTTCG GCAGCTTCAA GGATAAGACA GGCATCGGCG CCGACTACGA CGTCGATTCC TTCCTGGCCA TGATTTATGG ACGCATGCAT CCCTTCAGGG ACAGCAAGTT CACTCTGGAC GGCTACGGCG CCTACGGACG TTCCCGTTTC AAAGGTGATT CCTACATCAT GGGGTCTGCC GCCAACGGCA GAGCAGATAC GGACACCTTC AGCGGCGCCC TTTACGGCAC CTGGACGGAA CGTTTCGCAC TCGGCAGGGC CTTTATAACA CCTTACACGG GTATTGAGTT CATGACTTCC GAACTCAAGG GATTCTCTGA AAGCGGACCT TATGGGCGCA CCTTCGGCCA CGCCCGGGCC CAGAACTGGA CCATTCCTGT CGGCATTACG ATCGCCCGCG CCTACCAGAC GGACGGAGGC ACCACCATCA CCCCGGCCTT GACGGTAGCC GTATCCCAGG ATGTGAGCCG CATGAATCCG AAATCCAATG TCGCCGGCCC CCTCGGAACC TGGAACGCCC GCGGCGTCAA TGTTGGCCGC ACCGCATTCC GTTTGAATGC CGGCATTGAT GTGCTCTTCT CCAACCAGTG GGGAGCACGC GTCTGCTACC AGTTCGAGAC CCGCAACAAG CTGACCGCCC ACGGTATCAA CGGCGCCCTC AGCTATACGT TCTAA
|
Protein sequence | MLLAAVIACL GSFSVATAAD YAVNSADEFV TAWNQAAASN EASTITITVP SGSDNITLTQ EQRAQLNAIS GTGNVTIQMT DASNKLVNFN YDLVNNQVSF NDVTLSEPLG SDNDIVVTNA SNTTTIDGRN IAIEGTNDAA NPKTVGATVT STNGQVGIGD NVSMEKAVTV TGQAAQTVDP ATPPTGQAYT KTTYNTYNNS NEQAVVLGNN VTMQDTVTAT GQIVSDPASK VALNGDVTST AGIGTVTTEQ FDASNAQTGK TVIDSMDDSV SGGIILGETT AGAVTLKTEG GNISLGDNSV LDGTTVSAES ADLNKTVYSN DGGSWNTVAS TEKLGTIEGS VTLGENTTVK GNSTLTADDN IAIGNNSVIT GNTAADGVIS AGGQISIGDG TQVLDNTATS DGKAAINLAD GQTLYIGSGA ILSGNTSNGV SGSVQAGQNT QINVYTDATA YTFINDGIST TAAASTADAA PLAADQTVMT KTGAGTLVYG GTGTTDTFGG TYQQLEGNLI IGHATFGAVD SATGEVGPLQ SIDGAVMGTD DTVYDIRTGQ VTLAKNSTMK GASATFGGDS TLLLSDGSVL DFGTPATFQD NSRVGIQVSD ASGNPVPLAQ LRKGTESVTV TLNGTDISGR LLNNVFLSTT MAPGTAEGTT TITQDMKGID GPMSGYNGNV YTVAAALENN RLNVAAGSPA AQFYENLFRA TSADEAARII QSVSGEHVVN FTWAASRTVR NFADLGRIQS AASMARQTED TVEVVDAKGS PIARKTIARG NGNIWVGGMG IWDDQDARDG VSGYKYNAGG YAVGIDYKAA QGSLIGIAAG QSFGSFKDKT GIGADYDVDS FLAMIYGRMH PFRDSKFTLD GYGAYGRSRF KGDSYIMGSA ANGRADTDTF SGALYGTWTE RFALGRAFIT PYTGIEFMTS ELKGFSESGP YGRTFGHARA QNWTIPVGIT IARAYQTDGG TTITPALTVA VSQDVSRMNP KSNVAGPLGT WNARGVNVGR TAFRLNAGID VLFSNQWGAR VCYQFETRNK LTAHGINGAL SYTF
|
| |
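The coordinate, length, and GC content fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes its terminal stop codon (the listed sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_content` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length in aa for an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


def gc_content(seq: str) -> float:
    """Fraction of G and C bases; spaces and line breaks in seq are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Gene Information table for Amuc_0687.
length = gene_length(809148, 812282)
print(length)                  # 3135
print(protein_length(length))  # 1044
```

Passing the full Gene sequence field to `gc_content` (the 10-base spacing does not matter) gives a fraction that can be compared against the reported GC content of 58%.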