Gene Amuc_0687 details


Gene Information

Locus tag: Amuc_0687
Symbol:
ID: 6273928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 809148
End bp: 812282
Gene Length: 3135 bp
Protein Length: 1044 aa
Translation table: 11
GC content: 58%
IMG OID: 642612739
Product: outer membrane autotransporter barrel domain protein
Protein accession: YP_001877305
Protein GI: 187735193
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
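
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the Gene Length includes the terminal stop codon.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(809148, 812282))  # 3135
print(protein_length(3135))         # 1044
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.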


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTGG CGGCTGTTAT CGCCTGCCTT GGCAGTTTCT CCGTTGCTAC GGCAGCTGAC 
TATGCAGTCA ACAGTGCCGA TGAATTCGTT ACGGCCTGGA ACCAGGCTGC CGCTTCCAAT
GAAGCTTCAA CCATCACGAT TACCGTACCT TCCGGCTCCG ATAATATTAC GCTGACCCAA
GAGCAACGAG CCCAGTTGAA CGCCATCTCC GGAACGGGCA ACGTCACCAT TCAAATGACG
GACGCCAGCA ACAAACTGGT TAATTTCAAT TACGATCTGG TTAACAACCA AGTCAGTTTC
AACGACGTCA CCTTGAGTGA ACCTCTGGGA AGCGACAATG ACATTGTGGT GACGAATGCT
TCAAATACCA CAACTATTGA TGGCAGGAAT ATTGCCATCG AAGGTACGAA TGATGCTGCC
AATCCCAAGA CGGTGGGAGC CACCGTCACC TCCACGAACG GACAGGTAGG CATCGGCGAC
AATGTTTCCA TGGAAAAAGC CGTTACCGTC ACCGGCCAGG CAGCTCAAAC CGTAGATCCG
GCAACGCCGC CGACCGGCCA AGCCTATACC AAGACGACGT ACAATACTTA CAATAATTCC
AACGAACAGG CCGTGGTCCT GGGCAACAAC GTCACGATGC AGGATACCGT AACGGCGACA
GGCCAAATCG TAAGCGATCC GGCATCCAAG GTTGCGCTGA ACGGAGATGT GACCTCCACA
GCGGGAATCG GAACCGTAAC TACGGAACAG TTTGACGCCT CCAACGCCCA GACCGGCAAA
ACCGTCATCG ACTCTATGGA TGACAGCGTC TCCGGCGGCA TCATTCTTGG AGAAACCACA
GCCGGGGCCG TCACCCTCAA AACTGAGGGC GGCAATATTT CCCTGGGAGA CAATTCCGTC
CTGGATGGCA CTACTGTTTC CGCAGAATCC GCCGACCTGA ACAAAACCGT TTACAGCAAT
GACGGCGGCA GCTGGAATAC CGTCGCAAGC ACTGAAAAGC TGGGCACCAT CGAAGGCAGC
GTCACGCTGG GTGAAAATAC TACCGTCAAG GGAAATTCCA CCCTGACAGC GGATGACAAT
ATCGCCATCG GCAACAACAG CGTCATCACC GGCAATACCG CCGCTGACGG CGTGATCTCC
GCAGGTGGCC AGATTTCCAT CGGGGACGGT ACCCAGGTCT TGGATAACAC CGCCACCAGC
GACGGCAAAG CCGCCATCAA CCTGGCGGAC GGGCAAACCC TGTACATCGG CTCCGGAGCA
ATCCTTTCCG GCAATACTTC CAACGGCGTT TCCGGTTCCG TGCAGGCTGG CCAAAACACC
CAGATCAACG TTTATACGGA CGCCACCGCC TATACCTTCA TCAATGACGG CATCTCCACC
ACGGCCGCCG CTTCCACGGC TGACGCCGCT CCGCTGGCTG CTGACCAGAC CGTCATGACG
AAGACCGGAG CGGGAACTCT CGTTTACGGC GGCACAGGAA CCACCGATAC GTTCGGAGGA
ACCTACCAGC AGCTTGAAGG CAACCTGATC ATCGGCCATG CCACCTTCGG CGCCGTTGAC
TCCGCCACCG GAGAAGTCGG TCCGCTCCAG TCCATTGACG GCGCCGTAAT GGGGACGGAC
GATACTGTTT ACGACATTCG GACGGGACAG GTGACCCTGG CGAAGAATTC CACCATGAAG
GGAGCTTCCG CCACTTTCGG AGGCGATTCT ACCTTGCTGC TGAGCGACGG GTCCGTACTG
GACTTCGGCA CTCCCGCCAC GTTCCAGGAT AATTCCCGCG TAGGCATCCA GGTATCCGAC
GCCAGCGGCA ATCCCGTTCC GCTCGCCCAA CTCCGCAAGG GGACGGAAAG CGTGACGGTC
ACGCTGAACG GCACGGACAT TTCCGGACGC CTGCTGAACA ACGTATTTCT GAGCACCACC
ATGGCTCCGG GAACGGCGGA AGGCACCACC ACCATCACCC AGGATATGAA GGGCATTGAC
GGCCCCATGT CCGGCTACAA CGGCAACGTT TACACCGTAG CGGCCGCTCT GGAAAACAAC
CGTCTGAATG TGGCCGCCGG TTCTCCCGCC GCCCAGTTCT ATGAAAACCT GTTCCGGGCC
ACGAGCGCGG ATGAAGCGGC CCGCATCATC CAGTCCGTCA GCGGTGAACA CGTGGTGAAC
TTCACCTGGG CAGCCAGCCG CACCGTGCGG AACTTCGCCG ACCTGGGGCG CATTCAATCC
GCCGCCTCCA TGGCACGCCA GACGGAAGAC ACCGTTGAAG TAGTGGACGC CAAGGGTTCT
CCGATCGCCC GCAAGACAAT CGCCAGAGGC AACGGCAATA TCTGGGTGGG CGGCATGGGA
ATCTGGGATG ACCAAGATGC CCGCGACGGT GTTTCCGGGT ATAAGTATAA TGCCGGCGGT
TATGCCGTAG GCATTGACTA CAAAGCCGCT CAAGGTTCCC TGATCGGCAT CGCCGCCGGT
CAAAGCTTCG GCAGCTTCAA GGATAAGACA GGCATCGGCG CCGACTACGA CGTCGATTCC
TTCCTGGCCA TGATTTATGG ACGCATGCAT CCCTTCAGGG ACAGCAAGTT CACTCTGGAC
GGCTACGGCG CCTACGGACG TTCCCGTTTC AAAGGTGATT CCTACATCAT GGGGTCTGCC
GCCAACGGCA GAGCAGATAC GGACACCTTC AGCGGCGCCC TTTACGGCAC CTGGACGGAA
CGTTTCGCAC TCGGCAGGGC CTTTATAACA CCTTACACGG GTATTGAGTT CATGACTTCC
GAACTCAAGG GATTCTCTGA AAGCGGACCT TATGGGCGCA CCTTCGGCCA CGCCCGGGCC
CAGAACTGGA CCATTCCTGT CGGCATTACG ATCGCCCGCG CCTACCAGAC GGACGGAGGC
ACCACCATCA CCCCGGCCTT GACGGTAGCC GTATCCCAGG ATGTGAGCCG CATGAATCCG
AAATCCAATG TCGCCGGCCC CCTCGGAACC TGGAACGCCC GCGGCGTCAA TGTTGGCCGC
ACCGCATTCC GTTTGAATGC CGGCATTGAT GTGCTCTTCT CCAACCAGTG GGGAGCACGC
GTCTGCTACC AGTTCGAGAC CCGCAACAAG CTGACCGCCC ACGGTATCAA CGGCGCCCTC
AGCTATACGT TCTAA
 
Protein sequence
MLLAAVIACL GSFSVATAAD YAVNSADEFV TAWNQAAASN EASTITITVP SGSDNITLTQ 
EQRAQLNAIS GTGNVTIQMT DASNKLVNFN YDLVNNQVSF NDVTLSEPLG SDNDIVVTNA
SNTTTIDGRN IAIEGTNDAA NPKTVGATVT STNGQVGIGD NVSMEKAVTV TGQAAQTVDP
ATPPTGQAYT KTTYNTYNNS NEQAVVLGNN VTMQDTVTAT GQIVSDPASK VALNGDVTST
AGIGTVTTEQ FDASNAQTGK TVIDSMDDSV SGGIILGETT AGAVTLKTEG GNISLGDNSV
LDGTTVSAES ADLNKTVYSN DGGSWNTVAS TEKLGTIEGS VTLGENTTVK GNSTLTADDN
IAIGNNSVIT GNTAADGVIS AGGQISIGDG TQVLDNTATS DGKAAINLAD GQTLYIGSGA
ILSGNTSNGV SGSVQAGQNT QINVYTDATA YTFINDGIST TAAASTADAA PLAADQTVMT
KTGAGTLVYG GTGTTDTFGG TYQQLEGNLI IGHATFGAVD SATGEVGPLQ SIDGAVMGTD
DTVYDIRTGQ VTLAKNSTMK GASATFGGDS TLLLSDGSVL DFGTPATFQD NSRVGIQVSD
ASGNPVPLAQ LRKGTESVTV TLNGTDISGR LLNNVFLSTT MAPGTAEGTT TITQDMKGID
GPMSGYNGNV YTVAAALENN RLNVAAGSPA AQFYENLFRA TSADEAARII QSVSGEHVVN
FTWAASRTVR NFADLGRIQS AASMARQTED TVEVVDAKGS PIARKTIARG NGNIWVGGMG
IWDDQDARDG VSGYKYNAGG YAVGIDYKAA QGSLIGIAAG QSFGSFKDKT GIGADYDVDS
FLAMIYGRMH PFRDSKFTLD GYGAYGRSRF KGDSYIMGSA ANGRADTDTF SGALYGTWTE
RFALGRAFIT PYTGIEFMTS ELKGFSESGP YGRTFGHARA QNWTIPVGIT IARAYQTDGG
TTITPALTVA VSQDVSRMNP KSNVAGPLGT WNARGVNVGR TAFRLNAGID VLFSNQWGAR
VCYQFETRNK LTAHGINGAL SYTF
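
The gene and protein sequences above are printed in blocks of 10 characters. As a rough sketch of how the two relate, the following strips the block spacing and translates the DNA codon by codon. The function names are hypothetical. The codon table is built from the standard NCBI ordering; translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the permitted start codons, so the same mapping is assumed here.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_block.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGCTGCTGG CGGCTGTTAT CGCCTGCCTT GGCAGTTTCT CCGTTGCTAC GGCAGCTGAC"
last_line = "AGCTATACGT TCTAA"

print(translate(clean(first_line)))  # MLLAAVIACLGSFSVATAAD
print(translate(clean(last_line)))   # SYTF
```

The first 20 residues and the final four residues produced this way match the start and end of the protein sequence listed above. A library such as Biopython would be the usual choice for translating the full sequence.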