Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1098 |
Symbol | |
ID | 6274003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1310567 |
End bp | 1313290 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642613149 |
Product | type II and III secretion system protein |
Protein accession | YP_001877705 |
Protein GI | 187735593 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0580477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.000681519 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACCACG CACCACTTTA TCAACCGAAG CGTTCCCTGA TCGCGCTGAT GGCCATTGCT GCCTCCTGCC CGTTTGCGCA GGCTGGTGAT GGCGGCGCCG TCGGAACCTC CAGTGCCTTG GGATCGTCCT ACGCCGGACC GGGCAGTTAC CAATACCAGT CCAGTGCAGC GCGCACAGCC ATGGCCCGCC GCGAAGCGCA AACCCAGGAA GCCATGCAGC TCCTTGCGGA AGGCCGCAAC CTGTACCGCG AAGGCAAGTA CAAGGAAGCC CTGGACAAAT ACAACGCCGC TTACAACATG CTGCCTTCCG CGCCGATCAA CGACCAGCGC AAGGAAGCTA TCGCCAACCA TATTGGCGAC GCCAGCATCG CCGTTGCTCA GGAATACATC AAAGTGGGGC GCTATGACGA AGCCGACAAG CTTCTGCAGG ATGCCATCAA GCTCAATCCC CGGAGCGCCA AGCTTGCCAA GCAAACGCTC GAATACATGA AGGACCCGAT TCGCACGAAT CCGGCGCTCA CCCCCGAACA CGTGAAAAAC GTGGAAAAGG TGAACACCCT CCTTCACATG GCCTATGGTT ACTATGACCT CGGCGACTAC GACAAAGCCA TCGCCGAATT CAACAAGGTT CTCTCCATCG ACCCGTACAA CGTGGCGGCC CGCCGCGGAC AGGAAACGGT CAACCGCCGC AGAATGGCTT ACTATGCAGC CGCTTACGAC GAAACCCGCA GCACCATGCT GGCGGAAGTG GACAAGATGT GGGAACGCCC CATCCCGATG GAAGTCCCGA CCGGAGCCGA CGGCACTGAC AACGCACCGA TCACGGACAT CAACGGCGCC ACGGCCAATC TGATGAAGCT CAAGAGCATC ATCATCCCCT CCGTCTCTTT TGAAGACACC ACCGTGGAAG ACGCCATTGA CTATCTGCGC AAAAAGTCCA TTGAACTGGA CCGCACGGTA GGCCCGAACG GTGAACGCGG CATCAACTTT GTCATCAATG ATTCCCAGCC CGCTGCCGTC GCCCCCGCCG TTCCCGCAAC TGATGAAGAC GGCTTTGGCG AAGAAACTGC GGAAGTCACG GAAGCTGCTC CGGCAGCAGC CCCGCAGGAA AGCATCCGCA CCCGCAAAAT CGGCCAGTTG AAGCTGACTA ATGTTCCCAT GCTGGAAGTG CTCCGTTTCA TCTGCAGCAA TGCCGGCCTG CGCCAGAAGG TGGAAGACTA TGCAGTGACC ATCCTTCCTG CCGGCGGCAA TGACGTGGAT CTGTACCAGC GCACCTTCTC CGTGCCCCCG GGCTTCCAGT CCGCTCTCCG CACCACCGTC GGCGACGGCG GCGGCGAAGT CAGTGACGAC CCCTTTGGCG GCGGTGGCGA AAGCTCCTCC GGCCTTAAGC CCATGCCCTC CATCCGCAGC CTGCTGCAAA AGAGCGGCAT CAGCTTCCCG GAAGGCGCCA CGGCATTCCT TGTCAACGGC AATTCCTCCC TGGTCGTCCG CAACACTTCC GGCAACCTGG ACCTGATCGA ACAGCTCATT GAAAACACCC GCGGCGAATC CCAGCAGGTG CGCATCATGA CCAAGTTCGT GGAAGTAACC CAGGAAAACA CGGAAGAACT CGGCTTTGAC TGGATTGTCA CCCCGTTCTC CGTAAGCAAT GACCGCAGCA CCTTCCTGGG CGGCGGCACG AACTACGGCA CCGGTTCCAC TTCCGACAGC TTTACCCAGT CTCCCGGCGG CGTGACCGGC TGGCCTGTAA ACAGCGGCAG CGACACCATC AACGGCCTCG TTACCGGCGG CAACCGCACG GGTGACTACG CCATCACCAA GAACTCCGTG GACAATCTGC TGAACAGCAC CAACCGCTCT GAAGCCTCCC AGAAAAACGC CGCTCCCGGC ATCATGTCCC TGACGGGCAT TTATGACGAA GGCTCCTTCC AGATGCTGAT GCGCGGCCTG TCCCAGAAAA AAGGCTCTGA CGTCCTCACC GCCCCCAGCG TGACCGCCAA GTCCGGTGAA ACCGCCAAGA TTGAAATCAT CCGCGAATTC TGGTATCCCA CCGAATACGA ACCGCCGGAA CTCCCCAACT CCGTAGGCAA CAGCGGCTAT AACAACGGTT ACGGTTATGG GAACAACGGT AACATCGTGG ACGGCCTGCT GGGCAATCAG ATCCAGCCCC AGATATCCAG CTTCCCCGTC ACTCCCGCCA CCCCCGGCGT GTTTGAAATG AAGCCCGTCG GCGTAACTCT GGAAGTGGTG CCTACCATTG GCGACAACAA GTACATCATC GACCTGAACT TCAAGCCCAG CATCGTGGAA TTTGAAGGCT TCGTGAACTA CGGCAGCCCG ATCCAGTCCA CCGGCGTTGG TTCCGACGGC AAGCCGATGT CCCTGACGCT GACGGAAAAC CGCATCGAGC AGCCGATCTT CTCCAAGAGG TCCGTTGAAA CGTCCCTGTT CATCTACGAC GGCCATACCG TGGCAATCGG TGGTTTGATC ACGGAAAACG TGCAGACGGT GGAAGACAAA GTGCCGATCT TCGGTGACCT GCCTCTCATC GGCCGCTTCT TCCGCAGCAA CTCCGACAAC CACATCAAGA AGAATCTGAT GATCTTCGTA ACGGGACAGA TCATTGACGC CACGGGCCAG CCCGTACGCG GCAATGCCCT TCCCACTGCG GCCGCTCCGG AAAGCGCCCT TCCCGCCTCC GAAGGCCTGC TGCCTCCCAT GTAG
|
Protein sequence | MDHAPLYQPK RSLIALMAIA ASCPFAQAGD GGAVGTSSAL GSSYAGPGSY QYQSSAARTA MARREAQTQE AMQLLAEGRN LYREGKYKEA LDKYNAAYNM LPSAPINDQR KEAIANHIGD ASIAVAQEYI KVGRYDEADK LLQDAIKLNP RSAKLAKQTL EYMKDPIRTN PALTPEHVKN VEKVNTLLHM AYGYYDLGDY DKAIAEFNKV LSIDPYNVAA RRGQETVNRR RMAYYAAAYD ETRSTMLAEV DKMWERPIPM EVPTGADGTD NAPITDINGA TANLMKLKSI IIPSVSFEDT TVEDAIDYLR KKSIELDRTV GPNGERGINF VINDSQPAAV APAVPATDED GFGEETAEVT EAAPAAAPQE SIRTRKIGQL KLTNVPMLEV LRFICSNAGL RQKVEDYAVT ILPAGGNDVD LYQRTFSVPP GFQSALRTTV GDGGGEVSDD PFGGGGESSS GLKPMPSIRS LLQKSGISFP EGATAFLVNG NSSLVVRNTS GNLDLIEQLI ENTRGESQQV RIMTKFVEVT QENTEELGFD WIVTPFSVSN DRSTFLGGGT NYGTGSTSDS FTQSPGGVTG WPVNSGSDTI NGLVTGGNRT GDYAITKNSV DNLLNSTNRS EASQKNAAPG IMSLTGIYDE GSFQMLMRGL SQKKGSDVLT APSVTAKSGE TAKIEIIREF WYPTEYEPPE LPNSVGNSGY NNGYGYGNNG NIVDGLLGNQ IQPQISSFPV TPATPGVFEM KPVGVTLEVV PTIGDNKYII DLNFKPSIVE FEGFVNYGSP IQSTGVGSDG KPMSLTLTEN RIEQPIFSKR SVETSLFIYD GHTVAIGGLI TENVQTVEDK VPIFGDLPLI GRFFRSNSDN HIKKNLMIFV TGQIIDATGQ PVRGNALPTA AAPESALPAS EGLLPPM
|
| |