Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1355 |
Symbol | |
ID | 6275829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1627047 |
End bp | 1628744 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642613411 |
Product | Bacteriophage tail assembly protein-like protein |
Protein accession | YP_001877960 |
Protein GI | 187735848 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00357201 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.000000000207538 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAGACG TTGAAAAGGC TGCCGGATTT CTGAATCTTT TCGCGGATAA CGTATCCGCT GGAATGGACG TGGAGCCGAT AGAATGGATA CGTGAAAACG TGGTAGACCA GCAATCGGCG CGGTCTACCC ATATTGATTT TACGTTGAGC CCGTTTCTGC TGGAACCCAT TGACAAATTC CTGAATGACG GCACGGTCAG GCACATTAAC CTGATGGCTC CCACCGGTTC CGGAAAGTCG ACATTGTTCG TCGGCTTGCT TAATTACCTC ATTGCCAATG ATGCGGGGAA TACCTTGGTC GCTTTCCAGA ATGAACAGGA AACCTCTGAT TTCGCGGAAA CGAGGCTTTT TCCCACATTC CGGGATAATA AGGCTTTAAA AAACTTGCTG CCGAAGAAAA GGCATGCGGC CAGGAAAACG GAGATTCTTT TTCCGCACAT GAACTTGTGG ATGGTCTCCG CTACCAAGGG GCAGTTGCAG TCGAAGTCCT GCCGGTATTT GATTGGTGAT GAAATGTGGG CGTGGGAAAA GGGAATGGTC CGGGAGTTCC TGGCCCGCCA CCATGATCGG TTCAACCGTA AAATCCTGAT GGTTTCCCAG GGAGGGGACA AGGGGACGGA CTGGGTTGAC GAGTACGGAA AGGGCCGCAT CCACCATTAC CATTGGCAAT GCCCCGGCTG CCAGGGATGG AATGCCTATG ACTGGCGGGA TGTCATTTAC AGCAAAGAAG AAAATATTGA TTGGGAACGT TTTAAGGAAT CCGTCAAAAT GGTTTGTCCC CGGTGTTTGC ACGAAATAGA GGATACTGTT CACAACCGGC GCCATCTGGC CGGCGGCGGC AAGTACGTGT TTTCCGGCAA TACGAGCGCC TTGCCGGAGA TCGTAAGCTA CAATTTTAAT GCTTTGGCCT GTTACTGGGT ATCATGGGCG GATTTGGCCG TGGAATGGAT CCTGGCCAAT AAAAAGAAGA GGAACGGGGA TATAGAACCG CTGAAAAAGT TTATCCAGAA GCGGCTTGCT CAAAACATTG TGGATTTGGG CGAGAAAGAC GACGTGTTGA AAATCCCGCT CACGGCGGAA AGCATGGAGG GGTACGCCGT GGAAAACGAA CGGGCGCGCT TCTTAACGGT AGATGTCCAG AAAGGGCACT TTTGGCATAC TGTTTACGGC GTGGATGCCG GAGGGGCTTT CCATTTACTT TCAGAGGGGC GTCTTGAAAC GTTGGAGGAT ATTGAACGTA AACAGGCGGA GTTTAACGTG CCTGATCATT GCGTTGCTTT GGACTGCGCT TTTGATACGG ATGCCGTGCG GAAGATATGC GGGCTTCATC ACTGGTTTTC CATGAATGGG ACGGTCAAGG AAGAATACGT GCATAAAATC AGGGGGCGGG GGATTAAATT GATTTATGCG CCTCTTGAGC GGCACATGGT GGAGGGGATT CAATGCCTTC ATTTCAACTT TTCTTCCCAG CGGGCGAAGG ACGTTCTTGC CGCGCGGATT AAATCGGGGC ATTTCAAGGT TCCCCATGAT GTTTCTGCCG AGTACATCAA GCAGATGCAG GCTGAAAGCA AACAGGAGTC CATAGACAAG CGCACGGGGA GGGTTTCTCT CAAATGGCTT GCGTCCGGTA ATAATTCCCA CATGTGGGAC TGTTCCTGCA TGGCGGTGAT TTTTGCCATG ATTCACCGGA TGATTTAA
|
Protein sequence | MRDVEKAAGF LNLFADNVSA GMDVEPIEWI RENVVDQQSA RSTHIDFTLS PFLLEPIDKF LNDGTVRHIN LMAPTGSGKS TLFVGLLNYL IANDAGNTLV AFQNEQETSD FAETRLFPTF RDNKALKNLL PKKRHAARKT EILFPHMNLW MVSATKGQLQ SKSCRYLIGD EMWAWEKGMV REFLARHHDR FNRKILMVSQ GGDKGTDWVD EYGKGRIHHY HWQCPGCQGW NAYDWRDVIY SKEENIDWER FKESVKMVCP RCLHEIEDTV HNRRHLAGGG KYVFSGNTSA LPEIVSYNFN ALACYWVSWA DLAVEWILAN KKKRNGDIEP LKKFIQKRLA QNIVDLGEKD DVLKIPLTAE SMEGYAVENE RARFLTVDVQ KGHFWHTVYG VDAGGAFHLL SEGRLETLED IERKQAEFNV PDHCVALDCA FDTDAVRKIC GLHHWFSMNG TVKEEYVHKI RGRGIKLIYA PLERHMVEGI QCLHFNFSSQ RAKDVLAARI KSGHFKVPHD VSAEYIKQMQ AESKQESIDK RTGRVSLKWL ASGNNSHMWD CSCMAVIFAM IHRMI
|
| |