Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0228 |
Symbol | |
ID | 6275301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 284763 |
End bp | 286721 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612273 |
Product | hypothetical protein |
Protein accession | YP_001876852 |
Protein GI | 187734740 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAACT TGCAGAAACG GCAGGCAGGC CATGAGGTCC GGCTGCCGGG GGGATTCGCT TTTTCCTTGA GGGAAGAGGA TGCCTGCCGG GCTGTTACTC CTTCCGTGCT GGAAGGATTG CTGGGGCGTT TCCTGACGGA CGCCCGCCGT TCCGGGACGG TGTATACCCC CGGTTTTCTT GTTCGCTGGA TGGCCCGGGA AGCGGTATCT TCCTGGCTGG AATCCAGATT GCCTTCTCCC AGGAGCGGTG ATGAAGAAGC CGGGTTGCTG GGAAGTATCC GTGTGCTGGA TTTATCCGTA GGGGCCGGAG CATTCACCAT GGGAATGCTC CGGGAGCTTG TCGCCCGCAG AAAGCTTCTG GAACCGGATT GCCCGGAGCC CGATTTAATC CGCGCCGTTT TGGAAGAGAA TATTTACGGA GTGGATGTTT GCGCGGAGGC ATTGGAGGTG GCCCGTTTCC GTTTTCGTTG CGCATGGTTG GCCGCCGGAG GAAAGGGGGA TTTACGGGAT CGCCTGTTGT GCGGGGACAG TCTGGATATG TCTGCGTCCG GCGTCTGGCG TCAGGGTCTT TCAACGGTGA TGGAGGAAGG AGGCTTTGAC CTGGTGATCG GCAATCCCCC TTTTGTCGGA GAAAAAGGGA ACAAGGAATT ATTTGACCGT TTGAAATGTT CTTCCCTTGC TTCCTACTGC TCTTCCCGCA TGGATTACTG GTACGTGTTT GCCTGTGTGG GGCTGGATGC CCTGAAGCGG GGGGGCGTAA TGCATCTGGT CGTTCCCAAC AAGTGGATGT CTAATGCGGG AGCCGCCTCT CTCCGCCGCA AGCTGCTGGA AGATTGCGGT TTGCTGCGCC TGTCGGATTT CGGAGCCTGC CGCGTGTTTG AATCCGCCCG CGTCCATACC ATGACTGTTC TGGCGGAGAA AAAGGGAGAG GGAGGCGGAC GCCTGGTTCC GGAATACCGC CGTTTTGGCG GCTCCTTGAA TGACGTGGAA TCATTTCTGG AACATGAGCC GTACCGTCGG TTTCCCTTGC CGGAAGATGC CGGAGCGTGC GTCAGGGAAG GTTTGTCCTT CTGCTCTGCA GCAGAGCGGC ATCTTCTGGA AAAGATGGAT GCCTGCCGGA ATTTCAGGCT GGATGCCGCA AAGGAAATGA CGCAGGGGAT TGTGCCGAAT CCTGACGTCG TTTCTTCCCG AGCTCTGGAG AATCTGGCGC CGGAAACAGT GCGGCGGTAC GGAATCCGGA GGGGGGATGG GGTATTTGTC TTGCCTCAGG GGTATTTTGG CCTGTTGCCG GAACGGGAGC GGCGTTTTCT CAAACCTCTG TATGAACCTG TCCAGGCAGG GCGGCATGCA CTGAAAGCTC CGGAGAAGGT GCTGTTATAT CTGACCCCGT CCAACGGAGC GGAGCAGGCC GTTACGCTTT TGCGCCATTT GGAAAAATTC CGTCCCCTGA TGGAGGCCCG GCGGGAAACA CGCATGGGCC GCATGAAATA TTACCATGTG CACTGGCCCC GGCAGGAACG TTTTTTCCGG CCCGGCCCCA AAATTCTGGC TGTCAGGAAG TGTGCCCGTC CAACCTTCTC CTATACGGAA AAGGAAGCTT ATGTGATGAT GGCATTCAAC GTGATACGCT CGGAACGCGT GAGCATGAAA TATCTGGCGG CCCTGTTCAA TTCGCGCCTG ATGCAGTTTT GGTTTCTCCG CCGGGGAAAA ATGCAGGGGG ATTTTTTCCA GATGGATACC GCCCCTGTCC TCCGGGCTCC CATCCGGGTC CCCGGTCTGT CCCTGCTGCG GGAAGTGGAA GGGCTGGCGG ACGCTCTCGC GGCCCGCTAT TGCCCGGAAG GGGATGAGCG GATGAATGGG CTGATGGAAG ACATTTACGG CCTTTCCCCG CAGGAGCGGG AAATAATTAT TCAGGCATGT TCCCCCATGC GGGCCGGGAG GGAAAGCTTG CCGGAGTGA
|
Protein sequence | MDNLQKRQAG HEVRLPGGFA FSLREEDACR AVTPSVLEGL LGRFLTDARR SGTVYTPGFL VRWMAREAVS SWLESRLPSP RSGDEEAGLL GSIRVLDLSV GAGAFTMGML RELVARRKLL EPDCPEPDLI RAVLEENIYG VDVCAEALEV ARFRFRCAWL AAGGKGDLRD RLLCGDSLDM SASGVWRQGL STVMEEGGFD LVIGNPPFVG EKGNKELFDR LKCSSLASYC SSRMDYWYVF ACVGLDALKR GGVMHLVVPN KWMSNAGAAS LRRKLLEDCG LLRLSDFGAC RVFESARVHT MTVLAEKKGE GGGRLVPEYR RFGGSLNDVE SFLEHEPYRR FPLPEDAGAC VREGLSFCSA AERHLLEKMD ACRNFRLDAA KEMTQGIVPN PDVVSSRALE NLAPETVRRY GIRRGDGVFV LPQGYFGLLP ERERRFLKPL YEPVQAGRHA LKAPEKVLLY LTPSNGAEQA VTLLRHLEKF RPLMEARRET RMGRMKYYHV HWPRQERFFR PGPKILAVRK CARPTFSYTE KEAYVMMAFN VIRSERVSMK YLAALFNSRL MQFWFLRRGK MQGDFFQMDT APVLRAPIRV PGLSLLREVE GLADALAARY CPEGDERMNG LMEDIYGLSP QEREIIIQAC SPMRAGRESL PE
|
| |