Gene Information |
Locus tag | Amuc_1522 |
Symbol | |
ID | 6274607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1820014 |
End bp | 1823040 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642613581 |
Product | type III restriction protein res subunit |
Protein accession | YP_001878124 |
Protein GI | 187736012 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
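The start, end, gene length, and protein length fields in this record agree with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1820014, 1823040)
print(length_bp)                  # 3027
print(protein_length(length_bp))  # 1008
```

Both values match the Gene Length and Protein Length rows above.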
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000221168 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00161886 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAATTCA AGTTTAAGAT ACAACAATAT CAGACGGAGG CGGTGGAGAA TACCGTGGCT GTCTTCACGG GGCAGCCCTC GTACGCCATA GAGGGATATC GCCTCGACCG CGGACGGCAG GCACAACGAC AGTTGGATTT CGATGATGAA ACAGGATACA GGAACCACTG CGTGGAACTG GATGGGAAAG CCCTGTTGAA AAACATCAAT ACCATTCAGA ATCTGTATGA CATCACGCCG TCTTCTTCCC TTTCAAAGGG TATCGGTGCG GTGAATCTCG ACATAGAGAT GGAGACCGGA ACGGGAAAGA CATACGTCTA TATCAAGACG ATGTTTGAGC TGAATAAACA GTACGGCTGG AGCAAATTCA TCGTGGTGGT GCCGAGCATC GCCATCCGTG AGGGCGTGGC AAAGAGTTTC CGCATGTTGG AGGAGCACTT CATGGAACAC TACGGCAAGA AAGCGCGGTG GTTCATTTAC AACAGCGGCA ACCTGCAACA ACTCGACAGT TTTTCATCTG ATTCCGGGTT GAGCGTGATG ATTATCAACA CGCAGGCTTT CGCCTCCTCG ATGAAGGAGG GCGGCAGAAG CAAGGAGAGC AGGATTATTT ACTCCGAGCG TGATGAGTTC GGCAGCCGCC GTCCCATAGA CGTGATTGCC GCCAACCGTC CCATCATTAT CATGGACGAA CCGCAGAAGA TGGAGGGCGA TGCCACGCAG GCGGGCATCA AACGCTTCAA CCCACTGTTC GTGCTGAACT ATTCGGCTAC GCACACGACC AGGCACGATA CCATCTATGC GCTGGATGCT TTGGACGCTT ACCGGCAGAA GCTGGTGAAG CGCATTCATG TGAAAGGTTT TGAAGTGAAG AACCTCCGGG GAACGAGCGG GTATCTCTAT CTCGACAACA TCGTGTTGTC GCCCAAACGT CCGCCGGAAG CGCGCATTGA GCTGGAGGTG AAGAATGCTT CAGGCAGCAT CGTCAGGAAG ATAAAAACAT TCGGCGTGGG CGACAACCTG CGCGAAGAGT CCGGGCTGGC CGAGTACGAC AACTTTGTGG TGTCGGAAAT CAACATGAAC GGCTATGTGA CCTTTCTCAA CGGCGTGACC ATACGCAGGG GCGAGGTGAT AGGCGACCCG GATGAGCTGG ACATGCAGCG GGTGCAAATC CGGGAGACCA TCATGTCGCA CTTGGAGAAA GAACGCCAGC TTTTCAAGCG GGGCATCAAG TGCCTGTCCC TCTTTTTCAT CGACGAGGTG GCGAAGTACA AGAGCTACGA TGAGAACGGG GAGGAAGTGA AAGGCGTGTT CCAGAAAATG TTCGAGGAGG AATATGCGAG GTTGGTGAAT GAGGAGTTCT ACATCTGGGA TGAGGACTAC AACGAATACC TCCGCCGTTT CCTGCCCCAG GACGTGCATC GGGGTTATTT CTCCATAGAC AAAAAGACTA ACCGGGTGAT TGACGGCAAG GTGGAAAAGA AGACGGGACT GTCAGACGAC ATTTCCGCTT ACGACCTTAT CCTGAAGAAC AAGGAACGCC TGCTGAGCTT TGAGGAGCCG ACACGCTTCA TCTTCTCGCA CTCGGCACTG CGTGAGGGGT GGGACAATCC CAATGTATTT CAGATTTGCA CGCTGCGCCA TTCCAACTCC TCCACCGCCA AACGTCAGGA GGTAGGGCGC GGTTTGCGTA TCTGCGTGGA CAGGAACGGC GTGCGCATGG ACAAGGAACT GCTGGGGGAA GACGTGCATG AGGTGAACAA ACTGACCGTG ATAGCCAACG AGAGCTATGC GGATTTTACC 
ACAGCCCTGC AAAAGGAGAC ACGGGAAGTG TTGCGTGAGC GCGCAGCCAA GGCAACGGTC GCCTATTTTC AAGACAGGCA GATTAAGATT GGGGAGGAAA TACATACCAT TACCGAAACG GAAGCCAGCC GCATCATCAT CTATCTGGAA GACAACGGCT ATATTGACGA GGACAAGCAC ATCACGCCGG ATTACCGTGA GGCCGTGGCA AACGGCACGG TGGCTCCATT GCCGCCCAGG CTGCAACCGA TAGCTGAGGG GGTAGTCCGT CTGATAAACT CCATCTTCGA CCCGAAGGCA CTCGACGACA TGGTGGTGGA GGAGAAGACA ACCACGCCGG ACAACAAACT TAACGAAAAC TTCCAAAAAG CCGAATTCCA AGCTTTGTGG AACGAGATAA ACCACCAGTA TGTTTATACG GTAAGCTACG ACAGCAACGA GCTGATAGAA AAAGCCATCC TGCACATCAA TTCCGAACTG GAGGTAAAGC GGCTCCGCTA TGTGATGGTG GAGGGAACGC AGGATGAGGA GCAGGTAACT GACTTCGGAG ACACCCGTTC CCAATCCAGG CAACTGACCG ATGTCTGCAC TTCCACCGTC CGCTACGACC TTGTGGGCGA CATAGCCAAA GGCGCCAATC TCACCCGCCG CACAGTGGTG AAGATACTGC AAGGCATCCA GACGAGCAAG CTTTACCTGT TCAAGAACAA CCCCGAGGAG TTTATCCGCA AGGCTGTAAG CATCATCAAG GAGCAGAAGG CCACGATGAT TGTGGAGGCC ATCCGCTACA ACATGACGGA AGGCAAATAC GACAGCAGCA TCTTCACCGT GAAGAGCAGA ATGGATTTTG ACCGGGCATA CGAGGCGAAG AAGCATATCA CCGATTATGT GTTCAGCGAC AGCAAGGGGG AACGCCAATT CGCCCATGAC CTTGACGAGG CCCATGAAGT GGTGGTCTAT GCCAAACTGC CCCGTACTTT CCAAATACCC ACTCCGGTAG GCAACTATGC CCCCGACTGG GCTATCGCTA TGACGAAAGA CGGAGTGAAA CACATCTTTT TCATTGCCGA GACCAAAGGC TCCATGTCAT CAATGGATTT GAGTGCCATC GAAAAGGCAA AAATCGCATG TGCGGAGAAG TTGTTCAACT CTATCTCAAC GGCAAATGTG AAGTATCACA AAGTGGCTAC CTATCAGGAT TTGATTGATG AGATGAACGC GGGGTAA
Protein sequence | MKFKFKIQQY QTEAVENTVA VFTGQPSYAI EGYRLDRGRQ AQRQLDFDDE TGYRNHCVEL DGKALLKNIN TIQNLYDITP SSSLSKGIGA VNLDIEMETG TGKTYVYIKT MFELNKQYGW SKFIVVVPSI AIREGVAKSF RMLEEHFMEH YGKKARWFIY NSGNLQQLDS FSSDSGLSVM IINTQAFASS MKEGGRSKES RIIYSERDEF GSRRPIDVIA ANRPIIIMDE PQKMEGDATQ AGIKRFNPLF VLNYSATHTT RHDTIYALDA LDAYRQKLVK RIHVKGFEVK NLRGTSGYLY LDNIVLSPKR PPEARIELEV KNASGSIVRK IKTFGVGDNL REESGLAEYD NFVVSEINMN GYVTFLNGVT IRRGEVIGDP DELDMQRVQI RETIMSHLEK ERQLFKRGIK CLSLFFIDEV AKYKSYDENG EEVKGVFQKM FEEEYARLVN EEFYIWDEDY NEYLRRFLPQ DVHRGYFSID KKTNRVIDGK VEKKTGLSDD ISAYDLILKN KERLLSFEEP TRFIFSHSAL REGWDNPNVF QICTLRHSNS STAKRQEVGR GLRICVDRNG VRMDKELLGE DVHEVNKLTV IANESYADFT TALQKETREV LRERAAKATV AYFQDRQIKI GEEIHTITET EASRIIIYLE DNGYIDEDKH ITPDYREAVA NGTVAPLPPR LQPIAEGVVR LINSIFDPKA LDDMVVEEKT TTPDNKLNEN FQKAEFQALW NEINHQYVYT VSYDSNELIE KAILHINSEL EVKRLRYVMV EGTQDEEQVT DFGDTRSQSR QLTDVCTSTV RYDLVGDIAK GANLTRRTVV KILQGIQTSK LYLFKNNPEE FIRKAVSIIK EQKATMIVEA IRYNMTEGKY DSSIFTVKSR MDFDRAYEAK KHITDYVFSD SKGERQFAHD LDEAHEVVVY AKLPRTFQIP TPVGNYAPDW AIAMTKDGVK HIFFIAETKG SMSSMDLSAI EKAKIACAEK LFNSISTANV KYHKVATYQD LIDEMNAG
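The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be ungrouped before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the grouping, computes a GC fraction, and translates DNA with the amino acid assignments of NCBI translation table 11. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical. The sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest). "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace used to display a sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
prefix = ungroup("ATGAAATTCA AGTTTAAGAT ACAACAATAT")
print(translate(prefix))  # MKFKFKIQQY
```

The translated prefix matches the first ten residues of the protein sequence in this record. Passing the full ungrouped gene sequence to `gc_fraction` would give a value to compare against the GC content row.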