Gene Amuc_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1522 
Symbol 
ID6274607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1820014 
End bp1823040 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content53% 
IMG OID642613581 
Producttype III restriction protein res subunit 
Protein accessionYP_001878124 
Protein GI187736012 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000221168 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00161886 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTCA AGTTTAAGAT ACAACAATAT CAGACGGAGG CGGTGGAGAA TACCGTGGCT 
GTCTTCACGG GGCAGCCCTC GTACGCCATA GAGGGATATC GCCTCGACCG CGGACGGCAG
GCACAACGAC AGTTGGATTT CGATGATGAA ACAGGATACA GGAACCACTG CGTGGAACTG
GATGGGAAAG CCCTGTTGAA AAACATCAAT ACCATTCAGA ATCTGTATGA CATCACGCCG
TCTTCTTCCC TTTCAAAGGG TATCGGTGCG GTGAATCTCG ACATAGAGAT GGAGACCGGA
ACGGGAAAGA CATACGTCTA TATCAAGACG ATGTTTGAGC TGAATAAACA GTACGGCTGG
AGCAAATTCA TCGTGGTGGT GCCGAGCATC GCCATCCGTG AGGGCGTGGC AAAGAGTTTC
CGCATGTTGG AGGAGCACTT CATGGAACAC TACGGCAAGA AAGCGCGGTG GTTCATTTAC
AACAGCGGCA ACCTGCAACA ACTCGACAGT TTTTCATCTG ATTCCGGGTT GAGCGTGATG
ATTATCAACA CGCAGGCTTT CGCCTCCTCG ATGAAGGAGG GCGGCAGAAG CAAGGAGAGC
AGGATTATTT ACTCCGAGCG TGATGAGTTC GGCAGCCGCC GTCCCATAGA CGTGATTGCC
GCCAACCGTC CCATCATTAT CATGGACGAA CCGCAGAAGA TGGAGGGCGA TGCCACGCAG
GCGGGCATCA AACGCTTCAA CCCACTGTTC GTGCTGAACT ATTCGGCTAC GCACACGACC
AGGCACGATA CCATCTATGC GCTGGATGCT TTGGACGCTT ACCGGCAGAA GCTGGTGAAG
CGCATTCATG TGAAAGGTTT TGAAGTGAAG AACCTCCGGG GAACGAGCGG GTATCTCTAT
CTCGACAACA TCGTGTTGTC GCCCAAACGT CCGCCGGAAG CGCGCATTGA GCTGGAGGTG
AAGAATGCTT CAGGCAGCAT CGTCAGGAAG ATAAAAACAT TCGGCGTGGG CGACAACCTG
CGCGAAGAGT CCGGGCTGGC CGAGTACGAC AACTTTGTGG TGTCGGAAAT CAACATGAAC
GGCTATGTGA CCTTTCTCAA CGGCGTGACC ATACGCAGGG GCGAGGTGAT AGGCGACCCG
GATGAGCTGG ACATGCAGCG GGTGCAAATC CGGGAGACCA TCATGTCGCA CTTGGAGAAA
GAACGCCAGC TTTTCAAGCG GGGCATCAAG TGCCTGTCCC TCTTTTTCAT CGACGAGGTG
GCGAAGTACA AGAGCTACGA TGAGAACGGG GAGGAAGTGA AAGGCGTGTT CCAGAAAATG
TTCGAGGAGG AATATGCGAG GTTGGTGAAT GAGGAGTTCT ACATCTGGGA TGAGGACTAC
AACGAATACC TCCGCCGTTT CCTGCCCCAG GACGTGCATC GGGGTTATTT CTCCATAGAC
AAAAAGACTA ACCGGGTGAT TGACGGCAAG GTGGAAAAGA AGACGGGACT GTCAGACGAC
ATTTCCGCTT ACGACCTTAT CCTGAAGAAC AAGGAACGCC TGCTGAGCTT TGAGGAGCCG
ACACGCTTCA TCTTCTCGCA CTCGGCACTG CGTGAGGGGT GGGACAATCC CAATGTATTT
CAGATTTGCA CGCTGCGCCA TTCCAACTCC TCCACCGCCA AACGTCAGGA GGTAGGGCGC
GGTTTGCGTA TCTGCGTGGA CAGGAACGGC GTGCGCATGG ACAAGGAACT GCTGGGGGAA
GACGTGCATG AGGTGAACAA ACTGACCGTG ATAGCCAACG AGAGCTATGC GGATTTTACC
ACAGCCCTGC AAAAGGAGAC ACGGGAAGTG TTGCGTGAGC GCGCAGCCAA GGCAACGGTC
GCCTATTTTC AAGACAGGCA GATTAAGATT GGGGAGGAAA TACATACCAT TACCGAAACG
GAAGCCAGCC GCATCATCAT CTATCTGGAA GACAACGGCT ATATTGACGA GGACAAGCAC
ATCACGCCGG ATTACCGTGA GGCCGTGGCA AACGGCACGG TGGCTCCATT GCCGCCCAGG
CTGCAACCGA TAGCTGAGGG GGTAGTCCGT CTGATAAACT CCATCTTCGA CCCGAAGGCA
CTCGACGACA TGGTGGTGGA GGAGAAGACA ACCACGCCGG ACAACAAACT TAACGAAAAC
TTCCAAAAAG CCGAATTCCA AGCTTTGTGG AACGAGATAA ACCACCAGTA TGTTTATACG
GTAAGCTACG ACAGCAACGA GCTGATAGAA AAAGCCATCC TGCACATCAA TTCCGAACTG
GAGGTAAAGC GGCTCCGCTA TGTGATGGTG GAGGGAACGC AGGATGAGGA GCAGGTAACT
GACTTCGGAG ACACCCGTTC CCAATCCAGG CAACTGACCG ATGTCTGCAC TTCCACCGTC
CGCTACGACC TTGTGGGCGA CATAGCCAAA GGCGCCAATC TCACCCGCCG CACAGTGGTG
AAGATACTGC AAGGCATCCA GACGAGCAAG CTTTACCTGT TCAAGAACAA CCCCGAGGAG
TTTATCCGCA AGGCTGTAAG CATCATCAAG GAGCAGAAGG CCACGATGAT TGTGGAGGCC
ATCCGCTACA ACATGACGGA AGGCAAATAC GACAGCAGCA TCTTCACCGT GAAGAGCAGA
ATGGATTTTG ACCGGGCATA CGAGGCGAAG AAGCATATCA CCGATTATGT GTTCAGCGAC
AGCAAGGGGG AACGCCAATT CGCCCATGAC CTTGACGAGG CCCATGAAGT GGTGGTCTAT
GCCAAACTGC CCCGTACTTT CCAAATACCC ACTCCGGTAG GCAACTATGC CCCCGACTGG
GCTATCGCTA TGACGAAAGA CGGAGTGAAA CACATCTTTT TCATTGCCGA GACCAAAGGC
TCCATGTCAT CAATGGATTT GAGTGCCATC GAAAAGGCAA AAATCGCATG TGCGGAGAAG
TTGTTCAACT CTATCTCAAC GGCAAATGTG AAGTATCACA AAGTGGCTAC CTATCAGGAT
TTGATTGATG AGATGAACGC GGGGTAA
 
Protein sequence
MKFKFKIQQY QTEAVENTVA VFTGQPSYAI EGYRLDRGRQ AQRQLDFDDE TGYRNHCVEL 
DGKALLKNIN TIQNLYDITP SSSLSKGIGA VNLDIEMETG TGKTYVYIKT MFELNKQYGW
SKFIVVVPSI AIREGVAKSF RMLEEHFMEH YGKKARWFIY NSGNLQQLDS FSSDSGLSVM
IINTQAFASS MKEGGRSKES RIIYSERDEF GSRRPIDVIA ANRPIIIMDE PQKMEGDATQ
AGIKRFNPLF VLNYSATHTT RHDTIYALDA LDAYRQKLVK RIHVKGFEVK NLRGTSGYLY
LDNIVLSPKR PPEARIELEV KNASGSIVRK IKTFGVGDNL REESGLAEYD NFVVSEINMN
GYVTFLNGVT IRRGEVIGDP DELDMQRVQI RETIMSHLEK ERQLFKRGIK CLSLFFIDEV
AKYKSYDENG EEVKGVFQKM FEEEYARLVN EEFYIWDEDY NEYLRRFLPQ DVHRGYFSID
KKTNRVIDGK VEKKTGLSDD ISAYDLILKN KERLLSFEEP TRFIFSHSAL REGWDNPNVF
QICTLRHSNS STAKRQEVGR GLRICVDRNG VRMDKELLGE DVHEVNKLTV IANESYADFT
TALQKETREV LRERAAKATV AYFQDRQIKI GEEIHTITET EASRIIIYLE DNGYIDEDKH
ITPDYREAVA NGTVAPLPPR LQPIAEGVVR LINSIFDPKA LDDMVVEEKT TTPDNKLNEN
FQKAEFQALW NEINHQYVYT VSYDSNELIE KAILHINSEL EVKRLRYVMV EGTQDEEQVT
DFGDTRSQSR QLTDVCTSTV RYDLVGDIAK GANLTRRTVV KILQGIQTSK LYLFKNNPEE
FIRKAVSIIK EQKATMIVEA IRYNMTEGKY DSSIFTVKSR MDFDRAYEAK KHITDYVFSD
SKGERQFAHD LDEAHEVVVY AKLPRTFQIP TPVGNYAPDW AIAMTKDGVK HIFFIAETKG
SMSSMDLSAI EKAKIACAEK LFNSISTANV KYHKVATYQD LIDEMNAG