Gene Amuc_1913 details


Gene Information

Locus tag: Amuc_1913
Symbol:
ID: 6275374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2319826
End bp: 2322945
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 50%
IMG OID: 642613973
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_001878507
Protein GI: 187736395
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
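
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the record does not state these conventions, and the function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 2319826
END_BP = 2322945


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3120, matching "Gene Length"
print(protein_length(length_bp))  # 1039, matching "Protein Length"
```

Under those assumptions, 2322945 - 2319826 + 1 gives 3120 bp, and 3120 / 3 - 1 gives 1039 aa, in agreement with the reported values.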


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.0328667
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTA CAACGGAAAG CGCCTTTGAG GAAGCCGTCA TTAAGATGTT GTCCGAACAC 
GGATGGGAAA GCGCAGTATT GAAAAACTAC ACGGAAAAGC AGCTTATCCA AAACTGGGCT
GACATCCTGT TCGAGAACAA CCGGGACATA GACAGCCTGA ATAATTGTCC TTTAACGGAG
GGTGAAATGC TGCAAATTCT GGAGCAGATC GCCATGCTCA AATCGCCGTT GAAGCTGAAC
GGTTTTATCA ACGGCCAAAA CGTGACGATT ACGCGCGATA ACGAGGCTGA TACCTTACAC
TACGGCAAGG AGGTTTCCCT CAAGATTTAT GACCGCAGGG AAATAGCCGC TGGGCAAAGC
CGATACCAGA TTGTGCAGCA GCCCAAGTTT CCCACGGCAT CCCCCTTGCT CAATAATAGG
CGCGGGGATT TGATGCTGTT GATTAACGGC ATGCCCGTGA TTCATATTGA GCTGAAAAAG
AGCGGCGTTC CGGTGAGTCA GGCCTGCCGT CAGATAGAGA AATACACGGC AGAGGGCGTC
TTTAGCGGGC TGTTCTCGCT TGTGCAGATA TTTGTTGGCA TGACACCGGA AGAAACGCTG
TATTTCGCCA ATCCCGGAAT AGACGGGCGC TTTAATCCGG ATTACTATTT CCATTGGGCT
GATGTGGATA ACGAGCCTGT TAATGACTGG AAAAGCGTTA TTTCCACCCT GTTGTCAATC
CCCATGGCGC ACCAGATGAT AGGCTTCTAT ACTGTTGCCG ACAATTCGGA CGGAGTGTTG
AAAGTCATGC GCAGCTATCA GTTTTTTGCC GCCAGCAAAA TTTCGGATAA GGTAGCTCAA
ACAAAATGGG ACGAGAGGAA CCAGCTTGGC GGTTTTGTGT GGCACACCAC GGGTTCAGGC
AAAACAATGA CCTCTTTCAA GTCTGCCCAG TTGATAGCCA CGTCCAAAGA CGCGGACAAG
GTTATCTTCC TGATGGACAG AATAGAACTG GGAACGCAAA GCCTGAAAGA ATACCGGGGC
TTTGCCGATG ACAGTCTGGA TGTTCAGGCA ACGGAAAACA CGGGCGAACT TGTTACAAAA
CTGAAAAGCG GCAACCCTTC CGATACGCTC ATTGTTACCT CCATTCAGAA GATGAACAAC
ATTAAGAATG AGGCAGAGGG CGGACTGAAA GCGGCTGACA TCGAACTGAT GAGCGGAAAG
CGCATCGTGT TCATTGTGGA TGAATGCCAT CGTTCCACGT TCGGTGATAT GCTCATCAAC
ATCAAAGCCA CCTTCCCGCG AGCCATATTC TTTGGTTTTA GCGGAACTCC CATACATGAG
GAAAACCAAA AGAAAGACAA TACGACGACG ACGGTTTTCG GAGATGAGCT GCACCGCTAC
AGTATTGCCG ATGGAATTCG GGATAAGAAT GTCCTGGGCT TTGACCCCTA CAAGGTCTTA
ACCTATGAGG ACAGCGAGCT GAAAACGGCT GTTGCCTTGG AAAAAGCGAA AGCGCACACA
GTAGAGGAAG CCTACGCAGC CCCGGCGAAA GCCGCCGTGT TTCAGCACTA CATGGGGTTG
CCCATGCCGG CGGTGTATGA AGATGAAACG GGAACGAAGC ACGGCATCGA GCATTATCTG
CCTAACAGCC AATATGAAAG AGAGGAGCAT CAACAGGCTG TCATCGCGGA CATTCTGAAA
AATTGGGTAG TCTTGAGCCA TAATGGCAAG TTCCATGCTA TTTTTGCCAC AGCCAGCATT
CAGGAAGCAG TCCAGTACTA CCGGCGCCTG AAAGCAGAAG CCCCGCACCT GAAAATCTCC
GCCATCTTTG ACGCGAACAT CGACAACAAC GGCCATGGCC TGATGAAAGA GCAAGGGCTT
GTTGAAATCA TCAAGGATTA CAACGCTCGC TACGGACAGG ATTTTTCGAT TCCTACCTTT
GCCGGAATGA AAAAAGACAT TGCTGCAAGG TTGGCGCATA AACGGCCATA CGAGCGTATC
GACAAATCGC CGGAGCAGCA GTTGGATTTG CTCATTGTGG TGGATCAAAT GCTCACGGGC
TTTGACTCCA AGTGGATTAA CACGCTCTAC CTGGATAAGA TGCTTTACTA TGAAAACCTC
ATCCAAGCAT TTTCCCGCAC CAATCGCCTG TTTGGTCTGG ATAAGCCTTT CGGCACCATC
CGCTATTACC GGAAGCCGCA TACCATGGAG CGCAACGTTC AGCAGGCGGT GAAGCTGTAC
TCCGGCGATA AACCTTTGGG ATTGTTCGTG GAGAAACTGA ATAGGAATTT GGAATTGCTC
AATACTATCT ATCAGGATAT TTCTGATCTG TTTCATCAGG CCGGAATTGA GGATTTCTCC
CACTTGCCTG CGGAGCCGGA AGAATGCAAG AAGTTCGCAC GGCTGTTCCG GGATTTGAAT
GCCCGCATGG AAGCCGCAAA GATTCAAGGC TTCCGTTGGG ATAAGCGCAT CTATCAGTTT
GCAGACTCCA CGATGGAGGT TGCTCTGGAT GAACACACCT TCAATGTTCT CAGCGTGCGC
TACAACGAGC TGTTCGGCGG CGGTGGAGGT GAGAGCGATG GCCATGTGCC AGATGTACCT
TACGACATTC CGGGCTTCCC TATCCCGATA TCCACCGGCG CAATCGACAA TGATTACATG
AACTCCCGAT TTGAAAAGTT TAGGAAGCTA CTGGGCAATG CTACCGAGGA GGAATTGCAG
CAGACGGAGC AGGAACTGCA TAAGTCCTTT GCGTTCCTTT CCCAAGAAGA GCAGAAGTAT
GCCGATATTT TCCTGCACGA CATTAAGCGT GGTGATGTTG TCCCTGTAGA GGGCAAGACT
TTCCGTGACT ATGTGACCGA GTACATGGCA AAAGCCCAGG ACGACCGCAT ACACCGCTTT
GCCGCCGTGT TTGGTCTGGA TGAAACATTG TTGCGCGGCA TGATGTCTCA TCGGGTGACT
GAGGGGAACA TCAACGATTT TGGCCGCTTT GATGCATTGA AAGCTACAGC GGATAAGAAA
AAGGCCAAAG CCTACTTCGA AACCGGGTCT CACACTCCAT TACCCCCACC CAAAGTGGCC
ATGAAACTGG ACAAAATACT CCGGGATTTC ATCACAAACG GCGGTTTTGA CCTCCCGTAA
 
Protein sequence
MTFTTESAFE EAVIKMLSEH GWESAVLKNY TEKQLIQNWA DILFENNRDI DSLNNCPLTE 
GEMLQILEQI AMLKSPLKLN GFINGQNVTI TRDNEADTLH YGKEVSLKIY DRREIAAGQS
RYQIVQQPKF PTASPLLNNR RGDLMLLING MPVIHIELKK SGVPVSQACR QIEKYTAEGV
FSGLFSLVQI FVGMTPEETL YFANPGIDGR FNPDYYFHWA DVDNEPVNDW KSVISTLLSI
PMAHQMIGFY TVADNSDGVL KVMRSYQFFA ASKISDKVAQ TKWDERNQLG GFVWHTTGSG
KTMTSFKSAQ LIATSKDADK VIFLMDRIEL GTQSLKEYRG FADDSLDVQA TENTGELVTK
LKSGNPSDTL IVTSIQKMNN IKNEAEGGLK AADIELMSGK RIVFIVDECH RSTFGDMLIN
IKATFPRAIF FGFSGTPIHE ENQKKDNTTT TVFGDELHRY SIADGIRDKN VLGFDPYKVL
TYEDSELKTA VALEKAKAHT VEEAYAAPAK AAVFQHYMGL PMPAVYEDET GTKHGIEHYL
PNSQYEREEH QQAVIADILK NWVVLSHNGK FHAIFATASI QEAVQYYRRL KAEAPHLKIS
AIFDANIDNN GHGLMKEQGL VEIIKDYNAR YGQDFSIPTF AGMKKDIAAR LAHKRPYERI
DKSPEQQLDL LIVVDQMLTG FDSKWINTLY LDKMLYYENL IQAFSRTNRL FGLDKPFGTI
RYYRKPHTME RNVQQAVKLY SGDKPLGLFV EKLNRNLELL NTIYQDISDL FHQAGIEDFS
HLPAEPEECK KFARLFRDLN ARMEAAKIQG FRWDKRIYQF ADSTMEVALD EHTFNVLSVR
YNELFGGGGG ESDGHVPDVP YDIPGFPIPI STGAIDNDYM NSRFEKFRKL LGNATEEELQ
QTEQELHKSF AFLSQEEQKY ADIFLHDIKR GDVVPVEGKT FRDYVTEYMA KAQDDRIHRF
AAVFGLDETL LRGMMSHRVT EGNINDFGRF DALKATADKK KAKAYFETGS HTPLPPPKVA
MKLDKILRDF ITNGGFDLP
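
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own tooling, of how a block could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons); since this gene starts with ATG, no special handling of the first codon is attempted. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First displayed line of the gene sequence and of the protein sequence.
first_gene_line = "ATGACTTTTA CAACGGAAAG CGCCTTTGAG GAAGCCGTCA TTAAGATGTT GTCCGAACAC"
first_protein_line = "MTFTTESAFE EAVIKMLSEH"

print(translate(first_gene_line) == clean(first_protein_line))  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the whole protein sequence and the reported GC content to be checked against the record.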