Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1913 |
Symbol | |
ID | 6275374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2319826 |
End bp | 2322945 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642613973 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_001878507 |
Protein GI | 187736395 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0328667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTTA CAACGGAAAG CGCCTTTGAG GAAGCCGTCA TTAAGATGTT GTCCGAACAC GGATGGGAAA GCGCAGTATT GAAAAACTAC ACGGAAAAGC AGCTTATCCA AAACTGGGCT GACATCCTGT TCGAGAACAA CCGGGACATA GACAGCCTGA ATAATTGTCC TTTAACGGAG GGTGAAATGC TGCAAATTCT GGAGCAGATC GCCATGCTCA AATCGCCGTT GAAGCTGAAC GGTTTTATCA ACGGCCAAAA CGTGACGATT ACGCGCGATA ACGAGGCTGA TACCTTACAC TACGGCAAGG AGGTTTCCCT CAAGATTTAT GACCGCAGGG AAATAGCCGC TGGGCAAAGC CGATACCAGA TTGTGCAGCA GCCCAAGTTT CCCACGGCAT CCCCCTTGCT CAATAATAGG CGCGGGGATT TGATGCTGTT GATTAACGGC ATGCCCGTGA TTCATATTGA GCTGAAAAAG AGCGGCGTTC CGGTGAGTCA GGCCTGCCGT CAGATAGAGA AATACACGGC AGAGGGCGTC TTTAGCGGGC TGTTCTCGCT TGTGCAGATA TTTGTTGGCA TGACACCGGA AGAAACGCTG TATTTCGCCA ATCCCGGAAT AGACGGGCGC TTTAATCCGG ATTACTATTT CCATTGGGCT GATGTGGATA ACGAGCCTGT TAATGACTGG AAAAGCGTTA TTTCCACCCT GTTGTCAATC CCCATGGCGC ACCAGATGAT AGGCTTCTAT ACTGTTGCCG ACAATTCGGA CGGAGTGTTG AAAGTCATGC GCAGCTATCA GTTTTTTGCC GCCAGCAAAA TTTCGGATAA GGTAGCTCAA ACAAAATGGG ACGAGAGGAA CCAGCTTGGC GGTTTTGTGT GGCACACCAC GGGTTCAGGC AAAACAATGA CCTCTTTCAA GTCTGCCCAG TTGATAGCCA CGTCCAAAGA CGCGGACAAG GTTATCTTCC TGATGGACAG AATAGAACTG GGAACGCAAA GCCTGAAAGA ATACCGGGGC TTTGCCGATG ACAGTCTGGA TGTTCAGGCA ACGGAAAACA CGGGCGAACT TGTTACAAAA CTGAAAAGCG GCAACCCTTC CGATACGCTC ATTGTTACCT CCATTCAGAA GATGAACAAC ATTAAGAATG AGGCAGAGGG CGGACTGAAA GCGGCTGACA TCGAACTGAT GAGCGGAAAG CGCATCGTGT TCATTGTGGA TGAATGCCAT CGTTCCACGT TCGGTGATAT GCTCATCAAC ATCAAAGCCA CCTTCCCGCG AGCCATATTC TTTGGTTTTA GCGGAACTCC CATACATGAG GAAAACCAAA AGAAAGACAA TACGACGACG ACGGTTTTCG GAGATGAGCT GCACCGCTAC AGTATTGCCG ATGGAATTCG GGATAAGAAT GTCCTGGGCT TTGACCCCTA CAAGGTCTTA ACCTATGAGG ACAGCGAGCT GAAAACGGCT GTTGCCTTGG AAAAAGCGAA AGCGCACACA GTAGAGGAAG CCTACGCAGC CCCGGCGAAA GCCGCCGTGT TTCAGCACTA CATGGGGTTG CCCATGCCGG CGGTGTATGA AGATGAAACG GGAACGAAGC ACGGCATCGA GCATTATCTG CCTAACAGCC AATATGAAAG AGAGGAGCAT CAACAGGCTG TCATCGCGGA CATTCTGAAA AATTGGGTAG TCTTGAGCCA TAATGGCAAG TTCCATGCTA TTTTTGCCAC AGCCAGCATT CAGGAAGCAG TCCAGTACTA CCGGCGCCTG AAAGCAGAAG CCCCGCACCT GAAAATCTCC GCCATCTTTG ACGCGAACAT CGACAACAAC GGCCATGGCC TGATGAAAGA GCAAGGGCTT GTTGAAATCA TCAAGGATTA CAACGCTCGC TACGGACAGG ATTTTTCGAT TCCTACCTTT GCCGGAATGA AAAAAGACAT TGCTGCAAGG TTGGCGCATA AACGGCCATA CGAGCGTATC GACAAATCGC CGGAGCAGCA GTTGGATTTG CTCATTGTGG TGGATCAAAT GCTCACGGGC TTTGACTCCA AGTGGATTAA CACGCTCTAC CTGGATAAGA TGCTTTACTA TGAAAACCTC ATCCAAGCAT TTTCCCGCAC CAATCGCCTG TTTGGTCTGG ATAAGCCTTT CGGCACCATC CGCTATTACC GGAAGCCGCA TACCATGGAG CGCAACGTTC AGCAGGCGGT GAAGCTGTAC TCCGGCGATA AACCTTTGGG ATTGTTCGTG GAGAAACTGA ATAGGAATTT GGAATTGCTC AATACTATCT ATCAGGATAT TTCTGATCTG TTTCATCAGG CCGGAATTGA GGATTTCTCC CACTTGCCTG CGGAGCCGGA AGAATGCAAG AAGTTCGCAC GGCTGTTCCG GGATTTGAAT GCCCGCATGG AAGCCGCAAA GATTCAAGGC TTCCGTTGGG ATAAGCGCAT CTATCAGTTT GCAGACTCCA CGATGGAGGT TGCTCTGGAT GAACACACCT TCAATGTTCT CAGCGTGCGC TACAACGAGC TGTTCGGCGG CGGTGGAGGT GAGAGCGATG GCCATGTGCC AGATGTACCT TACGACATTC CGGGCTTCCC TATCCCGATA TCCACCGGCG CAATCGACAA TGATTACATG AACTCCCGAT TTGAAAAGTT TAGGAAGCTA CTGGGCAATG CTACCGAGGA GGAATTGCAG CAGACGGAGC AGGAACTGCA TAAGTCCTTT GCGTTCCTTT CCCAAGAAGA GCAGAAGTAT GCCGATATTT TCCTGCACGA CATTAAGCGT GGTGATGTTG TCCCTGTAGA GGGCAAGACT TTCCGTGACT ATGTGACCGA GTACATGGCA AAAGCCCAGG ACGACCGCAT ACACCGCTTT GCCGCCGTGT TTGGTCTGGA TGAAACATTG TTGCGCGGCA TGATGTCTCA TCGGGTGACT GAGGGGAACA TCAACGATTT TGGCCGCTTT GATGCATTGA AAGCTACAGC GGATAAGAAA AAGGCCAAAG CCTACTTCGA AACCGGGTCT CACACTCCAT TACCCCCACC CAAAGTGGCC ATGAAACTGG ACAAAATACT CCGGGATTTC ATCACAAACG GCGGTTTTGA CCTCCCGTAA
|
Protein sequence | MTFTTESAFE EAVIKMLSEH GWESAVLKNY TEKQLIQNWA DILFENNRDI DSLNNCPLTE GEMLQILEQI AMLKSPLKLN GFINGQNVTI TRDNEADTLH YGKEVSLKIY DRREIAAGQS RYQIVQQPKF PTASPLLNNR RGDLMLLING MPVIHIELKK SGVPVSQACR QIEKYTAEGV FSGLFSLVQI FVGMTPEETL YFANPGIDGR FNPDYYFHWA DVDNEPVNDW KSVISTLLSI PMAHQMIGFY TVADNSDGVL KVMRSYQFFA ASKISDKVAQ TKWDERNQLG GFVWHTTGSG KTMTSFKSAQ LIATSKDADK VIFLMDRIEL GTQSLKEYRG FADDSLDVQA TENTGELVTK LKSGNPSDTL IVTSIQKMNN IKNEAEGGLK AADIELMSGK RIVFIVDECH RSTFGDMLIN IKATFPRAIF FGFSGTPIHE ENQKKDNTTT TVFGDELHRY SIADGIRDKN VLGFDPYKVL TYEDSELKTA VALEKAKAHT VEEAYAAPAK AAVFQHYMGL PMPAVYEDET GTKHGIEHYL PNSQYEREEH QQAVIADILK NWVVLSHNGK FHAIFATASI QEAVQYYRRL KAEAPHLKIS AIFDANIDNN GHGLMKEQGL VEIIKDYNAR YGQDFSIPTF AGMKKDIAAR LAHKRPYERI DKSPEQQLDL LIVVDQMLTG FDSKWINTLY LDKMLYYENL IQAFSRTNRL FGLDKPFGTI RYYRKPHTME RNVQQAVKLY SGDKPLGLFV EKLNRNLELL NTIYQDISDL FHQAGIEDFS HLPAEPEECK KFARLFRDLN ARMEAAKIQG FRWDKRIYQF ADSTMEVALD EHTFNVLSVR YNELFGGGGG ESDGHVPDVP YDIPGFPIPI STGAIDNDYM NSRFEKFRKL LGNATEEELQ QTEQELHKSF AFLSQEEQKY ADIFLHDIKR GDVVPVEGKT FRDYVTEYMA KAQDDRIHRF AAVFGLDETL LRGMMSHRVT EGNINDFGRF DALKATADKK KAKAYFETGS HTPLPPPKVA MKLDKILRDF ITNGGFDLP
|
| |