Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2010 |
Symbol | |
ID | 6275769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2440271 |
End bp | 2443576 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642614069 |
Product | hypothetical protein |
Protein accession | YP_001878601 |
Protein GI | 187736489 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.320871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.091538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGTT CTCTCACTTT TTCGTTTGAT ATTGGTTATG CATCCATTGG ATGGGCTGTC ATTGCTTCCG CATCCCATGA TGATGCGGAT CCCTCGGTTT GCGGTTGCGG TACGGTTCTG TTTCCGAAAG ATGATTGTCA GGCATTTAAA AGGCGTGAAT ACAGACGTTT GAGACGCAAT ATCCGCTCCC GGCGCGTTCG CATTGAGCGT ATCGGCAGAT TGCTGGTTCA GGCGCAAATC ATCACGCCGG AAATGAAAGA AACTTCCGGG CACCCCGCTC CCTTTTATTT GGCGTCAGAA GCGTTAAAAG GGCATCGAAC TCTCGCCCCG ATTGAGCTTT GGCATGTTCT CCGCTGGTAT GCTCATAACA GAGGGTACGA CAATAATGCC TCATGGTCTA ACAGCCTTTC AGAAGATGGC GGGAACGGTG AGGATACCGA GAGAGTGAAG CATGCTCAAG ATTTGATGGA TAAACATGGG ACGGCGACCA TGGCGGAAAC CATTTGCCGG GAGTTGAAAC TGGAAGAAGG CAAAGCGGAT GCTCCGATGG AGGTTTCAAC GCCGGCTTAT AAAAATCTCA ATACCGCCTT TCCTCGCTTA ATCGTGGAAA AGGAGGTACG GCGCATATTG GAGCTTTCCG CGCCTCTGAT TCCTGGGCTG ACTGCGGAGA TCATAGAGTT GATTGCGCAG CATCATCCCC TGACAACGGA ACAGCGCGGC GTGTTGCTTC AGCACGGGAT AAAATTGGCT CGGCGTTATC GTGGAAGTCT TTTGTTCGGG CAGTTAATCC CCCGTTTTGA TAACCGCATC ATCAGCCGCT GCCCTGTCAC GTGGGCGCAG GTGTATGAAG CTGAGTTGAA GAAAGGCAAT TCTGAGCAAA GCGCCCGTGA ACGGGCAGAA AAACTATCCA AGGTGCCCAC GGCGAATTGC CCGGAATTTT ATGAATACCG CATGGCCCGG ATTTTATGCA ATATCCGTGC AGACGGAGAA CCTCTTTCTG CAGAGATACG CAGAGAATTG ATGAATCAGG CCCGACAGGA AGGCAAGTTG ACCAAAGCCT CTCTGGAGAA GGCTATTTCT TCCCGTCTGG GAAAGGAGAC AGAGACTAAT GTAAGCAACT ATTTTACTTT GCATCCTGAC AGCGAAGAGG CTCTTTACCT GAACCCTGCC GTGGAAGTTC TGCAAAGAAG CGGCATCGGG CAAATTCTTT CGCCGTCTGT GTATCGAATT GCCGCCAATC GCCTGCGTCG CGGGAAGTCC GTTACTCCAA ACTATTTGTT GAATTTGCTT AAGTCTCGTG GGGAATCTGG CGAGGCGTTG GAAAAGAAAA TAGAGAAAGA ATCTAAAAAG AAAGAGGCGG ATTATGCCGA CACTCCGTTA AAACCCAAAT ATGCGACGGG GCGTGCGCCG TATGCCCGCA CCGTCTTGAA AAAAGTGGTG GAAGAAATTC TTGATGGAGA AGATCCGACG CGTCCTGCCC GGGGGGAAGC GCATCCGGAT GGGGAACTGA AAGCGCATGA CGGTTGCCTG TATTGCCTCC TTGATACGGA TTCTTCCGTG AATCAGCACC AGAAAGAGCG CCGTCTTGAT ACGATGACCA ACAACCACCT TGTGCGTCAC CGTATGTTGA TTCTGGATCG CTTGCTGAAG GATCTGATTC AAGATTTCGC TGACGGGCAA AAAGACAGAA TCTCCCGCGT TTGCGTGGAA GTTGGCAAGG AGCTGACGAC GTTTTCCGCC ATGGACAGCA AAAAAATTCA GAGAGAACTA ACTCTGCGCC AGAAAAGCCA TACGGATGCC GTCAATAGAT TAAAACGGAA GTTGCCGGGG AAAGCGCTTT CTGCCAACCT GATACGCAAG TGCCGCATTG CCATGGACAT GAACTGGACA TGCCCGTTCA CCGGCGCAAC GTATGGCGAT CATGAGCTGG AAAATCTGGA GCTGGAACAT ATCGTGCCCC ATTCTTTCCG GCAGTCTAAC GCGCTTTCTT CTCTGGTTCT TACCTGGCCG GGAGTCAATA GGATGAAAGG TCAGCGCACC GGGTACGACT TTGTGGAGCA GGAGCAGGAG AATCCTGTGC CGGATAAACC CAACCTGCAT ATTTGTTCCC TGAATAATTA CAGGGAATTG GTTGAAAAGT TGGATGACAA GAAGGGGCAT GAAGATGACC GCAGGCGCAA AAAGAAGCGC AAAGCCTTAC TGATGGTGAG GGGATTGTCT CATAAACATC AATCACAAAA TCACGAGGCC ATGAAGGAAA TAGGCATGAC GGAAGGCATG ATGACGCAGA GTTCCCACCT GATGAAACTG GCATGCAAGT CTATTAAAAC CTCTCTGCCG GATGCGCACA TCGACATGAT TCCCGGCGCT GTTACTGCTG AAGTTCGCAA GGCGTGGGAT GTTTTTGGGG TCTTTAAGGA ATTATGCCCG GAAGCTGCCG ACCCGGACTC CGGCAAGATT CTTAAGGAAA ACCTGCGTTC TCTCACTCAT TTGCATCATG CCTTGGATGC CTGTGTGCTG GGGCTTATTC CCTATATCAT ACCCGCTCAT CATAATGGTT TGCTGAGACG TGTTCTTGCC ATGCGCCGAA TTCCGGAAAA ACTGATACCT CAAGTCAGGC CTGTTGCGAA TCAGCGTCAT TATGTCCTGA ATGATGATGG ACGCATGATG TTGCGTGATC TTTCCGCCTC TCTTAAAGAA AATATTCGTG AACAATTGAT GGAGCAGAGG GTCATTCAGC ATGTCCCTGC AGACATGGGC GGCGCTTTAC TCAAGGAAAC CATGCAGAGA GTGCTTTCTG TTGATGGAAG CGGGGAGGAT GCCATGGTTT CTCTTTCCAA AAAGAAAGAT GGGAAGAAGG AAAAAAATCA GGTAAAAGCA AGCAAATTGG TCGGAGTGTT TCCGGAAGGC CCGTCAAAAT TGAAGGCTCT TAAGGCAGCC ATAGAAATTG ATGGCAATTA TGGAGTGGCG TTAGATCCCA AGCCGGTGGT GATCAGACAT ATTAAGGTGT TTAAGCGAAT CATGGCCCTG AAAGAACAGA ACGGCGGCAA GCCGGTGCGC ATTTTGAAAA AAGGCATGTT GATTCATTTA ACCTCGTCTA AAGATCCCAA GCATGCAGGT GTATGGAGAA TTGAATCCAT ACAGGATTCA AAAGGTGGCG TAAAATTAGA TCTTCAGAGA GCGCATTGCG CTGTACCTAA AAATAAGACG CATGAATGTA ATTGGCGTGA AGTAGATCTC ATTTCTTTAT TAAAAAAATA CCAGATGAAA AGATACCCTA CTTCTTATAC GGGAACTCCA CGATAA
|
Protein sequence | MSRSLTFSFD IGYASIGWAV IASASHDDAD PSVCGCGTVL FPKDDCQAFK RREYRRLRRN IRSRRVRIER IGRLLVQAQI ITPEMKETSG HPAPFYLASE ALKGHRTLAP IELWHVLRWY AHNRGYDNNA SWSNSLSEDG GNGEDTERVK HAQDLMDKHG TATMAETICR ELKLEEGKAD APMEVSTPAY KNLNTAFPRL IVEKEVRRIL ELSAPLIPGL TAEIIELIAQ HHPLTTEQRG VLLQHGIKLA RRYRGSLLFG QLIPRFDNRI ISRCPVTWAQ VYEAELKKGN SEQSARERAE KLSKVPTANC PEFYEYRMAR ILCNIRADGE PLSAEIRREL MNQARQEGKL TKASLEKAIS SRLGKETETN VSNYFTLHPD SEEALYLNPA VEVLQRSGIG QILSPSVYRI AANRLRRGKS VTPNYLLNLL KSRGESGEAL EKKIEKESKK KEADYADTPL KPKYATGRAP YARTVLKKVV EEILDGEDPT RPARGEAHPD GELKAHDGCL YCLLDTDSSV NQHQKERRLD TMTNNHLVRH RMLILDRLLK DLIQDFADGQ KDRISRVCVE VGKELTTFSA MDSKKIQREL TLRQKSHTDA VNRLKRKLPG KALSANLIRK CRIAMDMNWT CPFTGATYGD HELENLELEH IVPHSFRQSN ALSSLVLTWP GVNRMKGQRT GYDFVEQEQE NPVPDKPNLH ICSLNNYREL VEKLDDKKGH EDDRRRKKKR KALLMVRGLS HKHQSQNHEA MKEIGMTEGM MTQSSHLMKL ACKSIKTSLP DAHIDMIPGA VTAEVRKAWD VFGVFKELCP EAADPDSGKI LKENLRSLTH LHHALDACVL GLIPYIIPAH HNGLLRRVLA MRRIPEKLIP QVRPVANQRH YVLNDDGRMM LRDLSASLKE NIREQLMEQR VIQHVPADMG GALLKETMQR VLSVDGSGED AMVSLSKKKD GKKEKNQVKA SKLVGVFPEG PSKLKALKAA IEIDGNYGVA LDPKPVVIRH IKVFKRIMAL KEQNGGKPVR ILKKGMLIHL TSSKDPKHAG VWRIESIQDS KGGVKLDLQR AHCAVPKNKT HECNWREVDL ISLLKKYQMK RYPTSYTGTP R
|
| |