Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2045 |
Symbol | |
ID | 6274743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2487489 |
End bp | 2490545 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642614106 |
Product | hypothetical protein |
Protein accession | YP_001878636 |
Protein GI | 187736524 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.585641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.431873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTTT TCTTAACTCT TTTTCAAATA TCCTTTTCAT TTTTCCTTGA CGCTCTGATG AAAATCCGGC TCCCTCTCCC CCTCCTGGCT GCCCTGGCGG CGGCATTTCC CGCAGCAACC GCCGCCCCGG CGGGCACGGA ACTGGGAAAC GTCATGTACG TAGGGGACTC CATCACCCAC GGGGTCAACA GCGCCTCCTA CCGCTGGGCC CTCCATAAAA TATTTACAGA CAACGGTATT TCCTACCAGG CGGAAGGAGT CAAAACCGGC AATTATTCCG GGGGCGTTAC TGCGGGGACC TCCTACGGGG GCCAGATATT CAACAATGAA CATTCCTCCC AGGCAAGCGC CAGGGCCTGG GAAATATCCG GCAGGAACAG CGGAAGCAGG TTTGACGGCT CCAACATTTA CAACTGGCTG GGGCTTTCCG GCACCAAGGC AAACGGAAAC GCCTATTCAG GCCAGACATT CACCGGGGAC AATACGCCGG ATACCTTTTT CATGATGATC GGCACCAACG ATCTCCTTTC CGACGGAAAC AACGCCACGC TGGCGGACCG CCTTGACTCC GTTTCCCAAA ACCTCCTGAA CGATATGGAC ACCATCGTGG AGGCCATGTT CCAGGCCAAT GAACATGCAA ATGTCATCAT CCTTACCATT CCCTGCTGGA CCCGGCATTC CAACGGCAAT TCGGACGCCA CCCACCAGGC CGTGGCTGAT TACAATGATC GTCTGAAAAC ATGGGGGAAT GGCAGGCAGA ACGTCACGGT CATTGACATC AATGCCGGCC TCATTGACGT AACATCCTCC ACGCCGTTCT ACGGCGTGAG CTCCATGTTC AACAACCCCG GCAGCGACGG CCTCCATCCC AATGCCCAGG GAGATCTGCT TATGGCCGGC AATATCGCCA GGGCCATGGG CTATGCGGGG CGCACCGCCG GACAGGAAAG AAAAGACGCT GGCGGCCTGG CCATCAATTT TCACCAGGGC GGGCAATCCC CCGCATGGAG CTCGAACAGC CAGTTGGAAA ATAAAGGGTT TTCCCTCGCC AACGTTACTG TGGACGAGAA CGGCATCAGC CTGGGGCAAT CCGGGGAAAG CTCCATTTCC TATTCCTGGG CGGAAGGGAC GGAGCTGCAA AACGGCTTCA CCTTTGATTG CAACCTGGTT CTGGGAAACG GCGCAGCAGA CGGATGGGAT ACTGCCTCCG ATTTCAGCAT CAGCCTGGGG AACAGCTCCT TTTACGGCAC GCTCAATATT AATGAGGCCT ACATCAAATG GGGCGATGCC ATTCTGTATT CCCTGGACAT GTCCGCCAAT GCGGAAAACC TGCGTATGGC ATATATCTGC GGCAATGATA TGGAAGGACT GAAAGGCGGT TATTACGTCT GGCTGGGAGA CATGCTGATT GGGGAAGCGC TTTCCGTTAC CTCCGGCTCC GGCTGCAACG GCCTCACCAT TCAATACAAC GGCAGCGGAA CCGCTGTTCT GCACGACCTG GCCCTGGACG GCACGGGTTC CTATGCCCCT GCCACTTCCG GAATGCTTAA CGCGGAAAAG TCTTTTATTT CTTCCGGCTC CTTCACCCAG GGTGGAACAC CGGAAGGCAA TATTGAGTGG AAAACGGAGG GATTCACCAA AACCGCTGAT AATCTGGCCA GCTCCGGTAC CTTTAATGCC CGCTCCCAGG CGGATTCCTC CGCAGGGGGA ACGGGGAATT CCGTAAACGT CTCCATTACC TCCGGAAACG CCACTCATAT TTATGCGAAC AGCGGCAACT ACACGGGAGA CGTCTGGCTG ACCATCTCCA AGGAGGGACA GGCCTCCGCC TGGTACGGAG CTCACGGAGC TTCCGGCACG CTCAACGGGA ACGCCTTCCT CCGTTTCACG GATGCCGCCA CAGGCGGAAG CACGGTCTTC GGAGCCGTGA ACGCCGCCGG CGTCACCGGA AACGTTTATC TGGAATTCTC TGCGGAAAAC GCCTCCTTCG GCACCTTTAC CAGCAGTAAT TCCTCCTCTG TGGTCGGCTC CTATGCCACG GATATCCAGG GGAATGTGGA CATTGTGGTC AACTCCGGCA CCTTCAACCA TCAAATCATG GGCGGCATCT TCGCAAATGC CAGGACCGGA ACCACCACCA TTGGAGGAAA CACGCATGTT TACATCAACG GCGGCTCCGT CACCGGAAAT GTTATGGGCG GCGGCCTGAC AGGCTCCATT TCCGGCGGCG CCAACGTCAC CGTCACAGGC GGCGTTATCA GCGGTTCCGT TTACGGCGCG GGCCAGGGCG GCTCCATCCT GGCAGGAAGC TCGGTTTGCT TGACAGGAGG CCTGGTTAAA GGGGACGTCT ACGCCGGAGG CAAGGCGGGT TCCATTCAGG GGGACACGTC CGTCACCATC ACCGGAAACA CGGCCACGCT TTATAACGGG AGCTCCTGGG GCAGCATCTC CGGCGGCGGT TCCGGCGGAA CGGTGGAGGG AAACTCCACC GTACGCATCC AGAATCTTTC CTCAGGAACG ACGGCCTACG GCTTTGACAA ATACGCCGGA AACATCAGCG GAGGAACCAA CGTAAGCGGA GACAGGAGCC TGGTGCTGGA CCATGTAACC GTGGACAGCC TCCTGGCTTC CCTGAGTGAC TTCACCCATA TCTCCGCCGT CAACCAGACC CGCACCTCGC TGGATTCCCT GGGCGAGGCG CTCACCGTAA CCATTGAGGC GGGAAGCAGC CTTATCCTGA ACGGAACCTC TGACCTGACC ACGCTTATTC TGGGGGAACA CGCCTCCCTG ACGCTTCAGG GGCTCACCGC AGATGCGGTC ATCGTGGACA TTACGGGAAC AACCAATTAC ACACTCTCCC TGACAGAGAT ACCTGCCAGC CTGGACAATA TCAAATTCCT GAACGACGGC GTTCTGTACG ACGCGGCCAT GTCCATGGAT CTTCAGGCCA ATTCCGCCAT GCTTTTCGCC CAGGTGCCGG AACCCGGAAG CGCCGCCCTG GCCCTGGCAG GACTGGCCCC CCTCCTGTGG AGGAGGCGCA GAAAAATGTC CCATTGA
|
Protein sequence | MDVFLTLFQI SFSFFLDALM KIRLPLPLLA ALAAAFPAAT AAPAGTELGN VMYVGDSITH GVNSASYRWA LHKIFTDNGI SYQAEGVKTG NYSGGVTAGT SYGGQIFNNE HSSQASARAW EISGRNSGSR FDGSNIYNWL GLSGTKANGN AYSGQTFTGD NTPDTFFMMI GTNDLLSDGN NATLADRLDS VSQNLLNDMD TIVEAMFQAN EHANVIILTI PCWTRHSNGN SDATHQAVAD YNDRLKTWGN GRQNVTVIDI NAGLIDVTSS TPFYGVSSMF NNPGSDGLHP NAQGDLLMAG NIARAMGYAG RTAGQERKDA GGLAINFHQG GQSPAWSSNS QLENKGFSLA NVTVDENGIS LGQSGESSIS YSWAEGTELQ NGFTFDCNLV LGNGAADGWD TASDFSISLG NSSFYGTLNI NEAYIKWGDA ILYSLDMSAN AENLRMAYIC GNDMEGLKGG YYVWLGDMLI GEALSVTSGS GCNGLTIQYN GSGTAVLHDL ALDGTGSYAP ATSGMLNAEK SFISSGSFTQ GGTPEGNIEW KTEGFTKTAD NLASSGTFNA RSQADSSAGG TGNSVNVSIT SGNATHIYAN SGNYTGDVWL TISKEGQASA WYGAHGASGT LNGNAFLRFT DAATGGSTVF GAVNAAGVTG NVYLEFSAEN ASFGTFTSSN SSSVVGSYAT DIQGNVDIVV NSGTFNHQIM GGIFANARTG TTTIGGNTHV YINGGSVTGN VMGGGLTGSI SGGANVTVTG GVISGSVYGA GQGGSILAGS SVCLTGGLVK GDVYAGGKAG SIQGDTSVTI TGNTATLYNG SSWGSISGGG SGGTVEGNST VRIQNLSSGT TAYGFDKYAG NISGGTNVSG DRSLVLDHVT VDSLLASLSD FTHISAVNQT RTSLDSLGEA LTVTIEAGSS LILNGTSDLT TLILGEHASL TLQGLTADAV IVDITGTTNY TLSLTEIPAS LDNIKFLNDG VLYDAAMSMD LQANSAMLFA QVPEPGSAAL ALAGLAPLLW RRRRKMSH
|
| |