Gene Information |
Locus tag | Amuc_0396 |
Symbol | |
ID | 6274807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 477908 |
End bp | 479323 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612447 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001877016 |
Protein GI | 187734904 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
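The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon (the record lists the gene as unspliced). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(477908, 479323)
print(length)                  # 1416
print(protein_length(length))  # 471
```

Both values match the Gene Length (1416 bp) and Protein Length (471 aa) fields in the record.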
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTCC CGTTCCAAAC CGCTAATATC CCTCCCATGA AACACCCTCG CACCCGTGTG GCCGTCATTG ACGTGGTGGC CCTTTCCCGC CAGATGATGG AACACATGCC GCGGCTCTCC GCCTGGGCGG AGGGGCGGAG CGTTTCCTCC TTCCCCCCGG CCTTTCCGGC CCTCACCTGC TCTGCCCAGA GCACCTACGT GACAGGGCTT TCCCCGCGGG AGCACGCCAT TCCCGGCAAC GGATGGTACA ACCGGAATAT GTGTGAAATC CAATTCTGGA AGCAGTCCAA CAAGCTGGTG CAGGGCCCGC GCCTCTGGGA GAAACTGAGG GAACGGTACG GTTCCGGCTT CACCTGCGCC AAACTTTTCT GGTGGTACAA CATGTATTCC ACGGCGGACT GGACCATCAC GCCGCGCCCC ATGTACCCGG CAGACGGCCG CAAGATCTTC GACATTTACA CCCAACCCAT GGAACTCCGG GAAACCATTA AAAAGGATCT GGGAGAATTC CCCTTCCCCA CCTTCTGGGG CCCCATGGCA GGGATTCAGT CCTCCCAATG GATAGCAGAC TCCGCCCGGT GGGTGGAACG GAAACATCGC CCTGACCTCA GCCTCATCTA TCTGCCCTAT CTGGACTATG ACCTTCAGAA ATTCGGACCG TCCTCGACCC AGGCTGCCCA CGCGGCAGAG GCTATGGACG GTCTTCTCTG CGACTTGATC GACTTTCTGG AACGGGAAGG CGTCACCCCC GTCGTCCTCA GTGAATACGG TATTTCCGAC GTATCCCGCA GCATTGCCCT CAACCGCCTC TTCCGGGAAC GGGGCTGGAT TACCGTCAAA CCGGAAATGG GTACGGAAAT GCTGGACTGC GGCGCCTCCC GCGCCTTTGC CGTGGCGGAT CACCAGACTG CCCATATCTA CATCAATGAT CCTTCCGTAA AAGAAGAAGT GAAAGCACTG CTCTCCGCCA CACCCGGAGT GGAAGAAATC AGGGAAACGG ACTTCTCCGG CCTTTCTTCC GCGGCTCTGG AACGCCTGCC GGAATTCACC GCCGTCGCAG CCCCGGATGC ATGGTTCACC TACTATTACT GGCTGGATGA CACCAAGGCG CCGGACTTCG CCCGCTGCGT GGACATCCAC CGCAAACCCG GCTATGACCC CGCGGAAATG TTCTTTGATC CGGGCCTTAC CCTCCCCATG TTCCATGCCG CCGCCTTTCT GCTGAAAAAA AAGCTGGGGT TCCGCGCCCT GATGAAAGTT ATCCCCCTCA ATGGCGACCA GGTGAAAGGC TCCCATGGCA GAGACCGGGT GCCTGCAAAC CAGCAGCCCG TATTCATCGG CCCGGCCTTC CTGCCGGAAA TCCATGCTGC TGAGGATGTC CATCAAGCCA TCCTCTCCGT CTTTGAAAAA GAATAA
|
Protein sequence | MEFPFQTANI PPMKHPRTRV AVIDVVALSR QMMEHMPRLS AWAEGRSVSS FPPAFPALTC SAQSTYVTGL SPREHAIPGN GWYNRNMCEI QFWKQSNKLV QGPRLWEKLR ERYGSGFTCA KLFWWYNMYS TADWTITPRP MYPADGRKIF DIYTQPMELR ETIKKDLGEF PFPTFWGPMA GIQSSQWIAD SARWVERKHR PDLSLIYLPY LDYDLQKFGP SSTQAAHAAE AMDGLLCDLI DFLEREGVTP VVLSEYGISD VSRSIALNRL FRERGWITVK PEMGTEMLDC GASRAFAVAD HQTAHIYIND PSVKEEVKAL LSATPGVEEI RETDFSGLSS AALERLPEFT AVAAPDAWFT YYYWLDDTKA PDFARCVDIH RKPGYDPAEM FFDPGLTLPM FHAAAFLLKK KLGFRALMKV IPLNGDQVKG SHGRDRVPAN QQPVFIGPAF LPEIHAAEDV HQAILSVFEK E
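The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied as shown in this record, in blocks of ten bases separated by spaces. `gc_content` is a hypothetical helper, and the example call uses only the first ten-base block of the gene, so its result is not the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten-base block of the gene sequence only.
print(gc_content("ATGGAATTCC"))  # 0.4
```

Passing the full gene sequence string to the same function gives the whole-gene fraction, which can be compared against the reported GC content.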