Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1844 |
Symbol | |
ID | 6274599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2240665 |
End bp | 2241624 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642613907 |
Product | PDZ/DHR/GLGF domain protein |
Protein accession | YP_001878442 |
Protein GI | 187736330 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.880462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATGA AAACCTGTAT CTTTCTGACA TTGGGATTGC TGGTAGGTTC GGGCTCCGGA TATTCCATTG ACCGGCCCGC AGGAAGTACG GACAACCTTC AGCCGCCTCC TCTGACGCCG TACGCGCATC AGGTGAAGCC GCTCCCCTCT TCCAAACCGG CCAGACTGGG AATTGTTCCG GGCACGGTGC CCCAAGCGCT CGTTGCGCAG CTGGAGCTTA GTGGATTCCC CGGAGTGCTG GTGACCAAAG TGATGCCGGA CAGCCCCGCC GCCAAGGCCG GGCTCCAGGA AAATGACGTC ATGGTCAAGC TGGGGGATGT CTCCCTGTCC GGTCCGCAGT CTGTGACGGA AGCCCTGTCT GAAAAGGTGC CTGGAGACAG GATTACGGCT GTATTTTACC GGAAAGGAAA GAGGGAGACT GTTGAAATTA CCCTGGATGG GGGAACGCTT TCCGCTGAAG AAATACTGGC GGCCCAGGGG GATCCCCGCA CGCAGCCCCG CGCAGTTCCT TCCGTCCGGC GTCAGACGGC GCCTTTTTCC GGAATGGCTG CACGGCCTAA TCTCCCCCAG CGTATTCTGG ATATGCAGCA GATGATGGAT GAGTTTTTGA AGGATTCCGC CATGGATGAT TACCGGATGG ACGACATCAT CGGCCGGATG AACCTGACTC CCGGCGCGGC GCAAATGCTC CGGAGCTTGC AGGGACTTCA TCAAATGCCC ATGCCTCCCA TGGGCAAGGT TTCCGGAGGG GGCCAGAGCA TGTCTTCCGT CCGGATGTCG GATGCCAACG GGACTATCGT GGTTTCTTCT AATTCCCGGA CGGGAACCAC AGTTCATGTG ACGGACTCTG CGGGAAAGGT TCTGTATTCC GGCCCCTACA ATACGCAGGA GGAAAAAGCC GCCGTGCCGG AAGCCGTCAG GGAACGCTTG AAAAACATAG AAACCAATTT CTGCTTTTAA
|
Protein sequence | MDMKTCIFLT LGLLVGSGSG YSIDRPAGST DNLQPPPLTP YAHQVKPLPS SKPARLGIVP GTVPQALVAQ LELSGFPGVL VTKVMPDSPA AKAGLQENDV MVKLGDVSLS GPQSVTEALS EKVPGDRITA VFYRKGKRET VEITLDGGTL SAEEILAAQG DPRTQPRAVP SVRRQTAPFS GMAARPNLPQ RILDMQQMMD EFLKDSAMDD YRMDDIIGRM NLTPGAAQML RSLQGLHQMP MPPMGKVSGG GQSMSSVRMS DANGTIVVSS NSRTGTTVHV TDSAGKVLYS GPYNTQEEKA AVPEAVRERL KNIETNFCF
|
| |