Gene Information |
Locus tag | Amuc_1791 |
Symbol | |
ID | 6274774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2179025 |
End bp | 2179789 |
Gene Length | 765 bp |
Protein Length | 254 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642613854 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001878390 |
Protein GI | 187736278 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
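The coordinate and length fields in this record can be cross-checked against each other. The sketch below is one way to do that. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2179025, 2179789)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (765 bp) and Protein Length (254 aa) fields.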
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.210194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.00588417 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCCGG CTCTCCGGAA TCCCTGTTTT TCCGCTATTC TCTGCGGCAC AGCGCTGCTG GCCCCGCCAG CGCCGGCTCA GTCCGCTGAT GGAGCCGGTC TTCTGGCTTC CACCCCGCAA TCCGTAGAAG ATTTGCAGCG GATTGAACGC CAGCTTCAGC AAATGCTTCC CCGGGTGCTT CCCGCCCTGG TCTGCATTGA ATTAAACAAC GGCAGCGGCT CCGGCATCCT GGTTTCGGAA AAAGGCCTGG TCTTTTCAGC GGCCCACGTT GTGGACAAGA AAGGAACCAC GCTCAAAATC ATCCTGCCGG ATGGAACGCG CCTTCCGGGA AAAACCACGG CGCAAAACAG CAATTCGGAC GCAGGCATGG CCAAAATCAC ATCCCAATTG AACAAAAAAC TGCCCTGCGT GGAAAAAGCG GAAAAAATGC CTCGTGTGGG GGACTGGGTG TTCGCGCTGG GGCACGGCGG GGGGCTGGAC CGGAAACGCG GCCCGATGGT GCGCCTGGGG AGGGTGGTTT CCCTCAAGAA CGGCGTCATT CAGACGGACT GCAAGTTGAT TCGCGGGGAT TCCGGCGGCC CTCTTTTCAA CCTGGATGGA AAGCTGATTG GCATTCACAG CAGGGTCGGT TCCGGTCTGG AAGACAATCT GCACGTGCCC ATGAAAGATT TCGACGCTCT GACGGAGGAG ACAGCGGAAG GAAAGACCTC CCTGACGCCG CCACCGGAAC AGGACAGCCA GCCATTCTCC ACTCAGCCAT CATGA
|
Protein sequence | MTPALRNPCF SAILCGTALL APPAPAQSAD GAGLLASTPQ SVEDLQRIER QLQQMLPRVL PALVCIELNN GSGSGILVSE KGLVFSAAHV VDKKGTTLKI ILPDGTRLPG KTTAQNSNSD AGMAKITSQL NKKLPCVEKA EKMPRVGDWV FALGHGGGLD RKRGPMVRLG RVVSLKNGVI QTDCKLIRGD SGGPLFNLDG KLIGIHSRVG SGLEDNLHVP MKDFDALTEE TAEGKTSLTP PPEQDSQPFS TQPS
|
| |
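The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helper below (not a database function) normalizes such a string and computes its GC percentage, which can then be compared with the GC content field. How the database rounds that value is not stated here.

```python
def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first two 10-base blocks of the gene sequence shown in this record.
print(gc_percent("ATGACCCCGG CTCTCCGGAA"))
```

Passing the full Gene sequence field as a single string gives the value for the whole gene.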