Gene Information |
Locus tag | Amuc_1987 |
Symbol | |
ID | 6274111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2413869 |
End bp | 2414870 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642614047 |
Product | squalene-hopene cyclase-like protein |
Protein accession | YP_001878579 |
Protein GI | 187736467 |
COG category | |
COG ID | |
TIGRFAM ID | |
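
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values copied from the record: Start bp, End bp.
gene_length = feature_length_bp(2413869, 2414870)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "1002 333"
```

Under those assumptions the result matches the listed Gene Length (1002 bp) and Protein Length (333 aa).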
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.374107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.15857 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATC CCCGCCTCCT GCTTACCGCT TTCTTGACCT GCTGCATGAC GGCCGCACCG CAAAACGCAG CGGCCCAATC CCCCATGAGA ACGTCTTCTT CCGTGCCTCC CCAGGTGGAA CTGATGTATG TGAAGGGACT ACGCTATCTT CAGAATGCCC AGAAAACGGA TGGAACCTAT GATGGAACTT ACGGGAGGGA GCCCGGCATC ATCGGCTTTT GCCTTATGTC CGTGCTGGCT CACGGAGACG ATCCGAACGC GGGGCCGTAC GCCACCATGG TGCGCCGCTG CGTAGATTAC ATCCTGTCCA AGCAGAACAA GGTATCCGGC TACATTGGGG ATTCCATGTA CAACCACGGC TTCGCCACTC TCGCCCTGGC GGAAGCCTAC GGCATGGTGC GGGACGACCG CATAGGCCCC GCCCTCCGCA AGGCCGTAGC TCTGACACTG ACCGCCCAGA AAAAAAACAA AACGGGAGGC TGGCGCTATT CCCCGGAATC CACGGACGCA GACAGCACGG TGACCGGCTG CCAGCTTGTC TCCCTGTTCG CGGCGCGCAA CGCGGGCATT CCCGTGCCGG ACGAGGCTTT TGAACGCGGC CTCAAGTACA TGGCTTCCTG CCGCGACAAG AAAGGCGCCT ACGGCTACAC CGGGCCGGCG GGTCCCCGCG TCACCCTCAC GGCCATCGGT TCCCTGACGC TGTCCCTGGC GCGCCTTAAA ACGGACCCGT CCTTCAAGGA TTCCCTGGCC TACCTGAAAA AGCACCTGAA TTACCGGGAT TCCACTTATC CGTTTTATTT TGAATATTAC ATGTCACAGG CCCTGTTCCA TGCGGACCAG GAAGTGTGGA AGGAATGGAA TTACAAAAAT ATGCGCTATC TGGGAGCCTC CCAGGCGCCC AACGGGTCCT GGCTTTCCGA TCATTCCGCG GCATATTCCA CTTCCGCCGC CCTGCTTTCC CTTGCGCTTA ATTACAGATT TCTACCCATC TATGAACAAT AG
|
Protein sequence | MKNPRLLLTA FLTCCMTAAP QNAAAQSPMR TSSSVPPQVE LMYVKGLRYL QNAQKTDGTY DGTYGREPGI IGFCLMSVLA HGDDPNAGPY ATMVRRCVDY ILSKQNKVSG YIGDSMYNHG FATLALAEAY GMVRDDRIGP ALRKAVALTL TAQKKNKTGG WRYSPESTDA DSTVTGCQLV SLFAARNAGI PVPDEAFERG LKYMASCRDK KGAYGYTGPA GPRVTLTAIG SLTLSLARLK TDPSFKDSLA YLKKHLNYRD STYPFYFEYY MSQALFHADQ EVWKEWNYKN MRYLGASQAP NGSWLSDHSA AYSTSAALLS LALNYRFLPI YEQ
|
| |
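
The sequences above are printed in space-separated blocks of ten characters, so the grouping has to be removed before any downstream use. The following is a minimal sketch of that cleanup plus two simple checks (GC fraction and a rough complete-CDS test). The helper names are hypothetical, and the start codon set is limited to the three most commonly used bacterial start codons, which is a simplifying assumption; translation table 11 permits a few others.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are checked here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def strip_grouping(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_grouping(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, common start codon, stop codon."""
    cleaned = strip_grouping(seq)
    return (
        len(cleaned) % 3 == 0
        and cleaned[:3] in COMMON_START_CODONS
        and cleaned[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, used as a small example.
example = "ATGAAAAATC CCCGCCTCCT"
print(strip_grouping(example))  # prints "ATGAAAAATCCCCGCCTCCT"
print(gc_fraction(example))     # prints "0.5"
```

Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared with the GC content field of the record.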