Gene Information |
Locus tag | Amuc_0381 |
Symbol | |
ID | 6274861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 456376 |
End bp | 457710 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642612432 |
Product | protein of unknown function DUF21 |
Protein accession | YP_001877001 |
Protein GI | 187734889 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
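The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and that the CDS length includes the stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the CDS ends with a single stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
print(feature_lengths(456376, 457710))  # (1335, 444)
```

The result matches the Gene Length (1335 bp) and Protein Length (444 aa) fields.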
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTTT TTGTCACTTT CGTTATTCTT TTTCTGATTC TGGTCAACGC CTTTTATGTT GCAGCAGAAT TCGCCGCAGT GAGCGTACGC CGCAACATGA TCCGGGAAAT GGCGGAGAAC GGCAGCAGGG TGGCTGTTCA TCTGCTTAAG ATTTTGGAAA ATACAAAAGA GCTGGACCGC TACATTGCGG CTTGCCAGTT CGGCATTACC ATTTCCAGCC TGGTTCTGGG CGCATACGGA CAGGTTGAAC TGGCCGCCTA TCTGTTCCCT CTGTTTGAGC GGTTTGGTGG GATGGACTCC GTAATGGCCA ATTCTGCAGC CGCTCTTGTT GTACTGATAG GCTTGACGGT TTTTCAGGTA ATTCTTGGAG AATTGATGCC CAAATCCCTG GCTCTTCAGT TTCCCAAGCA GGCGGCCTTG TATACTTATT ATCCCATGCG ATGGACACTG GCTTTCTTCG CATGGTTCAT TGACTTTCTT AATGGAAGCG GTTTGTTGCT TCTTAAACTG TTCCGGCTCC CACCTGGCGG CCATCAGCAC ATCCACTCCC AGCAGGAGAT TAATATGCTG CTGGATGAAA GCCACCAGGG CGGGATGTTG GAGGAAGATG AGCATGAACG GCTGCACAGC GCTTTGTCCC TGGCGGAACG AACGGTGGAA CAAATTATGA TTCCCCGTTT TCAGCTTGTT TGCCTGGATG TGGATGCCAC ACAGGAAGAT ATTTTGAATA TGATCGCAGA TAGGCCTCAT ACCCATATTC CCGTGTATGA AGGGAACCGT GAATCTGTCA TTGGCATGCT GCATATTAAA GATATGGTAT CCGCTTATGC GGAAAAGGGA ATTCTGCCTC CTCTCCGTTC CATGCTGAGG CAGGTGCCAT GCGTGATGGA AATGCAGACG GTAGAGATGC TGATGGCCCG TTTGCGGGAA GACAGGGCCA AGGAAGCCTT TGTTCTGGAT GAATACGGTA AGTTTGTGGG GCTGGTGACG CTGGAACGTC TGCTGGGAGA AATGGTGGGA GATATAGATG AGGAATTCAT CCGTTCCGGG GAAAAGGTGG AAACCCTTCC GGATGGTTCC GTGCGCATCC CCGGCATGAT GCGTGCCCAT AAGGCGGAAT GCCTGGTACC CTTTTTGATG AATGGCGCCA CTACTGTGGG CGGCTGCGTG ATCAAGCACA TGTCCTGCAT TCCGAAGGAT GGGGACCGCC TGATTATTGC CGGACGCGTG CTTGTGGTGG AAAAGATGGA CCATAACCGT GTTTCCTCTA TCCTGCTGCT GCCTCCGGAG AGAAAGGAAA ATGAATTTGC CATGGAAAGC GGGGTGGATG CATGA
|
Protein sequence | MILFVTFVIL FLILVNAFYV AAEFAAVSVR RNMIREMAEN GSRVAVHLLK ILENTKELDR YIAACQFGIT ISSLVLGAYG QVELAAYLFP LFERFGGMDS VMANSAAALV VLIGLTVFQV ILGELMPKSL ALQFPKQAAL YTYYPMRWTL AFFAWFIDFL NGSGLLLLKL FRLPPGGHQH IHSQQEINML LDESHQGGML EEDEHERLHS ALSLAERTVE QIMIPRFQLV CLDVDATQED ILNMIADRPH THIPVYEGNR ESVIGMLHIK DMVSAYAEKG ILPPLRSMLR QVPCVMEMQT VEMLMARLRE DRAKEAFVLD EYGKFVGLVT LERLLGEMVG DIDEEFIRSG EKVETLPDGS VRIPGMMRAH KAECLVPFLM NGATTVGGCV IKHMSCIPKD GDRLIIAGRV LVVEKMDHNR VSSILLLPPE RKENEFAMES GVDA
|
| |
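As a sanity check on the two sequence fields, the sketch below translates a nucleotide string using the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in amino acid assignments). `translate` is a hypothetical helper written for illustration. It strips the whitespace between the 10-base blocks and stops at the first stop codon. The gene sequence is listed in coding orientation (it begins with ATG), so no reverse complement is applied even though the gene lies on the minus strand.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI standard layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()        # remove block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":                         # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGATTCTTT TTGTCACTTT C"))  # MILFVTF
print(translate("GGGGTGGATG CATGA"))         # GVDA
```

Both fragments agree with the start (MILFVTF...) and end (...GVDA) of the listed protein sequence.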