Gene Information |
Locus tag | Amuc_2066 |
Symbol | |
ID | 6275440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2509841 |
End bp | 2513422 |
Gene Length | 3582 bp |
Protein Length | 1193 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642614128 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_001878657 |
Protein GI | 187736545 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0534414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGATT CATCCATCCC GGACATGATG GCGGAGGCTC GCCGAGGCCA ATGGACTGAC CAGCAACTGG CCGCCAAGGC TGTGGAACTG GCGGAGTCCA TTTTAAAGCA GTCGAATGCC GGGATGAGAG GGAAAGAGAA GAGGCAGGCG CAGCAAATGG AGCGCATGAT GAATGATCCG GCGGGCAAGG CGTTTACGCT GGCACTGGCG GATCGTGTGT TCCGTCCTTC TTCTCCGGTG CGGGGTGCTG AGTTGTTCCG CTATCTTTTG GACGGATATG GCGTTCCCCG TTATTTGTCT GCGGCGGACC GTTTCGCCAT AAAAATGGGG GGGAGGTTTT CCGCTCAATT TCCGGGTGTG GTCATGCCCG TGATTACCAG CCAGTTGAGA AAGGAGAGCT CCAATGTAAT TCTTCCTGCC GAGGATGGAA AACTGCGCCC CCACCTGCGC CGCAGGCGCA AGGGCGGCAT CCGGATGAAT ATCAACCAGC TGGGAGAGGC CATTCTGGGT GAAAGCGAGG CGCACCACCG TCTTCAGCAG GTCGTGGACC GGCTGACAGA CAAGGACTGT GACTATATTT CCGTCAAGAT TTCAGCCATT TTCAGCCAGA TTCATCTGGT AGCCTTTGAA GAAACCGTCA AATTGATTCA GGAGCGCCTG CGCATCCTGT ACCGGGCGGC CATTACCAAT GCGGTAACGT TGCCGGACGG TTCCAGAAAG CCCAAGTTCG TGAATCTGGA TATGGAGGAA TACCGTGATC TTCATCTGAC GGCGGAAGCG TTCAAACGTA CGTTGATGGA GGATGAATTC ATGCAGCTGG AGGCGGGGAT TGTGCTCCAG GCCTATTTGC CGGATTCCTG GGAGGAGCAG ATGAAGCTGT GCGCCTGGGC GAAGGAACGC GTGGAGCAGG GGGGGGCGCG CATCAAAATA CGCCTGGTGA AAGGCGCCAA CCTGGCGATG GAGAAGGTGG AGGCCTCCAT GCATGGCTGG GCGCAGGCCC CGTATGGCAC GAAAGCCCAG GTGGATGCCA ATTACAAGAG AATGCTCCAT TACGGCTGCA TGCCGGATAA TGCCAGATAT GTGCAGTTCG GCGTAGCTTC CCACAACCTG TTTGATTTGT GTTATGCCAT GCTGCTGCGT GAGCGGGAAG GCGTGCGTGA CCAGGTGGAA TTCGAAATGC TGGAAGGGAT GGCGAACCAT CAGGCGCGGG TCATCCGCCA GGCGGCGGAC GGTCTGCTCC TTTATGCCCC TGTGGTTTTG AAGGAAGATT TTCACAGCGC TATTGCCTAC CTGGTCCGGA GACTGGATGA AAATACCAGT GAGGAGAACT TCCTGCATGA TCTCTTCGGT ATGACACCGG GGTCCCGGAG CTGGGAGGTC CAGAAAAAGA GGTTTTTGAA GGCTTGTCAG GAAAAGGATG AGGTGAAGTA CGGCCCCAAC CGCACGCAGA ACCGCGCTGC GGATCCCATC CAGCCCTCAC ATTACCGGGA CGCCTTTGCC AACGAGCGGG ATACGGACTG GTCCCTGCGG CAGAATGCGG AATGGATCAA CGGGCTGATT GCCGCGGAGA AGGAAAAATC CGGCGAGGAA ATCCCTCTGG TTATAGATGG TGAGGAAATT ACGACTAATC TATGGGGCGT GGGGCGCGAT CCGTCCCGCC ATAATGAAGT TTCCTATAAA TTCGCTTATG CGGATTTTGA CCAGGTTGAA CATGCCTTGG TCACGGCAGA CAGGGTCCGC TCCTCCTGGG CGTCCAAAAG CATTGGCGAA CGCGCTGAAA TCCTGCACCG GGCGGCGCAG 
GAGCTGTCCA GAATCAGGGG GGAAGCCATT GCTGCCATGG TGAGGGATGC CGGGAAAGCT CCCACGGAGG CTGATGTGGA GGTGAGCGAG GCCATAGACT TCTGCCGCTA TTACGCGGAA GGCTTGGACC GCGACGGAAT GAACGACGGC GTGGAAATGT CCCCGCTGGG TATCATTTGC GTGATGTCTC CCTGGAATTT CCCCTTCGCC ATTCCGACGG GCGGTGTAGC TGCCGCCCTG ATGGCGGGAA ATGCCGTGGT GTTCAAACCG TCCGAACTGG CCGTTTATAC GGCTTGGCAG ATTGTCCAGG CGTTCTGGCG TGCCGGTGTG CCTAAAAACG TCCTTCAATT CGTGCCGATG CCGCGCAATG AAATTTCCTG CAAGTTTCTG ATGGATCCCC GTTTGAACGG TGTGATCATG ACGGGATCCT ACCGCACCGG AAAAATGCTG CGCGAACTGC GGCCTGACCT GCATGTGCTG GCTGAAACCA GCGGAAAGGA TGCCATGATC ATCACTGCTA CGGCTGATCC GGACCAGGCT GTAAAGGATT TGGTGAAAAG CGCTTTCGGT CATTCCGGAC AGAAGTGTTC CGCCGCCAGC GTGGCTATTG TGGAGGCTTC CGTTTATGAC AATCCCGCCT TTTTGCGTCA GTTGAAGGAT GCCGCCGCCA GCCTGAAGGT AGGCGGATCC TGGGAAGTCA ACTCCGTGGT GACGCCGCTC ATCAGGGAGC CGGAAGGGAA TCTGCTCCGT GCGCTGACGC AGCTGGAACC CGGGGAGGAA TGGCTGCTCA AGCCGGAACC TTCGGAAGAC AACCCGTGCC TCTGGTCTCC CGGCATCCGG CTGGGGGTGA AACCGGGAAG CTGGTTCCAT CAGACGGAAT GCTTCGGTCC GGTATTGGGA ATCATCCGTG CGGAAAACCT GGAGGAAGCC ATTGACATCC AAAACGACTC CGAATTTGGC CTTACCGGCG GCCTTCAGTC CCTGGATGAA CGGGAAATTG CCTTGTGGAA AACTAAAGTG CAGGTGGGCA ACGCGTACAT CAACCGTGTC ATCACCGGCG CCATTGTCCG CCGCCAGCCG TTCGGCGGGT GGAACCATTC CTCCATGGGG CCTGGAGCCA AGGCCGGAGG TCCCAACTAC CTTACCATGC TGGGAAGTTG GGAGGAAAAG GCGCTGCCCC AAAAGCTGCG CACGCCGGGT GAACGTATCT CCGGACTGGT GGAAAAACTG TGTTCCGAGC TGCCGGACTG CGCCAAGCGC ATCCGTTCCG CAGCTGGTTC CCAGGCCAAG TGGTGGATGG AGGAATTCGG CGTGGATCAT GATCCCTCCC GTGTTTACGG AGAAAATAAT ACCTTCCGCT ATATCCCGGT GAAGGGAATT CTGGCCCGTG TGGAAAACAT GTCCGACGAT GACGTCGCCA TTCTGCTGCT GGGGGCGAAA CTGTGCGGGG TACTCCTGCA CCTGAGCATA GGGACGAGCC GCCCCTGGAT TCAGAAAATG CATGGTTATT ATGCTTCCCT GACGGTGGAA ACGGAAGCGG AATTGATCGG ACGGATGCCG GAAGCCCTTC CAGGCATACG GTTCCTGCGT GGAACAGATA TTTCCGAGAC TCTGGCCAAT GCCGCCCGTG CCCGGGATGT GGAAGTACTG GACCGTCCCG TGCTTGCCAA TGGACGGTTG GAACTGCTGG GGTATTTCCG GGAACAGTCT GTTTCCGAAA CCGTTCACCG CTATGGCAAT CTCATCCCGC CACCAGGCAG TTTTAAAACA GACAGCGTGT GA
|
Protein sequence | MTDSSIPDMM AEARRGQWTD QQLAAKAVEL AESILKQSNA GMRGKEKRQA QQMERMMNDP AGKAFTLALA DRVFRPSSPV RGAELFRYLL DGYGVPRYLS AADRFAIKMG GRFSAQFPGV VMPVITSQLR KESSNVILPA EDGKLRPHLR RRRKGGIRMN INQLGEAILG ESEAHHRLQQ VVDRLTDKDC DYISVKISAI FSQIHLVAFE ETVKLIQERL RILYRAAITN AVTLPDGSRK PKFVNLDMEE YRDLHLTAEA FKRTLMEDEF MQLEAGIVLQ AYLPDSWEEQ MKLCAWAKER VEQGGARIKI RLVKGANLAM EKVEASMHGW AQAPYGTKAQ VDANYKRMLH YGCMPDNARY VQFGVASHNL FDLCYAMLLR EREGVRDQVE FEMLEGMANH QARVIRQAAD GLLLYAPVVL KEDFHSAIAY LVRRLDENTS EENFLHDLFG MTPGSRSWEV QKKRFLKACQ EKDEVKYGPN RTQNRAADPI QPSHYRDAFA NERDTDWSLR QNAEWINGLI AAEKEKSGEE IPLVIDGEEI TTNLWGVGRD PSRHNEVSYK FAYADFDQVE HALVTADRVR SSWASKSIGE RAEILHRAAQ ELSRIRGEAI AAMVRDAGKA PTEADVEVSE AIDFCRYYAE GLDRDGMNDG VEMSPLGIIC VMSPWNFPFA IPTGGVAAAL MAGNAVVFKP SELAVYTAWQ IVQAFWRAGV PKNVLQFVPM PRNEISCKFL MDPRLNGVIM TGSYRTGKML RELRPDLHVL AETSGKDAMI ITATADPDQA VKDLVKSAFG HSGQKCSAAS VAIVEASVYD NPAFLRQLKD AAASLKVGGS WEVNSVVTPL IREPEGNLLR ALTQLEPGEE WLLKPEPSED NPCLWSPGIR LGVKPGSWFH QTECFGPVLG IIRAENLEEA IDIQNDSEFG LTGGLQSLDE REIALWKTKV QVGNAYINRV ITGAIVRRQP FGGWNHSSMG PGAKAGGPNY LTMLGSWEEK ALPQKLRTPG ERISGLVEKL CSELPDCAKR IRSAAGSQAK WWMEEFGVDH DPSRVYGENN TFRYIPVKGI LARVENMSDD DVAILLLGAK LCGVLLHLSI GTSRPWIQKM HGYYASLTVE TEAELIGRMP EALPGIRFLR GTDISETLAN AARARDVEVL DRPVLANGRL ELLGYFREQS VSETVHRYGN LIPPPGSFKT DSV
|
| |
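The numeric fields in this record can be cross-checked against each other. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the stated gene length includes the terminal stop codon; both assumptions are consistent with the values above but are not stated by the record. The helper names and the partial codon table are hypothetical and cover only the codons needed for the first ten residues.

```python
# Values copied from the record above.
START_BP = 2509841
END_BP = 2513422
GENE_LENGTH_BP = 3582
PROTEIN_LENGTH_AA = 1193

# First 30 nucleotides and first 10 residues, as listed in the Sequence section.
GENE_PREFIX = "ATGACAGATT CATCCATCCC GGACATGATG"
PROTEIN_PREFIX = "MTDSSIPDMM"

# Partial codon table (assumption: only the codons needed for this prefix).
# These assignments are the same in the standard code and translation table 11.
PARTIAL_CODONS = {
    "ATG": "M", "ACA": "T", "GAT": "D", "TCA": "S",
    "TCC": "S", "ATC": "I", "CCG": "P", "GAC": "D",
}


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate_prefix(dna: str, table: dict) -> str:
    """Translate whole codons of a space-separated DNA string using `table`."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate_prefix(GENE_PREFIX, PARTIAL_CODONS) == PROTEIN_PREFIX)
```

Because the listed gene sequence already begins with ATG and ends with TGA, the sketch treats it as the coding strand and does not reverse-complement it, even though the record places the gene on the minus strand.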