Gene Amuc_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2066 
Symbol 
ID6275440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2509841 
End bp2513422 
Gene Length3582 bp 
Protein Length1193 aa 
Translation table11 
GC content57% 
IMG OID642614128 
ProductAldehyde Dehydrogenase 
Protein accessionYP_001878657 
Protein GI187736545 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0534414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATT CATCCATCCC GGACATGATG GCGGAGGCTC GCCGAGGCCA ATGGACTGAC 
CAGCAACTGG CCGCCAAGGC TGTGGAACTG GCGGAGTCCA TTTTAAAGCA GTCGAATGCC
GGGATGAGAG GGAAAGAGAA GAGGCAGGCG CAGCAAATGG AGCGCATGAT GAATGATCCG
GCGGGCAAGG CGTTTACGCT GGCACTGGCG GATCGTGTGT TCCGTCCTTC TTCTCCGGTG
CGGGGTGCTG AGTTGTTCCG CTATCTTTTG GACGGATATG GCGTTCCCCG TTATTTGTCT
GCGGCGGACC GTTTCGCCAT AAAAATGGGG GGGAGGTTTT CCGCTCAATT TCCGGGTGTG
GTCATGCCCG TGATTACCAG CCAGTTGAGA AAGGAGAGCT CCAATGTAAT TCTTCCTGCC
GAGGATGGAA AACTGCGCCC CCACCTGCGC CGCAGGCGCA AGGGCGGCAT CCGGATGAAT
ATCAACCAGC TGGGAGAGGC CATTCTGGGT GAAAGCGAGG CGCACCACCG TCTTCAGCAG
GTCGTGGACC GGCTGACAGA CAAGGACTGT GACTATATTT CCGTCAAGAT TTCAGCCATT
TTCAGCCAGA TTCATCTGGT AGCCTTTGAA GAAACCGTCA AATTGATTCA GGAGCGCCTG
CGCATCCTGT ACCGGGCGGC CATTACCAAT GCGGTAACGT TGCCGGACGG TTCCAGAAAG
CCCAAGTTCG TGAATCTGGA TATGGAGGAA TACCGTGATC TTCATCTGAC GGCGGAAGCG
TTCAAACGTA CGTTGATGGA GGATGAATTC ATGCAGCTGG AGGCGGGGAT TGTGCTCCAG
GCCTATTTGC CGGATTCCTG GGAGGAGCAG ATGAAGCTGT GCGCCTGGGC GAAGGAACGC
GTGGAGCAGG GGGGGGCGCG CATCAAAATA CGCCTGGTGA AAGGCGCCAA CCTGGCGATG
GAGAAGGTGG AGGCCTCCAT GCATGGCTGG GCGCAGGCCC CGTATGGCAC GAAAGCCCAG
GTGGATGCCA ATTACAAGAG AATGCTCCAT TACGGCTGCA TGCCGGATAA TGCCAGATAT
GTGCAGTTCG GCGTAGCTTC CCACAACCTG TTTGATTTGT GTTATGCCAT GCTGCTGCGT
GAGCGGGAAG GCGTGCGTGA CCAGGTGGAA TTCGAAATGC TGGAAGGGAT GGCGAACCAT
CAGGCGCGGG TCATCCGCCA GGCGGCGGAC GGTCTGCTCC TTTATGCCCC TGTGGTTTTG
AAGGAAGATT TTCACAGCGC TATTGCCTAC CTGGTCCGGA GACTGGATGA AAATACCAGT
GAGGAGAACT TCCTGCATGA TCTCTTCGGT ATGACACCGG GGTCCCGGAG CTGGGAGGTC
CAGAAAAAGA GGTTTTTGAA GGCTTGTCAG GAAAAGGATG AGGTGAAGTA CGGCCCCAAC
CGCACGCAGA ACCGCGCTGC GGATCCCATC CAGCCCTCAC ATTACCGGGA CGCCTTTGCC
AACGAGCGGG ATACGGACTG GTCCCTGCGG CAGAATGCGG AATGGATCAA CGGGCTGATT
GCCGCGGAGA AGGAAAAATC CGGCGAGGAA ATCCCTCTGG TTATAGATGG TGAGGAAATT
ACGACTAATC TATGGGGCGT GGGGCGCGAT CCGTCCCGCC ATAATGAAGT TTCCTATAAA
TTCGCTTATG CGGATTTTGA CCAGGTTGAA CATGCCTTGG TCACGGCAGA CAGGGTCCGC
TCCTCCTGGG CGTCCAAAAG CATTGGCGAA CGCGCTGAAA TCCTGCACCG GGCGGCGCAG
GAGCTGTCCA GAATCAGGGG GGAAGCCATT GCTGCCATGG TGAGGGATGC CGGGAAAGCT
CCCACGGAGG CTGATGTGGA GGTGAGCGAG GCCATAGACT TCTGCCGCTA TTACGCGGAA
GGCTTGGACC GCGACGGAAT GAACGACGGC GTGGAAATGT CCCCGCTGGG TATCATTTGC
GTGATGTCTC CCTGGAATTT CCCCTTCGCC ATTCCGACGG GCGGTGTAGC TGCCGCCCTG
ATGGCGGGAA ATGCCGTGGT GTTCAAACCG TCCGAACTGG CCGTTTATAC GGCTTGGCAG
ATTGTCCAGG CGTTCTGGCG TGCCGGTGTG CCTAAAAACG TCCTTCAATT CGTGCCGATG
CCGCGCAATG AAATTTCCTG CAAGTTTCTG ATGGATCCCC GTTTGAACGG TGTGATCATG
ACGGGATCCT ACCGCACCGG AAAAATGCTG CGCGAACTGC GGCCTGACCT GCATGTGCTG
GCTGAAACCA GCGGAAAGGA TGCCATGATC ATCACTGCTA CGGCTGATCC GGACCAGGCT
GTAAAGGATT TGGTGAAAAG CGCTTTCGGT CATTCCGGAC AGAAGTGTTC CGCCGCCAGC
GTGGCTATTG TGGAGGCTTC CGTTTATGAC AATCCCGCCT TTTTGCGTCA GTTGAAGGAT
GCCGCCGCCA GCCTGAAGGT AGGCGGATCC TGGGAAGTCA ACTCCGTGGT GACGCCGCTC
ATCAGGGAGC CGGAAGGGAA TCTGCTCCGT GCGCTGACGC AGCTGGAACC CGGGGAGGAA
TGGCTGCTCA AGCCGGAACC TTCGGAAGAC AACCCGTGCC TCTGGTCTCC CGGCATCCGG
CTGGGGGTGA AACCGGGAAG CTGGTTCCAT CAGACGGAAT GCTTCGGTCC GGTATTGGGA
ATCATCCGTG CGGAAAACCT GGAGGAAGCC ATTGACATCC AAAACGACTC CGAATTTGGC
CTTACCGGCG GCCTTCAGTC CCTGGATGAA CGGGAAATTG CCTTGTGGAA AACTAAAGTG
CAGGTGGGCA ACGCGTACAT CAACCGTGTC ATCACCGGCG CCATTGTCCG CCGCCAGCCG
TTCGGCGGGT GGAACCATTC CTCCATGGGG CCTGGAGCCA AGGCCGGAGG TCCCAACTAC
CTTACCATGC TGGGAAGTTG GGAGGAAAAG GCGCTGCCCC AAAAGCTGCG CACGCCGGGT
GAACGTATCT CCGGACTGGT GGAAAAACTG TGTTCCGAGC TGCCGGACTG CGCCAAGCGC
ATCCGTTCCG CAGCTGGTTC CCAGGCCAAG TGGTGGATGG AGGAATTCGG CGTGGATCAT
GATCCCTCCC GTGTTTACGG AGAAAATAAT ACCTTCCGCT ATATCCCGGT GAAGGGAATT
CTGGCCCGTG TGGAAAACAT GTCCGACGAT GACGTCGCCA TTCTGCTGCT GGGGGCGAAA
CTGTGCGGGG TACTCCTGCA CCTGAGCATA GGGACGAGCC GCCCCTGGAT TCAGAAAATG
CATGGTTATT ATGCTTCCCT GACGGTGGAA ACGGAAGCGG AATTGATCGG ACGGATGCCG
GAAGCCCTTC CAGGCATACG GTTCCTGCGT GGAACAGATA TTTCCGAGAC TCTGGCCAAT
GCCGCCCGTG CCCGGGATGT GGAAGTACTG GACCGTCCCG TGCTTGCCAA TGGACGGTTG
GAACTGCTGG GGTATTTCCG GGAACAGTCT GTTTCCGAAA CCGTTCACCG CTATGGCAAT
CTCATCCCGC CACCAGGCAG TTTTAAAACA GACAGCGTGT GA
 
Protein sequence
MTDSSIPDMM AEARRGQWTD QQLAAKAVEL AESILKQSNA GMRGKEKRQA QQMERMMNDP 
AGKAFTLALA DRVFRPSSPV RGAELFRYLL DGYGVPRYLS AADRFAIKMG GRFSAQFPGV
VMPVITSQLR KESSNVILPA EDGKLRPHLR RRRKGGIRMN INQLGEAILG ESEAHHRLQQ
VVDRLTDKDC DYISVKISAI FSQIHLVAFE ETVKLIQERL RILYRAAITN AVTLPDGSRK
PKFVNLDMEE YRDLHLTAEA FKRTLMEDEF MQLEAGIVLQ AYLPDSWEEQ MKLCAWAKER
VEQGGARIKI RLVKGANLAM EKVEASMHGW AQAPYGTKAQ VDANYKRMLH YGCMPDNARY
VQFGVASHNL FDLCYAMLLR EREGVRDQVE FEMLEGMANH QARVIRQAAD GLLLYAPVVL
KEDFHSAIAY LVRRLDENTS EENFLHDLFG MTPGSRSWEV QKKRFLKACQ EKDEVKYGPN
RTQNRAADPI QPSHYRDAFA NERDTDWSLR QNAEWINGLI AAEKEKSGEE IPLVIDGEEI
TTNLWGVGRD PSRHNEVSYK FAYADFDQVE HALVTADRVR SSWASKSIGE RAEILHRAAQ
ELSRIRGEAI AAMVRDAGKA PTEADVEVSE AIDFCRYYAE GLDRDGMNDG VEMSPLGIIC
VMSPWNFPFA IPTGGVAAAL MAGNAVVFKP SELAVYTAWQ IVQAFWRAGV PKNVLQFVPM
PRNEISCKFL MDPRLNGVIM TGSYRTGKML RELRPDLHVL AETSGKDAMI ITATADPDQA
VKDLVKSAFG HSGQKCSAAS VAIVEASVYD NPAFLRQLKD AAASLKVGGS WEVNSVVTPL
IREPEGNLLR ALTQLEPGEE WLLKPEPSED NPCLWSPGIR LGVKPGSWFH QTECFGPVLG
IIRAENLEEA IDIQNDSEFG LTGGLQSLDE REIALWKTKV QVGNAYINRV ITGAIVRRQP
FGGWNHSSMG PGAKAGGPNY LTMLGSWEEK ALPQKLRTPG ERISGLVEKL CSELPDCAKR
IRSAAGSQAK WWMEEFGVDH DPSRVYGENN TFRYIPVKGI LARVENMSDD DVAILLLGAK
LCGVLLHLSI GTSRPWIQKM HGYYASLTVE TEAELIGRMP EALPGIRFLR GTDISETLAN
AARARDVEVL DRPVLANGRL ELLGYFREQS VSETVHRYGN LIPPPGSFKT DSV