Gene Amuc_2106 details


Gene Information

Locus tag: Amuc_2106
Symbol:
ID: 6274754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2562143
End bp: 2563453
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 58%
IMG OID: 642614168
Product: Malate dehydrogenase (oxaloacetate-decarboxylating) (NADP(+))
Protein accession: YP_001878696
Protein GI: 187736584
COG category: [C] Energy production and conversion
COG ID: [COG0281] Malic enzyme
TIGRFAM ID:
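
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2562143
end_bp = 2563453

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1311 bp
print(protein_length, "aa")  # 436 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.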


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCCG ATATTAGATT AGATGCCTTG CAGTACCATT CCCAGCCCCG CCCCGGCAAG 
GTGGAAACGC TGCCCTGCAA GCCCTGCTTT TCACAACGGG ATCTGACTCT TGCCTATTCC
CCGGGCGTGG CCGAGCCCTG CCTCCGCATT AAAGAGGATC CTTCTCAAAG CGCCCTGTAC
ACTGGTCGCT CCAATCTGGT GGGCGTAATC ACCAACGGCA CCGCCGTTCT GGGACTGGGC
AATATCGGTC CGGATGCCTC CAAGCCGGTG ATGGAGGGCA AGGGCGTTCT GTTCAAGGTG
TTCGCGGATA TTGACGTTTT TGACATTGAG CTGAACGTGA AGGAACCGGA AAAGCTGATT
GAGACGATCA AGACCATGGA ACCCACTTTC GGCGCCATCA ATCTGGAGGA CATCAAGGCT
CCGGAATGCT TCATGGTGGA AGAACGCCTG CGGGAGGAGA TGAATATTCC CGTGTTTCAT
GACGACCAGC ATGGCACGGC CGTGATTTCC GGCGCCGCCC TGCTGAACGC CGCGGAGTTG
ACGGGCCGCA AGCTGGAGGA TATGAAGGTT GTCGTCGTGG GGGCCGGCGC TGCCGGCATT
TCCTGCGCCA AGTTCTACAT GACGTTAGGG GTGCGTCGCG AACATATCTA CATGTTTGAT
TCCAAGGGGC TGATTCATAC CGGACGCATT GATCTTCATG CCACGAAAGC GCAGTTCTCC
CAGTCGGAAG ACTGCTCCCT GGAGGAGGCC CTTACCGGAG CGGACGTGTT CCTGGGGCTG
TCCACCAAGG GACTGCTCAC GCAGGACATG GTGAAGCTCA TGGCTCCTTC CCCCATCATT
TTCGCCTGTG CGAATCCGGA CCCGGAAATT ACGTATCAGG ATGCTAAAAA AGCGCGGCCT
GACTGCATTA TGGGGTCCGG CCGTTCCGAC TGGCCCAACC AGGTGAACAA TGTTTCCTGT
TTCCCCTTTA TTTTCCGTGC CGCCCTGGAT GTGCGCGCTT CCGTCATCAA TGAACAGATG
AAGATTGCCG CCGCCCGCGC CCTGGCCGAT CTGGCGAAGG AGCCCGTCCC CCAGGAAGTG
ATTGACCTTT ACGGGGGAGC CCCGCTCAGC TTCGGCATCG ACTACGTGAT TCCCAAGCCC
ATTGATCCCC GCATTATTGA ATGGGAGTGC CCGGCGGTAG CCCAGGCGGC CATGATTTCC
GGGGTGGCCC AGTCCCCCAT CCGGGATATG GAAGCCTACA CGCTGGAATT GCGCAAGCGC
ATTGCCGCGG CTCGTGAACG CGTCTCCGGC GTGGTGCGCA GCTATCTTTA A
 
Protein sequence
MSSDIRLDAL QYHSQPRPGK VETLPCKPCF SQRDLTLAYS PGVAEPCLRI KEDPSQSALY 
TGRSNLVGVI TNGTAVLGLG NIGPDASKPV MEGKGVLFKV FADIDVFDIE LNVKEPEKLI
ETIKTMEPTF GAINLEDIKA PECFMVEERL REEMNIPVFH DDQHGTAVIS GAALLNAAEL
TGRKLEDMKV VVVGAGAAGI SCAKFYMTLG VRREHIYMFD SKGLIHTGRI DLHATKAQFS
QSEDCSLEEA LTGADVFLGL STKGLLTQDM VKLMAPSPII FACANPDPEI TYQDAKKARP
DCIMGSGRSD WPNQVNNVSC FPFIFRAALD VRASVINEQM KIAAARALAD LAKEPVPQEV
IDLYGGAPLS FGIDYVIPKP IDPRIIEWEC PAVAQAAMIS GVAQSPIRDM EAYTLELRKR
IAAARERVSG VVRSYL
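
The relationship between the gene sequence and the protein sequence can be reproduced by codon-wise translation. The following is a minimal sketch, not the tool that generated this record. It assumes the gene sequence is given in the coding orientation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). The helper names clean, translate and gc_fraction are hypothetical.

```python
# Amino acids in NCBI order, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGAGTTCCG ATATTAGATT AGATGCCTTG"))  # MSSDIRLDAL
print(translate("AGCTATCTTTAA"))                       # SYL*
```

The first call reproduces the opening ten residues of the protein sequence, and the second shows the final three residues followed by the TAA stop codon. Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field.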