Gene Amuc_1436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1436 
Symbol 
ID6274593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1722304 
End bp1723293 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content58% 
IMG OID642613495 
Productmalate dehydrogenase 
Protein accessionYP_001878039 
Protein GI187735927 
COG category[C] Energy production and conversion 
COG ID[COG0039] Malate/lactate dehydrogenases 
TIGRFAM ID[TIGR01757] malate dehydrogenase, NADP-dependent
[TIGR01759] malate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.756645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAC CTATCACCGT TACCGTTACA GGAGCCGCCG GCCAAATTGC CTATTCCCTT 
CTGTTCCGTA TTGCCTCCGG CAGCATGCTG GGGCCGGATC AGCCGATCAA CCTGCGCCTG
CTGGAAATTC CCCCGGCCAT GAATGCTCTG GAAGGGGTGG TGATGGAGTT GCGTGACGCC
GCGTTCCCGC TGGTTAATGA AATTGTCCCG ACCAGCGATC CTGATGAAGC GTTCGCCGGC
GCCAACTGGT GCCTGCTGGT AGGCTCCGTT CCCCGCAAGG CCGGTATGGA ACGCAAGGAC
CTGCTGGATA TCAACGGCAA GGTGTTCATC GGCCAGGGGC AGGCCATTGC CCGCAGCGCC
GCCAAGGATG TGCGTGTGCT GGTGGTCGGC AACCCCTGCA ATACGAATGC CCTTATTGCC
ATGCACAATG CGAGCGGCGT TCCCTCCGAC CGCTTTTTTG CCATGACTCG CCTGGATGAA
AACCGTGCCA AGAGCCAGCT TGCGGAAAAG GCCGGCGTCC ATGTGACGGA AGTGACCAAC
ATGGCCATCT GGGGCAATCA TTCCTCCACC CAGTACCCTG ATTTCACCAA CGCCAGGATT
GGCGGAAAGC CTGTGACGGA AGTCATCAAG GATACGGAAT GGCTTAAGGG CGATTTCATT
ACCACCGTGC AGCAGCGCGG TGCCGCCATT ATCAAGGCAC GCGGCGCTTC TTCCGCCGCT
TCCGCCGCTT CCGCAGCCGT GGATACCGTC CGCAGCCTGG CTACCCAGAC TCCGGAAGGC
GACTGGTATT CCGTGGCCGT TTGCTCCGAC GGTTCCTACG GCATTGAAAA AGGTCTTATC
TGCTCCTTCC CGGTCCGCAC CACCAAGGAT GGCGGCTGGG AAATCGTGCA GGGGCTGCCG
GTTGATGCGT TCTCCCGTGA AAAAATTGAC GCTACCGTTA ATGAACTGAA GGAAGAACGT
GACGCTGTTT CCTCTTTGTT GAAGCATTAA
 
Protein sequence
MKTPITVTVT GAAGQIAYSL LFRIASGSML GPDQPINLRL LEIPPAMNAL EGVVMELRDA 
AFPLVNEIVP TSDPDEAFAG ANWCLLVGSV PRKAGMERKD LLDINGKVFI GQGQAIARSA
AKDVRVLVVG NPCNTNALIA MHNASGVPSD RFFAMTRLDE NRAKSQLAEK AGVHVTEVTN
MAIWGNHSST QYPDFTNARI GGKPVTEVIK DTEWLKGDFI TTVQQRGAAI IKARGASSAA
SAASAAVDTV RSLATQTPEG DWYSVAVCSD GSYGIEKGLI CSFPVRTTKD GGWEIVQGLP
VDAFSREKID ATVNELKEER DAVSSLLKH