Gene Amuc_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1839 
Symbol 
ID6274691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2234017 
End bp2235894 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content59% 
IMG OID642613902 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_001878437 
Protein GI187736325 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.207467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACC GATTGGAAAA ATTGCTGGAA TGGGGAGGGA CAAGGCGGAA AGTCGCACTG 
TTGTCCGTTT CCGGATTAGC TCTGTTGATG AGCATGCTTG GCATGGAACC TTTGCCCTTT
GAAATTTCAT GGATAGCCAT TGTGCTGTGC GGCGTTCCCA TTGTGCTGGA AGCCGTCCTT
GGCCTGGCGA CGGCCTGGGA TATTAAGGCG GATGTTCTTG TTTCCGTCGC TCTGGTTGCC
TCCGTCATCA TTGGGGAGGA CTTCGCTGCG GCGGAAATTG CATTTATCAT GCAGCTTGGG
GAACTCCTTG AGGAGCTGAC CGTAGCCCGG GCCCGGAAGG GTATTGAAAG GCTGGTCCGT
TTGACCCCGG CGACCGCCCG GAAAGTGGAA GGAACCAGGG AGGAGGTTAT TCCGGCGGAA
GAGGTTCGCA TCGGGGATAT TCTCCGCGTA TTGCCCGGAG AGACGATTCC GGCGGACGGC
GTTATTCTTT CCGGGCACAC TTCCGTCAAC CAGTCCGTAA TGACGGGGGA ACCTCTTCCG
ATAGACAAGG AAACCGGGGA TGAGGTTTCC AGCGGGACGG TCAACCAGTT CGGAACCTTT
GACATGAAGG CGGCCAGGAC CGGGAAGGAC AGTTCCATCC AGCGCATGGT GCGCCTCGTC
CAATCCGCAG ATGCGGGGAA GGCTAAAATA GTGAGGCTGG CAGACCGCTG GGCCACCTGG
ATCGTTGCCA TCGCCCTGGT TTCCGCAGGG GGGGCCTGGA TGGCCACGGG CGAACTGATC
CGCGCCGTTA CCATTCTGGT GGTATTCTGT CCCTGCGCGC TTGTGCTGGC AACTCCCACG
GCGGTCATGG CCGCTATCGG GAACGCCACC AGGCATGGCT TCCTGGTGCG GGAAGGGGAT
GCCCTGGAAC GCCTGTCCTG CGTGTCCCAC CTCACTTTCG ACAAGACGGG CACGCTTACC
TGCGGTGCTC CGCGCGTGGC GGCCGTGCGC AGTTTCCTGC CGGAGCTCCC GGAACAGGAA
CTTTACAGAT ATGCCGCCTG TGCGGAACTG CGTTCTGAAC ATCCTCTGGG CAAGGCGATT
GTCCGCTGTT ACCGGAAAGA TTCAGTGCAG AAACTTCCCC AACCGGAACA ATTCCGCATG
ATTCCCGGCA GGGGAGTACG GGCTGTGGCG CAGGGAAAGG AACTTCTGGC CGGCAACCTG
GAATTGTTCA GGGAAAACGG GGTGGAGCTG TCCGATGAGG CCAGACGTGC GGCGGAACGG
TATCAGGAGG AGGGGTGCTC CGTGATTTTC ATAGGAATGG ACCGGCAGGC CGCAGGGTTG
ATCGCCCTGT CGGATACCTT GCGGGAGAAT GCCGTAGCTA CCATCCGGGA AGTACGGGAT
GCCGGAGTGG TTCCCGTATT GCTGACCGGG GACCATGGGA ATGCCGCCGC GCGTGTCGCC
GGACAACTGG GGATTGACCT GGTTTGCGCG GAATGCCTGC CGGAGGACAA ATTCAACTGG
ATTGACGCGT GCCAGAAAGA GCGGAACCGT GTGTGCATGG TTGGGGACGG AATCAATGAT
GCCCTTGCCT TGAAAACCTC TCATGCGGGA ATTGCCATGG GCGGCATAGG AAGTGACATT
GCCGTGGATG CCGCAGATAT TGTCCTGGTC AATGACGATA TCCGGGAACT GCCCCATCTG
CTCCGTCTGT CAAGGCGCAT GATGGGGACA ATCAAATGCA ATCTGACATT CTCCATGGCC
CTGAACTTCA TCGCCATCAT CCTGGCGGCG GGCGGCATCC TGAATCCCGT GGCTGGCGCG
CTGGTTCACA ATGCGGGCTC CGTCGTCGTG ATCGCCAACT CCGTTTTTCT GCTGAAATGG
AGGAGGAAAA ACGCGTGA
 
Protein sequence
MLDRLEKLLE WGGTRRKVAL LSVSGLALLM SMLGMEPLPF EISWIAIVLC GVPIVLEAVL 
GLATAWDIKA DVLVSVALVA SVIIGEDFAA AEIAFIMQLG ELLEELTVAR ARKGIERLVR
LTPATARKVE GTREEVIPAE EVRIGDILRV LPGETIPADG VILSGHTSVN QSVMTGEPLP
IDKETGDEVS SGTVNQFGTF DMKAARTGKD SSIQRMVRLV QSADAGKAKI VRLADRWATW
IVAIALVSAG GAWMATGELI RAVTILVVFC PCALVLATPT AVMAAIGNAT RHGFLVREGD
ALERLSCVSH LTFDKTGTLT CGAPRVAAVR SFLPELPEQE LYRYAACAEL RSEHPLGKAI
VRCYRKDSVQ KLPQPEQFRM IPGRGVRAVA QGKELLAGNL ELFRENGVEL SDEARRAAER
YQEEGCSVIF IGMDRQAAGL IALSDTLREN AVATIREVRD AGVVPVLLTG DHGNAAARVA
GQLGIDLVCA ECLPEDKFNW IDACQKERNR VCMVGDGIND ALALKTSHAG IAMGGIGSDI
AVDAADIVLV NDDIRELPHL LRLSRRMMGT IKCNLTFSMA LNFIAIILAA GGILNPVAGA
LVHNAGSVVV IANSVFLLKW RRKNA