Gene Amuc_1095 details

Gene Information

Locus tag: Amuc_1095
Symbol:
ID: 6274007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1306359
End bp: 1308284
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 61%
IMG OID: 642613146
Product: heavy metal translocating P-type ATPase
Protein accession: YP_001877702
Protein GI: 187735590
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2217] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
            [TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
            [TIGR01525] heavy metal translocating P-type ATPase
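
The coordinates and lengths listed in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1306359, 1308284)
print(length, protein_length(length))  # prints: 1926 641
```

Both values agree with the Gene Length and Protein Length fields in this record.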


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000123465
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.000000122053
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCACGG AACATTCCCA CGAACATTTT CCGGAAGGTG CTTCCTGTTG TTCCGGAAGC 
TGCGGTTCTG GAAGCTGTTG CTGCTCCGGA GACGCCATCG GGCCTGTTCC CGCCGTGCTG
GGCATTGCCC TGTTCATCAG TGCGCTGGCG GCGGGCGGCG ATTCCCTGGC CGGTATGGCC
GCCTACGTGG GCTCCTACCT GCTGATAGGC TGGGATGTAC TGAAGGCCGC CTTTCTGGGG
ATGATGCGCG GCCGCGCCAT CAATGAAAAT TTTCTGATGA GTATTGCTTC CCTGGGGGCG
ATGTTCCTGG GGGATTATTC CGAGGCGGTG GGGGTGATGC TGTTCTACCG CGTGGGAGAG
TACCTTCAGG AACGTGCGGT GGGCAGCTCC CGCCGGTCCG TAAGCGAACT GATGAGTCTG
AGGCCTGAGG CCGTCCATGT GAAAAAAGCC GGAGCCGTAG ATGACGTTCC TCCTTCCGAG
GTTCTGCCCG GCTCCCTGAT TGAAGTGCGC CCCGGCGAAC GCGTTCCGTT GGACGGTGTG
GTAACGGGAG GCTGCTCCGT GCTGGATACC TCCGCCATGA CCGGGGAATC CCTGCCGGTG
GAAGCCGGAT CCGGAAGCTC CGTGCTGGCG GGGTACATCA ACGGGCAGGG TGTGCTGGAA
GTGTGTACGG AGCGGGACTG GCGGCATTCC GCCCTGGCGC GGGTCCAGGA ACTGGTGGAA
GCCGCATCAG GCCGCAAATC CCCTCTGGAG GGGAGGCTGT CCTCATTTTC CCGCATTTAT
ACGCCGCTGG TGATTTCCAT AGCCGCTCTG GTATTTTTGC TTTATCCCCT TGTGACGGGC
GGAAGCTGGG CGGACGGTCT GTTCCGCGCC CTGGTCCTGC TGGTGATTTC CTGCCCCTGC
GCGCTGGTGC TGTCCATCCC GCTGGGCTTT TTTGCCGGAA TAGGGAGAGC GGCGCGGCAG
GGGATTTTGC TGAAAGGGAG CAACTATCTG GATGCCCTGC GGAAGGTGAA GACGGTGGTG
TTTGACAAAA CCGGAACCTT GACGGAAGGC GTTTTTTCCG TGGATGAAGT GCTTCCCCGT
GACGGCGTTT CCCCGGAGGA ACTGCTGTAC TGGGCGGCCC ATGCGGAACT CTCCGCCTCC
CATCCGCTGG GGCGTTCCAT TGTGAAGGCA TATGAAGGGA CTCTGTTTCC GGACCGCGTG
GCGGAACTGG TGGAGGTGAC GGGCGGCGGC GTTTCAGCCC GCGTGGAAGG GAGACCGGTT
CTGGTGGGGA AAAAGTCTTT TCTTCAAGAG GCCGGCGTAA GGACGGAGGA GGGAGAAGAC
CGTGGCGTGA CCGTTTATGC GGCGTTGGAC GGCATACTTC TGGGATGCCT CCGCCTGTCT
GACCGCGTCA AGCCGGGAGC GGAGCGTGCG GTGCGGAAAT TGAGGGAACT GGGCGTTTCC
AACCTGGTCA TGCTGACGGG GGATTCTTCT TCCGCCGGAA CGGAAGTGGG GCTTAAGCTG
GGGATGGACG GGGTATTCTG CGGGCTGATG CCTGCCGGCA AGCTGGAGCA TGTGCGCCGG
CTGAAACCTG AAACGGGGCT GCTTGCCTTT GTGGGGGACG GTATGAATGA CGCCCCTTCC
CTCGCTGCTG CCGACATCGG AATTGCCATG GGCGGCGTAG GGTCCGATAC GGCTCTTCAG
GCGGCTGACG TAGTGGTGAT GAAGGGAGAT CCTTTGGCTG TTCCGCTGGG GATGATGCTT
TCCCAAGCCA CAGAGCGCAT CATTGTGCAG AACATCGTTC TTATTTTGGG CGTCAAAATT
CTGGTCATGG TGCTGGGTAT TCTGGGGCTG GCTGGGATGT GGGCCGCCGT GATGGCGGAT
GTGGGCGTCT GCCTGCTTGC GGTGGGCAAC TCCATGCGTA TTTTCCGGGT GAAGCTGGAC
ATGTGA
 
Protein sequence
MSTEHSHEHF PEGASCCSGS CGSGSCCCSG DAIGPVPAVL GIALFISALA AGGDSLAGMA 
AYVGSYLLIG WDVLKAAFLG MMRGRAINEN FLMSIASLGA MFLGDYSEAV GVMLFYRVGE
YLQERAVGSS RRSVSELMSL RPEAVHVKKA GAVDDVPPSE VLPGSLIEVR PGERVPLDGV
VTGGCSVLDT SAMTGESLPV EAGSGSSVLA GYINGQGVLE VCTERDWRHS ALARVQELVE
AASGRKSPLE GRLSSFSRIY TPLVISIAAL VFLLYPLVTG GSWADGLFRA LVLLVISCPC
ALVLSIPLGF FAGIGRAARQ GILLKGSNYL DALRKVKTVV FDKTGTLTEG VFSVDEVLPR
DGVSPEELLY WAAHAELSAS HPLGRSIVKA YEGTLFPDRV AELVEVTGGG VSARVEGRPV
LVGKKSFLQE AGVRTEEGED RGVTVYAALD GILLGCLRLS DRVKPGAERA VRKLRELGVS
NLVMLTGDSS SAGTEVGLKL GMDGVFCGLM PAGKLEHVRR LKPETGLLAF VGDGMNDAPS
LAAADIGIAM GGVGSDTALQ AADVVVMKGD PLAVPLGMML SQATERIIVQ NIVLILGVKI
LVMVLGILGL AGMWAAVMAD VGVCLLAVGN SMRIFRVKLD M
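
The sequences in this record are laid out in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch of that step, with a helper for GC fraction; `clean_sequence` and `gc_fraction` are hypothetical names chosen for illustration, and only the first line of the gene sequence is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, copied verbatim.
first_line = "ATGAGCACGG AACATTCCCA CGAACATTTT CCGGAAGGTG CTTCCTGTTG TTCCGGAAGC"
dna = clean_sequence(first_line)
print(len(dna), dna[:3])  # prints: 60 ATG
```

Applied to the full gene sequence, the cleaned length should match the 1926 bp listed under Gene Length, and the result of `gc_fraction` can be compared with the listed GC content of 61%.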