Gene Amuc_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1083 
Symbol 
ID6274021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1293567 
End bp1295663 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content57% 
IMG OID642613134 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_001877690 
Protein GI187735578 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0265246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000000000000296825 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATTACC AGGTTATCCA TCATACGCCC GGCAGGCTCC GTGTACGCAG CGGCCGCTGC 
GCATTCAATC ATGACCAGGC TTACGGCATC GAATGCAGGC TCCTGAAACA GAAAGGCGTA
TACTCCGTCA AGGCCACCCC CTGCAACGGA GGTGTGTTCA TCCTGTATGA AGGAACCAGC
CCCCAGGCCA TTTTCCGCAC GCTGGACAGG CTCAGGCCGG AAACACTCCG CAGCCACGAA
CCGGAAGAGC GCGCGGAGGC CAGGAAACTG GCCCGGTCCT TTTTTCTCCG CATAGCGGGG
AAAGCTTCCT CTTTCCTGTT TTGCAAGGTG TTTCTGCCGC CCCCGCTCAG AATGGCCAAA
GCATTCTGGA ATTACAGTTC CTATTTACGG CGCGGAATGG CCGGCCTGGG AAGCGGACGG
CTCAACGTAG CCGTTCTGGA CGCCGTCTCC ATCGGGGTTT CCATAGGCAC GAAGGCTTAC
GGAACGGCCA ATTCCATCAT GTTCCTTCTT TCCATTTCAG ACCTGCTGGA AAATTACACT
CGGAAAAAGA CCCGGGCGGC GCTTGCGGCC AGCCTGAGCA TAAATATAGA CCGCGTCTGG
CGGGTGGAAG AAGGAGGCAT GCGCCAGGTT CCCATGGAGC AAATCCGCCC GGGGGAGAAA
ATCCGCGTGG ACGCGGGGAA CGTCATCCCT GTGGACGGTA CCGTGCTGTC CGGAGAAGCG
GAAGTCAACC AGGCCGCCAT GACGGGGGAA TCGGAAGCGG CTTCCAAACG GGAAGGGTCC
GTCGTCTTTG CCGGAACCAC GCTGGAAACG GGTTCCCTCG TCATCCGGGT GGACGCCGCA
GGAGACCAGT CCCGCATTAA CAATATCATC GCCCTGATCG ACCATTCGGA GGAACTGAAA
GCGCGCATCC AAAGCCGGGC GGAAAAGCTG GCGGATTCTA TCGTTCCCTA TACCCTGCTC
ACGGCAGGGG CGCTGTTTCT GTTCACCAGG AATATCTCCA AGGCTCTCTC CGTGCTGATG
GTGGATTATT CCTGCGCCAT CAAGCTCGCC ACCCCCATCT CCGTCATCTC CGCCATGAAA
GAGGCCGCAG CGCGCAAGAT CATGATCAAG GGCGGGAAAT TCATGGAGCT GTTCGCCAAA
ACGGACACCA TCGTTTTTGA CAAGACGGGC ACGCTGACCT CCGCCTGCCC TCAGGTAACG
CAGATCATCC CTTTGAGCGA CTGCAGCAGG GAATATATCC TGAAAACGGC AGCATGCCTG
GAAGAACACT TCCCCCACAG CGTGGCGCGG GCCGTTGTGC GCAAAGCGCT GGAGGAGGGC
CTACACCATG AAGAAGAACA TGCGGAGGTG GAATACATCG TGGCCCACGG CATTTCCTCC
CGCCTTCATG GAAAAAAGGT TCTTGTGGGC AGCTACCATT TTCTCTTTGA AGATGAACGC
ATCCCCCTCA CGGAAGAGCA AAGGCTGACC ATACGCAATC ACGCCAGAGG AAAATCCAAC
ATCTTTCTCG CCATAGGCCG CAGAGCAATC GGCATGATCG GCGTCAGCGA CCCGCCCAGG
CCGGAAGCCG CAGAGACGAT CGCCCGGCTG AAACGGCAGG GCATTTCCTC CATCATCATG
CTCACGGGGG ACAGCGAATC CGCAGCCCGC GCCATCAGCC GCCAGTTGGG CATCACGGAA
TACCGTTCCC AGGTTCTGCC TGAAGATAAG GCCCGCTTTA TCCAGCAATT GAAGAAATCC
GGCAAAACAG TGTGCATGGT GGGAGACGGC ATCAACGATT CCCCCGCTCT TTCCTGTGCA
GATGTTTCCG TCTCCATGAA GGATTCTTCC GATATCGCCC GTGAAGTGGC GGATATTTCC
CTGTTGTCCA GCTCTCTGGC TGAACTGGTT GTTTTAAGGG AATTGAGCTG TGCCGTCCTT
GAAAAAATAG AACGCAACTA CCGCTTTATC GTCGGGTTCA ACTCCTCTCT GATCCTGCTA
GGCATGTTCG GGCTGATTAC CCCGGACCTT TCCGCGTTCC TCCATAACGC TTCCACAGTG
TACGTCAGCG CGCGCAGCAC ACGCCGATGC CTTCCCCCTG TCCGGGTTCC GAAGTAA
 
Protein sequence
MHYQVIHHTP GRLRVRSGRC AFNHDQAYGI ECRLLKQKGV YSVKATPCNG GVFILYEGTS 
PQAIFRTLDR LRPETLRSHE PEERAEARKL ARSFFLRIAG KASSFLFCKV FLPPPLRMAK
AFWNYSSYLR RGMAGLGSGR LNVAVLDAVS IGVSIGTKAY GTANSIMFLL SISDLLENYT
RKKTRAALAA SLSINIDRVW RVEEGGMRQV PMEQIRPGEK IRVDAGNVIP VDGTVLSGEA
EVNQAAMTGE SEAASKREGS VVFAGTTLET GSLVIRVDAA GDQSRINNII ALIDHSEELK
ARIQSRAEKL ADSIVPYTLL TAGALFLFTR NISKALSVLM VDYSCAIKLA TPISVISAMK
EAAARKIMIK GGKFMELFAK TDTIVFDKTG TLTSACPQVT QIIPLSDCSR EYILKTAACL
EEHFPHSVAR AVVRKALEEG LHHEEEHAEV EYIVAHGISS RLHGKKVLVG SYHFLFEDER
IPLTEEQRLT IRNHARGKSN IFLAIGRRAI GMIGVSDPPR PEAAETIARL KRQGISSIIM
LTGDSESAAR AISRQLGITE YRSQVLPEDK ARFIQQLKKS GKTVCMVGDG INDSPALSCA
DVSVSMKDSS DIAREVADIS LLSSSLAELV VLRELSCAVL EKIERNYRFI VGFNSSLILL
GMFGLITPDL SAFLHNASTV YVSARSTRRC LPPVRVPK