Gene Amuc_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0107 
Symbol 
ID6274946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp133142 
End bp136318 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content58% 
IMG OID642612152 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionYP_001876733 
Protein GI187734621 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGT TTTTCATCAA GCATCCCGCA ATCGCAGCCG TGATTGCCAT TGTGACCACG 
CTGCTGGGGC TGGTCTGCAT GTTCAACCTG CCTATTTCCC AGTATCCGGA AATCACGCCG
CGCACCATCC AGCTTCAGGC CATGTTTCCG GGGGCCAGCG CGCAGGCGGT GGCGGATTCC
GTGGGCACTC CCCTGGAACG CCAGATTTCC GGCGTGCAGG GCATGGACTA CATGACGTCC
GTTTCCTCCA ACAACGGCGT GTATAACCTT TCCGTCATTT TTGAACCGGG TTCTGATACG
GATATTGACC AGGTGCTGAC CAATATGCGC TATGGGCAGG CCTCCTCCCA GCTTCCGCAG
GAAGTTCAGA GCACGGGCGT GACCATCAAG CAGCAGCCGG GGCTTCCGCT GATGATTTAT
TCCCTGACAT CCCCGGACGG CAGTTATAAT TCCGTGGATT TGGCGAACTA TGCCCAGGTG
AAGCTGGTGG ATGAACTCAA GCGCGTGGAG GGCGTGGGGG AAGTTCAGGT GTACGGCGCG
GGGCGTTATG CCATCCGCAT CTGGCTGGAT ACGTCCAAAA TGACCCATTA CGGCGTTTCC
GTCAATGAAG TGCGGGGCGC CATCTCCGCC CAGAACACCA CCAACCCCGG CGGCAAGATT
GGGGCGGACC CCGTGCCGGA CGGCCAGGAG CAGACCATTA CCGTGCGCAC TCAGGGGCGT
CTTTCGGAGC CCCATGAATT TGAAGACATC ATCATTCGCC AGAATGGGGA TGAAATCCTG
TACCTGAAAG ATATTGCCAA GGTGGAACTG GGAGCTGAAG ACTATTCCGC CACCGGCCGC
CTGAACGGAG AAGTTTCCGC GGCTATCGTT ATCTTCCAGT CTCCGGGTTC CAACGCCATC
GCTACGGCGG ACCGCGTGGA AAAACTGCTG GAAGCCAAGG CTCCGCTGAT GCCGGAAGGA
ATTACCGGGC GCGTTTCCCT GGATACGACT ACGGCGGTGC GCTATTCCAT TGATGAAATC
AAGCACACGT TCATGGAAGC CGTCATCCTG GTGGCCCTGG TCGTGTATGT TTTCCTTCAA
AACTGGCGCG CCACCCTCAT TCCGCTCATC GCCGTTCCGG TTTCCCTTAT TTCCACGTTC
TGCCTCTTTC CCGTCCTGGG GTTTTCTCTG AACACCATTT CCCTTCTTGG CATGGTGCTG
GCCATCGGCC TGGTTGTGGA TGACGCCATT GTGGTGGTGG AAGCCGTGCA GGAGCACATA
GACAAGGGGC TCAATCCGCG CATGGCTTCC TTCGCCGCCA TGCAGGAGGT ATCCGGCCCC
GTCATCGCCA TTGCCTTGGT GCTGGCGGCG GTATTCCTGC CTTCCCTGCT GCTGGAGGGC
ATCACGGGAA CGCTGTTCAA GCAGTTCGCC GTGACCATCG CCATTTCCAT GCTCATTTCC
GCCTTTAATG CGCTGACGCT TTCCCCTGCC TTGTGCGCCC TGTTGCTGAA ACCCCGGAAT
GCGGGCAGGA AGAGCCTCTT TTCCCCATTT CACCGTTTGT TCAACTGGTG TTACGGACGT
GTCTCCAACG GCTATACCCG CATGTGCGGA AGCCTGGCCC GCAAGCTGGC GATTTCCATC
CCCCTCCTGT TGTTATTCTG GGGGGCGGTG GCTCCTGTGG CGGAGCGCGT TCCCGGCGGC
TTTCTTCCGG ATGAGGACCA GGGCTTTCTG CTGGCCTGCC TGATCCTGAA GCCGAATACC
TCCCTGCAGG TTGCCTATGA ACAGGACAAA AAATTTGAGG CGGCCCTCCA GGATCCCGCC
GTGAAGAACC TTACTACCGT GGTGGGGCTC AACATCCTCA ACAGCGTGCA GACTCCCGGC
GCCTGCATTG CCTACATTGA GCTCAAGGAC TGGAGCGAAC GGCCGGAAAC CTCCGCGGAG
CTGGCCGGAA AGCTTCAGGG AAAGCTGGCC CAGGCCGGCC TGGACGGTAT GGCGATGGTG
CTGGAACCTC CCGCCATCCC CGGCGTGGGA ACCGCCAACG GGGTTACGAT GGTACTGGAG
GATCTGGAAG GGCAGGGCGT AGCTTATCTG CATGAACAGG TGCAACGGTT CCAGGAAGCG
GCTTCCCGGC GTCCGGAAAT AGCCTTGTGC ATAGACATGA TGATGGCGGA CATGCCGCAG
AAGTACGTGA ACCTGGACAA GGAAAAATGC AAATTCCACA AGGTGGATAT TGACGTGGCC
AACGGCATTC TGGCCTCCTA CAACGGTTCC TCCTTCATCA ATTACTTCAA TGCCTTCGGC
CAGCAGTGGC AAGTGTATAT CCAGGCGCAG GGCGAAGACC GCGCCAGCCT GGAGAAGATG
GACGGCTTCT TTGTCACGAA CGCGGACGGA GCCCGCGTGC CTCTCTCCGC CCTGGTAAAC
ATCAGAGAGA TAGAGGATAC GGAATTCGTG ATGCACCATA ACATCTACAA CGCCGCCAAA
TTGAACGTGA TGCCCCGGCC CGGCCACTCC ACCCAGCAGG TGATGGACGC GCTGGAGGAA
GTGTTCCATC AGACGATGGA TCCTACGAAG GTTGGCTTCG ACTACCAGGA CATGAGCTTC
CAGGAAAATA AAGTGCGCAA CAGTATTGGG CTGGGGGCCA TCTTCACCAT GTCCGCCGTT
TTCGCCTTCC TTATCCTGGT GGCGCTTTAT GAGAAATGGT CTCTTCCCCT GGCGGTTTTC
CTCACGGTTC CCATTGCCGT GCTGGGGGCC TATGCAGGGT TGTTCTGGCA GGGCATGGAG
CTGACGCTCT ACGCGCAGAT CGGTCTGGTG ATGCTGGTGG GTCTGGCGGC CAAGAACGCC
ATCCTCATCG TGGAATTCGC CAATCTGGAG ATGCAGCGCG GCAAGGGCCT GATGGAGGCT
ACGCTGGTTG CCGCGCGTCT GCGCCTGCGT CCCATTTTGA TGACTTCTCT GGCATTCGTT
CTGGGATGTA TCCCCCTGAT GCTTTCCTCC GGTTCCGGAG CTCTGGCGCG CAACGCTATC
GGTACGGTGG TGGTGATTGG CATGGGGGTG GCTACACTGG TGGGCGCTTT TCTGATTCCA
TGTTCTTATG TGTTCATCAT GAGATTATTC CGTATTAAAT TCTCCCTGAA TGATCTGAAG
GAGGATCCGG ATGAAGTGGG AGCCAGGAAA TACCTGGCCG CCCATACGAA GGACTGA
 
Protein sequence
MSAFFIKHPA IAAVIAIVTT LLGLVCMFNL PISQYPEITP RTIQLQAMFP GASAQAVADS 
VGTPLERQIS GVQGMDYMTS VSSNNGVYNL SVIFEPGSDT DIDQVLTNMR YGQASSQLPQ
EVQSTGVTIK QQPGLPLMIY SLTSPDGSYN SVDLANYAQV KLVDELKRVE GVGEVQVYGA
GRYAIRIWLD TSKMTHYGVS VNEVRGAISA QNTTNPGGKI GADPVPDGQE QTITVRTQGR
LSEPHEFEDI IIRQNGDEIL YLKDIAKVEL GAEDYSATGR LNGEVSAAIV IFQSPGSNAI
ATADRVEKLL EAKAPLMPEG ITGRVSLDTT TAVRYSIDEI KHTFMEAVIL VALVVYVFLQ
NWRATLIPLI AVPVSLISTF CLFPVLGFSL NTISLLGMVL AIGLVVDDAI VVVEAVQEHI
DKGLNPRMAS FAAMQEVSGP VIAIALVLAA VFLPSLLLEG ITGTLFKQFA VTIAISMLIS
AFNALTLSPA LCALLLKPRN AGRKSLFSPF HRLFNWCYGR VSNGYTRMCG SLARKLAISI
PLLLLFWGAV APVAERVPGG FLPDEDQGFL LACLILKPNT SLQVAYEQDK KFEAALQDPA
VKNLTTVVGL NILNSVQTPG ACIAYIELKD WSERPETSAE LAGKLQGKLA QAGLDGMAMV
LEPPAIPGVG TANGVTMVLE DLEGQGVAYL HEQVQRFQEA ASRRPEIALC IDMMMADMPQ
KYVNLDKEKC KFHKVDIDVA NGILASYNGS SFINYFNAFG QQWQVYIQAQ GEDRASLEKM
DGFFVTNADG ARVPLSALVN IREIEDTEFV MHHNIYNAAK LNVMPRPGHS TQQVMDALEE
VFHQTMDPTK VGFDYQDMSF QENKVRNSIG LGAIFTMSAV FAFLILVALY EKWSLPLAVF
LTVPIAVLGA YAGLFWQGME LTLYAQIGLV MLVGLAAKNA ILIVEFANLE MQRGKGLMEA
TLVAARLRLR PILMTSLAFV LGCIPLMLSS GSGALARNAI GTVVVIGMGV ATLVGAFLIP
CSYVFIMRLF RIKFSLNDLK EDPDEVGARK YLAAHTKD