Gene Amuc_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2042 
Symbol 
ID6274779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2481336 
End bp2484551 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content59% 
IMG OID642614103 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionYP_001878633 
Protein GI187736521 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.798214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0985103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCT TCTTCATCAA ACACCCGACC ATTGCCATCG TCATCTCGAT CGTGATGATC 
CTGCTCGGCG GCCTGTCCCT GATGGGGCTG CCCATTGAGC AGTATCCGAA CATCGTCCCG
CCCACCATCA AGATGCAGGC GACCTACCCG GGCGCGAATG CGGAGACGGT GGCCAATTCC
GTAGCCTCCC CCATTGAGCA GTCCATTTCC GGCGTCGTGG GGATGGATTA CATGACTTCC
ACCAACGCCA ACAACGGCAT TTGCTCCCTG AGCATCGTCT TTGAAGTTGG CACGGACCCC
AACATGGACC AGACTCTGGC CTACATGCGC TACGGGCAGG CCACCGCCCA GATTCCCGCG
GAGGTCTCCC AGATGGGCAT CACCATCACG CAGTCCACAG GCAGCCCGCT GGCCGTAATC
AACCTGTATT CTCCGGATGA CAGCCTGAAC GCCATCTTCC TGAGCAACTA TGCCTACGTG
AGCCTGGTGG ACCCCGTGAA ACGCGTTCCC GGCGTGGGCG ACGTGCAGGT GTTCGGCGCA
GGCCGCTACG CCATGCGCAT CTGGCTGGAC ACCACCAAAA TGGCCGCTCA GAACATCTCC
GTGGGAGAAG TCCAATCCGC CATCCAGGCG CAGAACACCG TGATTCCCGG CGGCCAGATC
GGCGCGGAGC CGGCCCCTCC CGGAACGGAA TTCACATACA GCATCCAGAC CAAGGGACGC
CTTCAGACGG CTGAAGAATT CGGCAACATC ATCATCCGTG CGGACGGGAA CAAGCTTCTT
TACCTGAAAG ACATCGCCAA GGTGGAGCTG GGTTCCCAGA CTTATAACGT ATCCAGTAAA
TACAACGGCA GGGATTCCGG CGCCATCGCC GTGTACGCCG CTCCCGGCTC CAACGCCATC
AACACAGTGG ACGCCCTGGT CAAGCTCTTT GAAGACCGCT CCCGCAGCTT CCCCGCCGGA
ATGGAATACA ACCTGACGCT GGACACCACG CTGGCCGTGC GCGCCTCCAT TGAGGAAATT
GAGCACACGC TGCTGGAAGC CCTCCTGCTG GTGGTTCTGG TGGTGTTCGT CTTCCTGCAA
GGCTGGCGCG CCACGCTGAT TCCGGCCATC GCCGTTCCGG TTTCCATCAT CGCCACCTTC
GCGCTGTTTC CGCTCCTGGG CTTCACGTTG AATACCATCT GCCTCATGGG CATGGTGCTG
GCCATCGGCC TGGTGGTGGA TGACGCCATT GTGGTGGTGG AAGCCGTGGA ATCCCACATG
GAGCGCGGCC TCACGCCCCG CCAGGCGGCC TTCGCCGCCA TGGAGGAAGT ATCCGGCCCC
GTCATCGCCA TCGCCCTGGT GCTGGCGGCG GTGTTCCTGC CCTCCCTGCT GCTGCCGGGC
ATTACAGGCA CGCTCTTCCA GCAGTTTGCC GTGACCATCG CCATTTCCAT GCTCATTTCC
GCGTTCAACG CGCTGACGCT CTCCCCGGCG CTGTCCGCCA TTCTGCTCAA ACCCAAGGAC
CCGACCAAGG GCGGCCCGCT GAAATTCTTC TACCGCGTTT TCAACCGCAG TTATGACGCC
ACGGCCAGTG GCTACACCAA AGTGTGCCAC TTCCTGACCC GCAAGCTCAT CATCTCCATT
CCGCTGCTGG CTCTGATCGC CTACGCGATT GTCCCCGTAG CCAAGAAAAT CCCCAACGGC
TTCCTGCCGG ACGAAGACCA GGGCTACCTG TTCGCGGCCC TGATCATGCC GGAAGCCCGT
TCCCTCCAGC TGACTACGGC CGCCGCGGAC AAAGTTTCCG AACTCATCCG CCAGAACCCG
AACGTAAAAG ACGTGATTGC CATCTCCGGC TTCAGCCTGC TGACGGGCGT GCAGAGCACG
AACAACGCCT TCTTCTTCGT CATGCTCAAG CCCTGGGAGG AACGCCCCAA TCCGGACCAG
AGCGCCCAGG CGGTCACCGC CCAGCTGAAC GCCCTGCTGA CCACGAAAGT TTCCGAAGGC
ATCACCATGT GCTTCCAGCC TCCGGCCATT GCGGGGGTAG GTTCCGCCAA CGGCGTCACC
TTCATGCTGG AAGACCGCGA CGGCAAGGGC ACGGAATACC TGGCGGAACA AACGGACATC
TTTGTGAAGG AAGCGAACAA GCTTCCGATA TTCGACCCGA ACAACAACGG CGGCGTGCGC
AGCGTGATGT CCTTCGCCGT GGAACAGAAA GATGTGCGGC TGGATGAAGA AAAATGCGCC
ACGCTGGGCG TCAGCATCAG TGAAGCCAAC AGCCTGCTCC AGGCCTACAT GGGTTCCCTG
TTCATCAACT ACATCACCCT TTACGGCCAG CAGTGGCAAG TGTACATCCA GGCGCAGGGC
AGTGACCGCA CCGGAACGGA CATGCTCAAA AACTTCTACG TGAAGAACAA CACGGGAAGC
TCCGTCCCCC TCTCCACCCT CGTGAAAATC ACGGATATCA AAGGGCCGGA ATTCCTGCTG
CGCCAAAACC TGTACAACTC ATCCAAGCTC ATGGTCACGC CCGCCCAGGG CTACTCCAAC
TCCCAGGCCA TGGAGGCGCT GGAAAAAACC TTTGAAGCCA GCATGCCTTC CGACATGGGC
TACAGCTATG CGGACATGAG CTACCAGGAA CAAAAAATCC AGAACGGTAT CGGCATCGTG
CAGATTTTCC TGCTCTCCTC CGTCTTTGTC TTCCTGATTC TGGCGGCCCT GTATGAACGG
TGGTCCCTGC CCCTCAGCAT CTTCATGACG GTGCCCATCG CTGCCCTCGG CGCGTTCCTG
GGCCTGTACT GGTTCGGTTA TGAATTGAAC CTGTACGCGC AAATCGGCCT GGTGATGCTC
ATCGGCCTGG CGGCCAAGAA TGCCATTCTG ATTGTGGAAT TCGCCGTCAT TGAAATGGAA
CGCGGCAAAA CGCTGATGGA AGCGACGCTT TCCGCCGCAA GAATCCGCCT GCGCCCCATC
CTGATGACAT CCTTCGCTTT CATTCTGGGC TGCGTTCCGC TGGCGCTGGC CTCCGGTTCC
GGCGCTTATT CCCGCAATAT CATCGGAATC GTGGTCATTG CCGGGATGAC GATGGCGACG
GTCGTGGGCA TTTTCCTGAT TCCGTGCTCC TTCTACTTCA TCATGAAGCT CTTCCGCGTA
CGCATCGCCC GGAAGACGGT GGAAACGGAA GACCCGGACG AAATCATCGC CCGCAAACAC
CTGTTCCACG AAGCCCATGA ATCATTAAAA GGGTAA
 
Protein sequence
MSSFFIKHPT IAIVISIVMI LLGGLSLMGL PIEQYPNIVP PTIKMQATYP GANAETVANS 
VASPIEQSIS GVVGMDYMTS TNANNGICSL SIVFEVGTDP NMDQTLAYMR YGQATAQIPA
EVSQMGITIT QSTGSPLAVI NLYSPDDSLN AIFLSNYAYV SLVDPVKRVP GVGDVQVFGA
GRYAMRIWLD TTKMAAQNIS VGEVQSAIQA QNTVIPGGQI GAEPAPPGTE FTYSIQTKGR
LQTAEEFGNI IIRADGNKLL YLKDIAKVEL GSQTYNVSSK YNGRDSGAIA VYAAPGSNAI
NTVDALVKLF EDRSRSFPAG MEYNLTLDTT LAVRASIEEI EHTLLEALLL VVLVVFVFLQ
GWRATLIPAI AVPVSIIATF ALFPLLGFTL NTICLMGMVL AIGLVVDDAI VVVEAVESHM
ERGLTPRQAA FAAMEEVSGP VIAIALVLAA VFLPSLLLPG ITGTLFQQFA VTIAISMLIS
AFNALTLSPA LSAILLKPKD PTKGGPLKFF YRVFNRSYDA TASGYTKVCH FLTRKLIISI
PLLALIAYAI VPVAKKIPNG FLPDEDQGYL FAALIMPEAR SLQLTTAAAD KVSELIRQNP
NVKDVIAISG FSLLTGVQST NNAFFFVMLK PWEERPNPDQ SAQAVTAQLN ALLTTKVSEG
ITMCFQPPAI AGVGSANGVT FMLEDRDGKG TEYLAEQTDI FVKEANKLPI FDPNNNGGVR
SVMSFAVEQK DVRLDEEKCA TLGVSISEAN SLLQAYMGSL FINYITLYGQ QWQVYIQAQG
SDRTGTDMLK NFYVKNNTGS SVPLSTLVKI TDIKGPEFLL RQNLYNSSKL MVTPAQGYSN
SQAMEALEKT FEASMPSDMG YSYADMSYQE QKIQNGIGIV QIFLLSSVFV FLILAALYER
WSLPLSIFMT VPIAALGAFL GLYWFGYELN LYAQIGLVML IGLAAKNAIL IVEFAVIEME
RGKTLMEATL SAARIRLRPI LMTSFAFILG CVPLALASGS GAYSRNIIGI VVIAGMTMAT
VVGIFLIPCS FYFIMKLFRV RIARKTVETE DPDEIIARKH LFHEAHESLK G