Gene Amuc_2036 details


Gene Information

Locus tag: Amuc_2036
Symbol:
ID: 6273735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2472372
End bp: 2473766
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 59%
IMG OID: 642614097
Product: MATE efflux family protein
Protein accession: YP_001878627
Protein GI: 187736515
COG category: [V] Defense mechanisms
COG ID: [COG0534] Na+-driven multidrug efflux pump
TIGRFAM ID: [TIGR00797] putative efflux protein, MATE family
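
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative.

```python
# Values copied from the gene record above.
START_BP = 2472372
END_BP = 2473766


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1395
print(protein_length(length_bp))  # 464
```

Both results agree with the Gene Length and Protein Length fields of the record.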


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000288761
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0000000485587
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGAAGGGA ATAAAAAAAT GAATAAGACG TGGAATTTGA AGGAAATGAA AAAGTTGATA 
CCGCTGGCCC TTCCGGTGCT GGTGGTCAAC CTCTCCATTG TGGGCATGGG GGCGGTGGAC
GCTATTGTGG CCGGGCGCGC CGGCGTGACG GACATGGCCG CCGTGGCCCT GGGGTCTTCC
GTGTACCTGC CCGTGGCTCT CTTCGCCTGC GGCGTGTTGA TGATCATCGG CCCCGTAATT
GCCAACATGC GGGGGAAAAG CCATGAAAGC CGCGTGGGCT ACATGACCAA CCACGGCCTG
TGGCTGGCGT TCATGCTCAG TCTGGTTTCC ATGCCGGTCA TTTATGTGTT GAGAAATGTG
TTCGGCTGGA TTTCCGATGA CGCCGCCATG TGCCAGATGG CCTCCGCCTA CATGTTCGCC
ATTATGTGGG GCCTTCCCGC CAACCTGGGG TTCGTGGCCC TGAAGAGCCT GAACGAAGGC
TCCAACATGA CACGTCCCGC CATGTACGTG GGATTGTGCG GCCTGTTGCT CAACATTCCC
CTGAACTACA TGTTCGTCTT TGGCATGTAC GGTTTTCCCC GCATGGGTGG AGCGGGGTGC
GGTGCGGCCA CGGCAGTCAT TTTCTACATT GAATTCCTGC TGATGTTCCT GCTGGTTTAC
TTCAATCCCA AGCACAGGCC GTACCGCAGG CACATCATTT CCTGGCGGCG GCCTACGCCT
TCCGTCATCA CGCACCTGGT GCGGCTGGGC GTGCCTATAG GCGTTTCCCA GCTGTGCGAG
GTGATGCTCT TCTGCGCGGC GGCTCTGGTG CTGGCTCCGC TGGGAGAGAC GCAGGTGGCT
AGCCACCAGA TTGCCGGGAA CGTGGGCGGC CTGGTGTTCA TGCTCCCGCT TTCCGTAGGG
CTGGCGGCTT CCATCCGCGT GGCGTACCAT CACGGCAGGA ATGACCTGGC AGGCACCAGA
TCCGCCATTC TGTCTTCCTA TGTGCTGGTG CTCACCATCT GTCTGTGCAC CTTTGGAGGC
ATCACCCTGT TCCGCGAGCA GATCGTGCAC CTGTACAATG ACTCGGAGCT GATTGTCAGC
ACGGCTTCCG TCCTGCTGGT TCTGGCGGCG GCCTACCAGC TTCCGGACTG TTTGCAGGTG
CTTTCCGTCG GGGTTCTGAG AGGATTCCGG GATACGGCGT CCATTACCTT CATTACGTTT
TTCTCTTATT GGATGGTAGG ATTTCCGGCG TGCTACATCC TTGCCCGTAC GGACTGGATT
GTCCCGGCCA TGGGAGCGCG GGGCATCTGG ACGGGATTCA TCATCGGCCT GGCAGTAGCG
GCGGTGCTGC TGCTCTGGCG CGTAAGGCGC ACTACCAGGC GGGAATTTTC CCTGATGAGG
CAGGCGGGGG AATAA
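
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch of such a calculation, not the database's own method; `gc_content` is a hypothetical helper that ignores the whitespace used to group the sequence into blocks of 10 bases.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input for illustration; paste the full gene sequence to compare the
# result against the GC content field of the record.
print(f"{gc_content('GGCC AATT'):.0%}")  # 50%
```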
 
Protein sequence
MEGNKKMNKT WNLKEMKKLI PLALPVLVVN LSIVGMGAVD AIVAGRAGVT DMAAVALGSS 
VYLPVALFAC GVLMIIGPVI ANMRGKSHES RVGYMTNHGL WLAFMLSLVS MPVIYVLRNV
FGWISDDAAM CQMASAYMFA IMWGLPANLG FVALKSLNEG SNMTRPAMYV GLCGLLLNIP
LNYMFVFGMY GFPRMGGAGC GAATAVIFYI EFLLMFLLVY FNPKHRPYRR HIISWRRPTP
SVITHLVRLG VPIGVSQLCE VMLFCAAALV LAPLGETQVA SHQIAGNVGG LVFMLPLSVG
LAASIRVAYH HGRNDLAGTR SAILSSYVLV LTICLCTFGG ITLFREQIVH LYNDSELIVS
TASVLLVLAA AYQLPDCLQV LSVGVLRGFR DTASITFITF FSYWMVGFPA CYILARTDWI
VPAMGARGIW TGFIIGLAVA AVLLLWRVRR TTRREFSLMR QAGE
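
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons), so alternative start codons are not handled specially here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGAAGGGA ATAAAAAAAT GAATAAGACG TGGAATTTGA AGGAAATGAA AAAGTTGATA"
print(translate(first_line))  # MEGNKKMNKTWNLKEMKKLI
```

The output matches the first 20 residues of the protein sequence listed above.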