Gene M446_5072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5072 
Symbol 
ID6135297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5560228 
End bp5563416 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content69% 
IMG OID641645207 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_001771832 
Protein GI170743177 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.193812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0596255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTTG CCCACTTCTT CGTCGACCGG CCGATCTTCG CCTCGGTGAC GTCGATCGTC 
ATCCTGATCA TCGGCTACGT CTCGTACATC TCGCTGCCGG TCTCGCAATA TCCCGAGATC
GTGCCGCCCA CCGTGGTGGT GCGCGCCTCC TACCCGGGCG CCAATGCCGA GACCGTGGCG
GCCACCATCG CGACGCCGAT CGAGCAGGAG ATCAACGGCG TCGACAACAT GCTCTACATG
TCGTCGCTGT CGACGAACGA CGGCAACATG CAGTTGACCA TCACGTTCGC GCTCGGCACC
AACCTCGACA TCGCCAACGT GCTGGTCCAG AACCGGCTCT CGGTGGCGAC GCCGCGGCTG
CCGCCGGACG TGCGCAACCT CGGCGTCACG GTCCGCAAGT CCTCGCCCGA CCTGATGATG
GTCGTTCACC TGCTGTCGCC CGACAACACC TACGACCAGA ACTACATCGC CAACTACATC
TACCTGCGCA TCCGCGACCA GCTGCTGCGC CTGAACGGCG TCGGCGACAT CACGGTGTTC
GGCGGCAGCG AGTACGCCCT GCGGTTGTGG CTCGACCCGG ACAAGCTCGC GGCCTACCAG
CTCTCGACCA CCGACGTGAT CGGCGCCCTG CAGGAGCAGA ACGTCCAGGT CGCCTCGGGC
GCCCTCGGCG CCCCGCCCTC GCCGACGAGC CAAGCCTTCC AGCTCGTGGT GCAGACGCAG
GGCCGCTTCC AGGACCCGAA CGAGTTCCGC AAGGTGATCG TGAAGGCCTC GGAGGGGCGG
CTCGTGCGCA TCTCCGACAT CGCCCGCGTC GAGATGGGCC AGAAGGACTA CGTCACCCAG
TCCTTCCTGA ACGGCCAGCC GGCGATCGGC GTCGGCGTCT TCCAGCGCCC CGGCACCAAC
GCGCTCGAAG CGGCCGAGAC CGTGCAGAGC CTGATGAAGG AGCTGGCCAA GGACTTCCCG
CCCGGGCTCG AGTACCGCAT CGCCTACAAT CCGACCGAGT TCATCGCCGA GTCGGTGCAC
GAGGTCTACA AGACGCTCGG CGAGGCGGTG GTCCTCGTCG TGGTGGTGAT CCTGGTCTTC
CTGCAGAGCT GGCGCACCGC GCTGGTGCCG ATCATCGCGA TCCCGGTCTC CCTCGTCGGG
ACCTTCGCGG TGATGGCGGC GCTCGGCTTC TCGCTCAACA ACCTGACCCT GTTCGGGCTG
GTGCTCGCCA TCGGCATCGT GGTCGACGAC GCGATCGTGG TGGTGGAGAA CGTCGAGCGC
AACATCTCGG ACGGACTCTC CCCCGGCGAG GCCGCCCACA AGACCATGGA CGAGGTCGGC
GGCGCCGTCG TCGCCATCGC CCTGGTGCTC TCGGCGGTGT TCATCCCCAC CGCCTTCATC
CCGGGCATCT CCGGCCAGTT CTACCGGCAG TTCGCGCTGA CGATCGCGGC CTCGACCATC
ATCTCGATGT TCAACTCGCT GACGCTGTCG CCCGCGCTGT GCAAGCTGCT GCTGCAGCCC
CACCACGCCC ACGGCCGGTC GCGCTTCGTC CTGGCGCGGC TCGGCAGCTT CGCGGCCAAC
ACCTTCAACC GGGCCTTCGA CGCGACCGCG AACGGCTACG CGGCGACGAT CCGCTTCCTC
ACCGGCCGCA CCGTGCCGCT GCTCGTGATG CTCGCCCTCT ACGGCGGGGT GATCGCCGGC
ACCCTCCACC TCGCCCGCAC CACCCCCACG GGCTTCATCC CACTCCAGGA CCAGGGCTAC
CTGATCGTGG TGGTGCAGCT GCCGCCGGGC TCGGCGCTGG AGCGCACGAC CGCGGTGGTG
AAGGAGGCGG CCACCCGCGC GCTTACCATC GACGGCGTCG CCAACGCGGT GGTGATCTCG
GGCTTCGACG GCTCGACCTT CACCAACACC ACCAACGGCG CGGTGATGTT CCTGACGCTC
AAGCCGTTCA AGGAGCGGCA GGCGCGCGGG CGCACCGCCG CCGCCATCGT GGGCGACGTC
TTCGGCAAGA CCGCCTCGAT CACCGAGGCG CGCATCATCG CCATCCCGCC GCCGCCCGTG
CGCGGCCTCG GCAACGCGGG CGGCTACAAG ATGCAGGTGC AGAACCGCTC CGGCTCCGAC
ATCGCGAGCC TGCTCGCCGC CTCGGGCGAC CTGATCGCGG CCGCCAACCA GGACCCCAAC
CTGACCCGCG TCTTCACCAC CTTCGGGAAC GACACGCCGC AGATCTACCT CGACATCGAC
CGCACCAAGG CGCGCATGCT GAACGTGCCG CTCGCCAACG TGTTCTCGAC CCTGCAGGTC
AATCTCGGCG GCGCCTACGT CAACGACTTC AACACGTTCG GGCGCATCTA CCAGGTCCGG
GCCCAGGCCG ACGCCAAGTT CCGTCTGGAG AAGGACGACA TCGCGCGGCT GAAGGTGCGC
TCCTCGACGG GCGCGCTGGT GCCGATGGGG TCGCTCGCGG GCATCCGCGA CATCGCCGGG
CCGCAGATCG TGCAGCGCTA CAACCTGTTC TACGCGATCC CGGTCCAGGG CGACACGCGG
CCCGGCGTGT CGACCGGCCA GGCGCTCGGC GCCATGGAGG CCCTGGCCAG GAAGACCCTG
CCGGAGGGGA TGAGCTTCGA GTGGACCGAG ATCGCCTTCC AGCAGAAGGC GGTGGGCAAC
ACGGCGCTCT ACGTCTTCGC GCTCGGCGTG CTCCTGGTCT TCCTGGTGCT GGCCGCCCAG
TACGAATCCT GGGCGCTGCC GCTCGCGATC CTGCTGGTGG TGCCCACGGG CGTGCTCGCG
GCCTTCGCGG GGGTGCAGCT CCGGGCGCAG GACAACAACA TCCTGACCCA GATCGGCCTG
ATCGTGCTGA TCGGCCTCGC GGCCAAGAAC GCGATCCTGA TCGTGGAATT CGCCCACCAG
ATCGAGGAGA GCGAGCATCG CGGCCCGGTC GCCGCGGCGG TCGAGGCCTG CCGCCTGCGC
CTGCGCCCGA TCCTGATGAC GGCCTTCGCC TTCATCCTCG GCGTGGTGCC GCTCGCCATC
GCGACCGGCC CGGGCGCCGA GATGCGCCAG GCGCTCGGCA CCGCGGTGCT GTTCGGCATG
CTCGGCGCCA CGGTGTTCGG CCTGTTCCTG ACGCCGGTCT TCTACGTGGT GATCCGCCAC
GTCCTGATCC GCATCAACCG CTGGCGCGGC AGGGACGAGC CGCCGCCGAA CCTCGCGGCC
GCGCAGTAG
 
Protein sequence
MRFAHFFVDR PIFASVTSIV ILIIGYVSYI SLPVSQYPEI VPPTVVVRAS YPGANAETVA 
ATIATPIEQE INGVDNMLYM SSLSTNDGNM QLTITFALGT NLDIANVLVQ NRLSVATPRL
PPDVRNLGVT VRKSSPDLMM VVHLLSPDNT YDQNYIANYI YLRIRDQLLR LNGVGDITVF
GGSEYALRLW LDPDKLAAYQ LSTTDVIGAL QEQNVQVASG ALGAPPSPTS QAFQLVVQTQ
GRFQDPNEFR KVIVKASEGR LVRISDIARV EMGQKDYVTQ SFLNGQPAIG VGVFQRPGTN
ALEAAETVQS LMKELAKDFP PGLEYRIAYN PTEFIAESVH EVYKTLGEAV VLVVVVILVF
LQSWRTALVP IIAIPVSLVG TFAVMAALGF SLNNLTLFGL VLAIGIVVDD AIVVVENVER
NISDGLSPGE AAHKTMDEVG GAVVAIALVL SAVFIPTAFI PGISGQFYRQ FALTIAASTI
ISMFNSLTLS PALCKLLLQP HHAHGRSRFV LARLGSFAAN TFNRAFDATA NGYAATIRFL
TGRTVPLLVM LALYGGVIAG TLHLARTTPT GFIPLQDQGY LIVVVQLPPG SALERTTAVV
KEAATRALTI DGVANAVVIS GFDGSTFTNT TNGAVMFLTL KPFKERQARG RTAAAIVGDV
FGKTASITEA RIIAIPPPPV RGLGNAGGYK MQVQNRSGSD IASLLAASGD LIAAANQDPN
LTRVFTTFGN DTPQIYLDID RTKARMLNVP LANVFSTLQV NLGGAYVNDF NTFGRIYQVR
AQADAKFRLE KDDIARLKVR SSTGALVPMG SLAGIRDIAG PQIVQRYNLF YAIPVQGDTR
PGVSTGQALG AMEALARKTL PEGMSFEWTE IAFQQKAVGN TALYVFALGV LLVFLVLAAQ
YESWALPLAI LLVVPTGVLA AFAGVQLRAQ DNNILTQIGL IVLIGLAAKN AILIVEFAHQ
IEESEHRGPV AAAVEACRLR LRPILMTAFA FILGVVPLAI ATGPGAEMRQ ALGTAVLFGM
LGATVFGLFL TPVFYVVIRH VLIRINRWRG RDEPPPNLAA AQ