Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5072 |
Symbol | |
ID | 6135297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5560228 |
End bp | 5563416 |
Gene Length | 3189 bp |
Protein Length | 1062 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641645207 |
Product | hydrophobe/amphiphile efflux-1 (HAE1) family protein |
Protein accession | YP_001771832 |
Protein GI | 170743177 |
COG category | [V] Defense mechanisms |
COG ID | [COG0841] Cation/multidrug efflux pump |
TIGRFAM ID | [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.193812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0596255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTTG CCCACTTCTT CGTCGACCGG CCGATCTTCG CCTCGGTGAC GTCGATCGTC ATCCTGATCA TCGGCTACGT CTCGTACATC TCGCTGCCGG TCTCGCAATA TCCCGAGATC GTGCCGCCCA CCGTGGTGGT GCGCGCCTCC TACCCGGGCG CCAATGCCGA GACCGTGGCG GCCACCATCG CGACGCCGAT CGAGCAGGAG ATCAACGGCG TCGACAACAT GCTCTACATG TCGTCGCTGT CGACGAACGA CGGCAACATG CAGTTGACCA TCACGTTCGC GCTCGGCACC AACCTCGACA TCGCCAACGT GCTGGTCCAG AACCGGCTCT CGGTGGCGAC GCCGCGGCTG CCGCCGGACG TGCGCAACCT CGGCGTCACG GTCCGCAAGT CCTCGCCCGA CCTGATGATG GTCGTTCACC TGCTGTCGCC CGACAACACC TACGACCAGA ACTACATCGC CAACTACATC TACCTGCGCA TCCGCGACCA GCTGCTGCGC CTGAACGGCG TCGGCGACAT CACGGTGTTC GGCGGCAGCG AGTACGCCCT GCGGTTGTGG CTCGACCCGG ACAAGCTCGC GGCCTACCAG CTCTCGACCA CCGACGTGAT CGGCGCCCTG CAGGAGCAGA ACGTCCAGGT CGCCTCGGGC GCCCTCGGCG CCCCGCCCTC GCCGACGAGC CAAGCCTTCC AGCTCGTGGT GCAGACGCAG GGCCGCTTCC AGGACCCGAA CGAGTTCCGC AAGGTGATCG TGAAGGCCTC GGAGGGGCGG CTCGTGCGCA TCTCCGACAT CGCCCGCGTC GAGATGGGCC AGAAGGACTA CGTCACCCAG TCCTTCCTGA ACGGCCAGCC GGCGATCGGC GTCGGCGTCT TCCAGCGCCC CGGCACCAAC GCGCTCGAAG CGGCCGAGAC CGTGCAGAGC CTGATGAAGG AGCTGGCCAA GGACTTCCCG CCCGGGCTCG AGTACCGCAT CGCCTACAAT CCGACCGAGT TCATCGCCGA GTCGGTGCAC GAGGTCTACA AGACGCTCGG CGAGGCGGTG GTCCTCGTCG TGGTGGTGAT CCTGGTCTTC CTGCAGAGCT GGCGCACCGC GCTGGTGCCG ATCATCGCGA TCCCGGTCTC CCTCGTCGGG ACCTTCGCGG TGATGGCGGC GCTCGGCTTC TCGCTCAACA ACCTGACCCT GTTCGGGCTG GTGCTCGCCA TCGGCATCGT GGTCGACGAC GCGATCGTGG TGGTGGAGAA CGTCGAGCGC AACATCTCGG ACGGACTCTC CCCCGGCGAG GCCGCCCACA AGACCATGGA CGAGGTCGGC GGCGCCGTCG TCGCCATCGC CCTGGTGCTC TCGGCGGTGT TCATCCCCAC CGCCTTCATC CCGGGCATCT CCGGCCAGTT CTACCGGCAG TTCGCGCTGA CGATCGCGGC CTCGACCATC ATCTCGATGT TCAACTCGCT GACGCTGTCG CCCGCGCTGT GCAAGCTGCT GCTGCAGCCC CACCACGCCC ACGGCCGGTC GCGCTTCGTC CTGGCGCGGC TCGGCAGCTT CGCGGCCAAC ACCTTCAACC GGGCCTTCGA CGCGACCGCG AACGGCTACG CGGCGACGAT CCGCTTCCTC ACCGGCCGCA CCGTGCCGCT GCTCGTGATG CTCGCCCTCT ACGGCGGGGT GATCGCCGGC ACCCTCCACC TCGCCCGCAC CACCCCCACG GGCTTCATCC CACTCCAGGA CCAGGGCTAC CTGATCGTGG TGGTGCAGCT GCCGCCGGGC TCGGCGCTGG AGCGCACGAC CGCGGTGGTG AAGGAGGCGG CCACCCGCGC GCTTACCATC GACGGCGTCG CCAACGCGGT GGTGATCTCG GGCTTCGACG GCTCGACCTT CACCAACACC ACCAACGGCG CGGTGATGTT CCTGACGCTC AAGCCGTTCA AGGAGCGGCA GGCGCGCGGG CGCACCGCCG CCGCCATCGT GGGCGACGTC TTCGGCAAGA CCGCCTCGAT CACCGAGGCG CGCATCATCG CCATCCCGCC GCCGCCCGTG CGCGGCCTCG GCAACGCGGG CGGCTACAAG ATGCAGGTGC AGAACCGCTC CGGCTCCGAC ATCGCGAGCC TGCTCGCCGC CTCGGGCGAC CTGATCGCGG CCGCCAACCA GGACCCCAAC CTGACCCGCG TCTTCACCAC CTTCGGGAAC GACACGCCGC AGATCTACCT CGACATCGAC CGCACCAAGG CGCGCATGCT GAACGTGCCG CTCGCCAACG TGTTCTCGAC CCTGCAGGTC AATCTCGGCG GCGCCTACGT CAACGACTTC AACACGTTCG GGCGCATCTA CCAGGTCCGG GCCCAGGCCG ACGCCAAGTT CCGTCTGGAG AAGGACGACA TCGCGCGGCT GAAGGTGCGC TCCTCGACGG GCGCGCTGGT GCCGATGGGG TCGCTCGCGG GCATCCGCGA CATCGCCGGG CCGCAGATCG TGCAGCGCTA CAACCTGTTC TACGCGATCC CGGTCCAGGG CGACACGCGG CCCGGCGTGT CGACCGGCCA GGCGCTCGGC GCCATGGAGG CCCTGGCCAG GAAGACCCTG CCGGAGGGGA TGAGCTTCGA GTGGACCGAG ATCGCCTTCC AGCAGAAGGC GGTGGGCAAC ACGGCGCTCT ACGTCTTCGC GCTCGGCGTG CTCCTGGTCT TCCTGGTGCT GGCCGCCCAG TACGAATCCT GGGCGCTGCC GCTCGCGATC CTGCTGGTGG TGCCCACGGG CGTGCTCGCG GCCTTCGCGG GGGTGCAGCT CCGGGCGCAG GACAACAACA TCCTGACCCA GATCGGCCTG ATCGTGCTGA TCGGCCTCGC GGCCAAGAAC GCGATCCTGA TCGTGGAATT CGCCCACCAG ATCGAGGAGA GCGAGCATCG CGGCCCGGTC GCCGCGGCGG TCGAGGCCTG CCGCCTGCGC CTGCGCCCGA TCCTGATGAC GGCCTTCGCC TTCATCCTCG GCGTGGTGCC GCTCGCCATC GCGACCGGCC CGGGCGCCGA GATGCGCCAG GCGCTCGGCA CCGCGGTGCT GTTCGGCATG CTCGGCGCCA CGGTGTTCGG CCTGTTCCTG ACGCCGGTCT TCTACGTGGT GATCCGCCAC GTCCTGATCC GCATCAACCG CTGGCGCGGC AGGGACGAGC CGCCGCCGAA CCTCGCGGCC GCGCAGTAG
|
Protein sequence | MRFAHFFVDR PIFASVTSIV ILIIGYVSYI SLPVSQYPEI VPPTVVVRAS YPGANAETVA ATIATPIEQE INGVDNMLYM SSLSTNDGNM QLTITFALGT NLDIANVLVQ NRLSVATPRL PPDVRNLGVT VRKSSPDLMM VVHLLSPDNT YDQNYIANYI YLRIRDQLLR LNGVGDITVF GGSEYALRLW LDPDKLAAYQ LSTTDVIGAL QEQNVQVASG ALGAPPSPTS QAFQLVVQTQ GRFQDPNEFR KVIVKASEGR LVRISDIARV EMGQKDYVTQ SFLNGQPAIG VGVFQRPGTN ALEAAETVQS LMKELAKDFP PGLEYRIAYN PTEFIAESVH EVYKTLGEAV VLVVVVILVF LQSWRTALVP IIAIPVSLVG TFAVMAALGF SLNNLTLFGL VLAIGIVVDD AIVVVENVER NISDGLSPGE AAHKTMDEVG GAVVAIALVL SAVFIPTAFI PGISGQFYRQ FALTIAASTI ISMFNSLTLS PALCKLLLQP HHAHGRSRFV LARLGSFAAN TFNRAFDATA NGYAATIRFL TGRTVPLLVM LALYGGVIAG TLHLARTTPT GFIPLQDQGY LIVVVQLPPG SALERTTAVV KEAATRALTI DGVANAVVIS GFDGSTFTNT TNGAVMFLTL KPFKERQARG RTAAAIVGDV FGKTASITEA RIIAIPPPPV RGLGNAGGYK MQVQNRSGSD IASLLAASGD LIAAANQDPN LTRVFTTFGN DTPQIYLDID RTKARMLNVP LANVFSTLQV NLGGAYVNDF NTFGRIYQVR AQADAKFRLE KDDIARLKVR SSTGALVPMG SLAGIRDIAG PQIVQRYNLF YAIPVQGDTR PGVSTGQALG AMEALARKTL PEGMSFEWTE IAFQQKAVGN TALYVFALGV LLVFLVLAAQ YESWALPLAI LLVVPTGVLA AFAGVQLRAQ DNNILTQIGL IVLIGLAAKN AILIVEFAHQ IEESEHRGPV AAAVEACRLR LRPILMTAFA FILGVVPLAI ATGPGAEMRQ ALGTAVLFGM LGATVFGLFL TPVFYVVIRH VLIRINRWRG RDEPPPNLAA AQ
|
| |