Gene Mlg_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1408 
Symbol 
ID4270630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1612009 
End bp1615185 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content68% 
IMG OID638126164 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_742247 
Protein GI114320564 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.902436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.728656 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGGT TCTTCATCGA CCGGCCGATC TTCGCCTCAG TCATCTCGAT CATCATCCTG 
GTGGCCGGGC TCGCGTCGCT GCGCGCCCTG CCCATCGAGC AGTACCCGGA CGTGGTGCCA
CCGGAGATCG TGGTCCAGGC CGCTTACCCC GGCGCCAGCT CGGAGGTGTT GGCGGAGGCG
GTGGCCGCAC CCCTGGAGCA GGAGATCAAC GGCGTGGACG ACATGATCTA CATGGAGTCC
ACCAGTACCG ACGCCGGCAC GGTCCAGATC TCGGTCTCCT TCGAGATGGG CACCGATCCG
GATCAGGCCG AGATCAACGT CAACAACCGG GTGCAGGCCG CCCTGCCCCG ACTGCCCCAG
GCGGTACGGG ACCAGGGGGT GCGGGTGGAG GCCCGCTCCA CCAACATCCT GTTGGTGGCG
ACGCTCAGCT CGCCGGACGG GCGCCACGGC ACGCTCGAGC TCAGTAACTA CGCCCTGCTG
AACATCATCG ACGAACTGGA GCGCCTGCCC GGTGTGGGCG AGGCCTCTCT ATTCGGCCAG
CAGGACTACG CCATGCGGAT CTGGCTACGG CCGGACAAGC TGGCCCAGTA CGACCTGACG
CCGGCGGAGG TGGCGGCGGC CATCCGCGAG CAGAACGCCC AGTTCGCCGC CGGCCAGATG
GGCGCTGAGC CGGCCCCCGA CGGCCAGGCC TTCACCCTCA CCGTGACCAC CCGGGGCCAA
CTGGAGGGGG CGGAGGAGTT CGAGGCCATC ATCCTGCGCT CGGACGAGTC CGGGGCCACC
CTGCGCCTCG GCGACGTGGC CAGGGCCGAA CTGGGTGCCC AGAGCTACGC CTTCTCCGCC
ACCTACAACG GCGAGCCGAC CGTCCCCATC GGGGTCTATC AGGCGCCCGG GGCCAACGCC
CTGGAGACCG CCGAGCAGGT GCGGGCCGCG CTGGCGGACA GCGCCGAGCG CTTTCCCGAG
GGCGTGGAGT ACACCATCCC CTATGACACC ACCGAGTTCG TGGAGGTCTC CATCCGGGAG
GTCTACACCA CACTGCTCAT CGCGGTAGGG CTCGTGGTGC TGGTGACCTT CGTCTTCCTG
CAGCACCTGC GGGCGACCCT GATCCCCATC ACTGCGATAC CGGTCTCCCT GATCGGGACC
TTTGCGGGCA TGCAGGCCAT GGGCTTTTCG GTCAACCTGC TGACCCTCTT CGGGCTGGTG
CTGGCCATCG GCATCGTCGT GGACAACGCC ATCATCGTGA TGGAGAACGT CGAGCGGCTG
ATGCGCGAGA AGGGGCTGAA GGCCAGGGAG GCCTCGGTGG AGACCATGAA GCAGGTTTCC
GGGGCGGTGG TCTCCTCCAC CCTGGTGCTG GTGGCGGTCT TCGCGCCCGT GGCCTTCCTG
GGCGGTCTGA CCGGTGAGTT GTACCGCCAG TTCGCGGTCA CTATCGCCTT CTCGGTGGTG
ATCTCCGGCG TGGTGGCGCT GACCCTGACC CCGGCGATGT GTGCCCTGCT CCTGGATAAG
CAGCCGAAGA CACCCTGGCT GCCCTTCCGC CTCTTCAACG CCGGCTTTGA ACACCTGACC
CGGGCCTTCG TGGCCGCGGT CGGCTTCCTG GTCAGAAACC GCACTGTGGG TGTCGGGCTG
TTCGCACTCG CCGTGGGCGG CGCGGTGTTC CTGGTGGAGC GCATGCCGGA CGGCCTGGTG
CCCCAGGAGG ACCAGGGGTT TGCCCTGGTG GTCGCACAGT TGCCGCCCAC CTCAGCGCTG
AACCGCACCG AAGCGGTCCG CGACGCCCTG GCGGCGCAGC TCACGCAACT GGAAGAGATC
CAGGAGTTCA CCGCCTTCGC CGGCTTCGAC ATCATCGCCG GCTCGCTGCG GACCAACGCC
GCGGTCGGTT TCGTGAACTT CACCGACTGG GCCGATCGGC CCCGGCCCGA CCAACACGCC
GCGGCCATGA GTGAGCGAAT CTCCGGGATG GGGTTCGGCC TGCAGGAGGC CAATGTGTTC
GCCTTCATTC CGCCGCCCAT CCAGGGGCTG TCACTGACCG GCGGCGTGGA GGGCTTTCTG
CAGGTGCGCG AGGACATGAG CGCCCGCCAG GTGGAGGCCT TGGCCAACCG GGTGGTCCAG
CGGGCCAATG AGCGCCCGGA GCTGGTGAAC GCCCGCTCGA CGCTGGACAC CGGCATCCCC
CGCTACCGTG CGGAGTTGGA CCGCGAGAAG GCCAAGGCGG CCGGGGTGCG GATTGACGAG
GTGTTTAATA CCATGCGGGC CACCTTCGGC GCCCTCTATG TCAACGATTT CACCTTCGCC
GGACGGCTAT GGCAGGTGAA CCTCCAGTCC GAGCAGGACT TCCGGAGCCA TCCGGAGGAC
CTGCGCCATG TCTTCGTGCG CTCCGAGAGC GGTGATCTCG TGCCGCTGAG CGCGCTGGTC
CGTCTGACCC GCGAGTCGGG CGCGGACATC ATCAACCGCT TCAACATCTA TCAGGCCGCC
AAGCTGATGG CCGACCCCGC CCCCGGCTAC ACCAGCGGTG ACGCGAAGGC GGCGCTCGAG
GCGGTGGTGG CCGAGGTCCG CGACGAGGAG GGGGCCGACG CTCTGCTCGG CTGGATCGGC
GAGGCCTACC AGTTGGAGGT GGCGGCCGGC GCCGGCGCCG CGGCCTTCGC CATGGGCCTG
CTAATGGTGT TCCTGATCCT GGCCGCGCAG TACGAACGCT GGACGCTGCC TCTGGCCGTG
GCCACCGCGG TCCCCTTCGC CGTCCTGGGC GCCGCCCTCT TCGCCCTGCT CCGCGGTTTC
CCCAACGACA TCTACTTCCA GGTGGGCCTG CTGGTGCTGA TCGGCCTGGC GGCCAAGAAC
GCCATTCTCA TCGTGGAGTT CGCGGCCCAG AACCGGGCCA CCGGCATGAC CTCCACCGAG
GCGGCCATGG CTGCGGCACG CCAGCGTTTC CGCGCCATTA TGATGACCGC CCTGACCTTC
ATCATCGGCA CCCTGCCTCT GGTGTTCGCC ACCGGGGCCG GCGCCGCCAG CCGCCAGGAG
ATCGGGACCG TGGTGGTGGG CGGCATGCTG GCCGCCAGCA CCCTGGCGCT GCTCTTCGTG
CCGCTGTTCT ACAAACTGCT TGAGGATGTC GCCACCTGGC GCAACGAGCG GCGGGCCCGG
CGTGAGCAGG AGAAGGCGGC GCAGGCCGAT GACAAGGAGG CCGGGAACCA TGCGTAA
 
Protein sequence
MLRFFIDRPI FASVISIIIL VAGLASLRAL PIEQYPDVVP PEIVVQAAYP GASSEVLAEA 
VAAPLEQEIN GVDDMIYMES TSTDAGTVQI SVSFEMGTDP DQAEINVNNR VQAALPRLPQ
AVRDQGVRVE ARSTNILLVA TLSSPDGRHG TLELSNYALL NIIDELERLP GVGEASLFGQ
QDYAMRIWLR PDKLAQYDLT PAEVAAAIRE QNAQFAAGQM GAEPAPDGQA FTLTVTTRGQ
LEGAEEFEAI ILRSDESGAT LRLGDVARAE LGAQSYAFSA TYNGEPTVPI GVYQAPGANA
LETAEQVRAA LADSAERFPE GVEYTIPYDT TEFVEVSIRE VYTTLLIAVG LVVLVTFVFL
QHLRATLIPI TAIPVSLIGT FAGMQAMGFS VNLLTLFGLV LAIGIVVDNA IIVMENVERL
MREKGLKARE ASVETMKQVS GAVVSSTLVL VAVFAPVAFL GGLTGELYRQ FAVTIAFSVV
ISGVVALTLT PAMCALLLDK QPKTPWLPFR LFNAGFEHLT RAFVAAVGFL VRNRTVGVGL
FALAVGGAVF LVERMPDGLV PQEDQGFALV VAQLPPTSAL NRTEAVRDAL AAQLTQLEEI
QEFTAFAGFD IIAGSLRTNA AVGFVNFTDW ADRPRPDQHA AAMSERISGM GFGLQEANVF
AFIPPPIQGL SLTGGVEGFL QVREDMSARQ VEALANRVVQ RANERPELVN ARSTLDTGIP
RYRAELDREK AKAAGVRIDE VFNTMRATFG ALYVNDFTFA GRLWQVNLQS EQDFRSHPED
LRHVFVRSES GDLVPLSALV RLTRESGADI INRFNIYQAA KLMADPAPGY TSGDAKAALE
AVVAEVRDEE GADALLGWIG EAYQLEVAAG AGAAAFAMGL LMVFLILAAQ YERWTLPLAV
ATAVPFAVLG AALFALLRGF PNDIYFQVGL LVLIGLAAKN AILIVEFAAQ NRATGMTSTE
AAMAAARQRF RAIMMTALTF IIGTLPLVFA TGAGAASRQE IGTVVVGGML AASTLALLFV
PLFYKLLEDV ATWRNERRAR REQEKAAQAD DKEAGNHA