Gene GM21_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1850 
Symbol 
ID8137181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2151025 
End bp2154183 
Gene Length3159 bp 
Protein Length1052 aa 
Translation table11 
GC content62% 
IMG OID644869461 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionYP_003021661 
Protein GI253700472 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.0763559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGGT TTTTCATAAA CAGGCCCATA TTCGCCTGGG TCATAGCCAT AGTCGTCATG 
CTGGGCGGTC TGCTCGCCAT CAAGGCGCTG CCGGTCTCCC AGTACCCACC GATCGCGCCG
CCGCAGATCT CGATCAACGC CGTCTATCCG GGCGCCTCGG CGCAGACGGT CCAGGACACC
GTAACGCAGG TGATCGAGCA GAAGATGAAC GGCATCGACA ACCTGCTCTA CATGTCCTCC
ACCAGCGATT CCGCCGGTGC GGTCAGCATC AACCTGACCT TCAGAGCGGG GACCGACCCC
AACGTGGCCC AGGTACAGGT GCAGAACAAG CTGCAGCTTG CCACGCCGCT TTTGCCGCAG
GTAGTGCAGA GGCAGGGGGT GCAGGTGGTG AAGTCCACCC GCAACTTCCT CTTGATCGTC
GGTCTCGTCT CCGAGGACGA GTCGCTCAAC CGGCATCAGT TGACCGACTA CATGGTCTCC
AACATCCAGG ACATCGTGAG CCGGGTCCAG GGGGTAGGGG AGGTCACCGT CTTCGGCTCC
CAAAACGCCA TGCGCGTCTG GATGGATGCC GAGAAGCTGA ACAACTACAA GCTGACCCCA
AACGACGTGG TCAACGCTCT GCAGGCGCAG AACGCCCAGG TCTCCGCAGG TCAGTTCGGC
GGGCAGCCCG CCGTCCAGGG ACAACAGTTA AACGCCACCA TTACCGCCAG GACCCTTTTG
CAGAGCCCGG AGCAGTTCGA CCAGATCGTT TTGCGCACCA ACCCCGACGG CTCCACGGTG
AAGCTCAGGG ATGTCGCCAA GACCGACGTC GGCACAGAAA ACTACGACAT CCTGGCCCGC
TACAAGGGCA AACCGGTGGC GGCCATGGCG CTTAGGCTTG CCGCAGGCGC CAACGCGCTC
GACACCGCGG ACAGGGTCAA GGCCAAGATG GCCGAGCTGG AGAAGTTCGT TCCGGCAGGG
GCCAAGGTGG TGTACCCCTA CGACACCACG CCCTTCGTCA AGATCTCCAT CGAGGAGGTG
CTGAAGACCC TGATGGAGGC GGTATTCCTG GTCTTCATCA TCATGTTCCT GTTCCTGCAG
AACATCCGCG CCACGTTGAT CCCGACCATC GCGGTGCCGG TAGTTCTCCT GGGTACCCTG
GGGATCCTCT TCGCCGCGGG TTTTTCCATC AACACCCTGA CCATGTTCGC CCTGGTCATC
GTCATCGGCC TTTTGGTCGA CGACGCCATC GTCGTGGTCG AGAACGTGGA AAGGATCATG
ACCGACGAGG GGCTCTCCCC GCACGACGCG ACGGTCAAGT CGATGGGACA GATCACCTCG
GCTCTTTGGG GCATCGCGAC CGTGCTTTGC GCCGTCTTCA TCCCGATGGC GTTTTTCGGG
GGCTCCACCG GCGTCATTTA CCGCCAGTTC TCCGTCACCA TCGTCTCGGC GATGATCCTG
TCGGTTCTGA CCGCCCAGAT ACTCACCCCT GCGCTTTGCG CGACGCTCCT CAAGCCCGTG
CAGAAGGGGC ACCTCCCCGG CGAGGGGGGG TGGTTCAGCG GTTTCTTCCG CTGGTTCAAC
AAGGTATTCG ACGCGGCACG CCACAAGTAC GAATCCATTG TCGGCAACTC CTTTGGCAAA
CCGCTGCGCT ACCTCTTCAT CTACGGCTGC CTGGTCGGCA TCATGATATT CCTGTTCCTG
CGCCTCCCCA CGGCGTTCCT GCCGGACGAG GACCAGGGCT TCATCGTCTG CCAGATCCAG
CTTCCGGCCG GCGCCACCCA GGAGAGGACC CTCAAGGCGC TGGAGCAGGT GGAGCGCTAC
TTCCTGGAGA AGGAGAGTAA GACGGTCGAA TCCCTGATCA CCGTCGCCGG TTTCAGCTTC
GCCGGCCGCG GCCAGAACAT GGGCCTCGCC TTCGTGAAGC TCAAGGACTG GAAACTGAGA
CCCACCCCAG ACCTCAAGGC TCCGGCCCTG GCGGGACGCG CCATGGGGGC TTTCTCCCAG
ATCAAGGACG GCATGGCCTT CGCCTTCTCG CCTCCCGCCG TAGTCGAGCT CGGTCAAGCC
AACGGCTTCG ACTTCCAGCT GCAGGACCGC GGCGGTCTGG GGCACCAGGC CTTGATGGAT
GCCCGCAACC AGCTCCTCGG CATGGCCATG AAGAACCCGA AGCTGATGGC GGTCCGCCCC
AACGGCCAGG ACGACACACC CGAGTTCAAG CTGAACATCG ACGACGTTCG CGCCGGGGCG
CTCGGGGTCT CCCTCGCCGA CGTCAACAAC GTCCTCGCCA CCGCCTGGGG CTCCTCCTAC
GTCAACGACT TCCTGCAAGA CGGCAGGGTA AAGAAGGTCT ACGTGCAGGC GGACCCAAAG
TACCGCATGG TCCCTGAGGA CATCAACAGG TGGTACGTGA GAAACAACAA GGGCGAGATG
GTTCCCTTCT CCTCCTTCGC CACCGCGCGC TGGGAGTACT CCTCGCCGCG TCTGGAGCGC
TACAACGGCA TCCCGTCCAT GGAGATCATG GGGAGCGCCG CTCCCGGGGT AAGCACCGGC
GAGGCGATGG CCGAGATGGA GGCCATAGCC GAAAAGCTCC CGCAGGGGAT CAGTTACGAG
TGGACCGGCC TCTCCTATGA GGAGAAGGCG GCCGGCGCCC AGGCCCCGGC GCTTTACGCC
ATCTCGCTAC TGGTTGTCTT TTTGGCGGTC GCCGCACTCT ACGAGAGCTG GACCATACCG
TTCGTAAACC TCTTGATGCT TCCGCTCGGC CTGGTCGGCG CCATCACCGC GGTCACCCTC
AGGGTGCTCC CCAACGACAT CTACCTGCAG ATCGGCCTTT TGACCACGGT CGGCCTCTCC
ACCAAGAACG CCATCCTCAT CATCCAGTTC ATCAAGGACC AGATGAACCA GGGGCACGAA
CTGGTGGAGG CGACGCTCAC GGCCGTGAAG ATCAGGCTCC GGCCGGTGAT CATGACCTCG
CTGGCGTTCT TCTTCGGCAC GCTCCCCCTG GCGCTCACCA AGGGGGCAGG CGCCGGCGCC
CAGAACGCGA TCGGTACCGC GGTCACCGGC GGACTCTTGT CGGCGACATT CATCGACCTG
ATCTTCATCC CGTTCTTCTT CGTCATGGTG ACCAAGTACT TCATGAAGAA GAAGCCGGCA
ACGGAACCTG CAGCAACCCC CGTTTCGGAG GTACATTAA
 
Protein sequence
MSRFFINRPI FAWVIAIVVM LGGLLAIKAL PVSQYPPIAP PQISINAVYP GASAQTVQDT 
VTQVIEQKMN GIDNLLYMSS TSDSAGAVSI NLTFRAGTDP NVAQVQVQNK LQLATPLLPQ
VVQRQGVQVV KSTRNFLLIV GLVSEDESLN RHQLTDYMVS NIQDIVSRVQ GVGEVTVFGS
QNAMRVWMDA EKLNNYKLTP NDVVNALQAQ NAQVSAGQFG GQPAVQGQQL NATITARTLL
QSPEQFDQIV LRTNPDGSTV KLRDVAKTDV GTENYDILAR YKGKPVAAMA LRLAAGANAL
DTADRVKAKM AELEKFVPAG AKVVYPYDTT PFVKISIEEV LKTLMEAVFL VFIIMFLFLQ
NIRATLIPTI AVPVVLLGTL GILFAAGFSI NTLTMFALVI VIGLLVDDAI VVVENVERIM
TDEGLSPHDA TVKSMGQITS ALWGIATVLC AVFIPMAFFG GSTGVIYRQF SVTIVSAMIL
SVLTAQILTP ALCATLLKPV QKGHLPGEGG WFSGFFRWFN KVFDAARHKY ESIVGNSFGK
PLRYLFIYGC LVGIMIFLFL RLPTAFLPDE DQGFIVCQIQ LPAGATQERT LKALEQVERY
FLEKESKTVE SLITVAGFSF AGRGQNMGLA FVKLKDWKLR PTPDLKAPAL AGRAMGAFSQ
IKDGMAFAFS PPAVVELGQA NGFDFQLQDR GGLGHQALMD ARNQLLGMAM KNPKLMAVRP
NGQDDTPEFK LNIDDVRAGA LGVSLADVNN VLATAWGSSY VNDFLQDGRV KKVYVQADPK
YRMVPEDINR WYVRNNKGEM VPFSSFATAR WEYSSPRLER YNGIPSMEIM GSAAPGVSTG
EAMAEMEAIA EKLPQGISYE WTGLSYEEKA AGAQAPALYA ISLLVVFLAV AALYESWTIP
FVNLLMLPLG LVGAITAVTL RVLPNDIYLQ IGLLTTVGLS TKNAILIIQF IKDQMNQGHE
LVEATLTAVK IRLRPVIMTS LAFFFGTLPL ALTKGAGAGA QNAIGTAVTG GLLSATFIDL
IFIPFFFVMV TKYFMKKKPA TEPAATPVSE VH