Gene Ava_3838 details


Gene Information

Locus tag: Ava_3838
Symbol:
ID: 3678731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 4774887
End bp: 4778060
Gene Length: 3174 bp
Protein Length: 1057 aa
Translation table: 11
GC content: 46%
IMG OID: 637719190
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_324338
Protein GI: 75910042
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00914] heavy metal efflux pump (cobalt-zinc-cadmium)
            [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
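
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the Gene Information table.
START_BP = 4774887
END_BP = 4778060


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3174
print(protein_length(length_bp))  # 1057
```

Both results agree with the Gene Length and Protein Length fields in the record.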


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGTTG ACTTCTTTAT CAGGCGACCT GTATTTACAA GTGTCTGCGC CATTATCATT 
TTGCTGGTAG GCGCAATCAG TATACCCACC CTACCTACTG ATCGCTATCC AGAAATTAGC
CCTACACAAA TTGTTGTTAA TGCTAACTAC ACTGGGGCTA GCGCGGAAGT CGTGGAAAAT
ACGGTGACAA GTATCTTGGA GCGCCAAATC AACGGCGTTC AAGGTATGAA GTACATGACA
TCTAGCAGTA GTAACGATGG TACTAGCACC ATTACCGTCA CCTTTGACCC ATCACGGGAC
AAAGATATTG CCGCCGTTGA TGTCCAAAAC CGCGTATCTA TTGCCCAGCC CCAGTTACCA
GAGGCTGTAC AACGTACTGG AGTCACAGTC AGCAAACAGT CTAACAATAT CCTGTTAGCG
ATGGGGCTGT ATAGCGAACA ACAGGAATAT GACACCGTAT TTTTAAGTAA CTACGCCGAC
CAATATATAG TAGATGCCCT GAAAAGGCTT GATGGTGTGG GCGAGGCGCG GATTTTTGGT
GAACGGCGCT ATGCTATGCG ATTGTGGCTT GACCCGAATC GTCTAACCAG TCGCAAGCTG
ACTGCTCAAG ATGTAATTGA TGCCATCAAC GAACAAAACA TACAGGTAGG GGTGGGGCAA
ATTGGTCAGC AGCCGTCCCT GCCAGACCAG ATGTATCAAA TAGACTTACA GGCTCGCGGC
AGACTTTCCG AAGCAACGGA ATTTGCCGAC ATGGTAATTA AGACTGGAAC CGATGGCACA
CTTATCAAGC TCAAAGATGT GGGTCGGGCA GAATTAGGGG CAGAAAATTA TAGTTCGTTT
CTGCGATTCC GAGGTAATGA GGGTGTGGGT ATAGGGATAT TTCCTACCCC TGGTAGTAAT
GCTTTGGAAG TATCCAAGGC AGTAAAAGCC GAAATGGCTC GGTTAGCACA AAGCTTTCCC
CCAGGGTTGA AATATCAAGT GGCGTTTGAT ACGACCCTGT TTGTGGAAGA ATCCCTCGCG
GAAGTGGTCA AGACACTGAT AGAAGCCCTG ATTCTGGTTG TCTTGGTGAT TTTTCTCTTC
TTGCAAGACT GGCGTACAAC CCTGATTCCG GTAATTACCA TTCCCTTAAC TTTAATTGGG
ACTTTTGCCT TTATCAAAGT CTTTGGATTT TCCATTAACA CCTTAACTTT GTTTGGTTTG
ACATTAGCTA CGGGGTTAGT GGTAGACGAT GCGATTATCG TGGTGGAAAA CATCTCCCGG
TTAATTGAGG AAGAGGGAAT GTCGCCCCGT CAGGCTGCAT CCGAGTCCAT GCGGGAACTG
TTTGGGGCTG TAATTGCCAC TTCCTTAGTG TTGATGGCGG TATTTGTTCC TGTTGCCTTT
TTCCCAGGTA CTACAGGACA GATTTATAAA CAGTTTGCCC TGACCATCGC CTTTTCGATG
GCTATTTCCA CCTTTTTGGC TCTGACACTC ACACCCTCCC TTTCAGCCTT ATTATTACGT
CGGGGACAGC GCCCCCGTGG TTGGTTAGGC AAAATTTTTC ACCAGATTAA TAGGTTTTTG
GATTGGATGC GCCGGGGATA TGAGCGATCG CTCAATATTC TTGTGAAGGT TAGGGCGATC
GTTGTCTTAT TATTCCTAGC TTCCATCGCC TTGACAGGCT GGGTTTATCT CAGCGTACCC
ACAGCATTTA TCCCTGACGA AGATCAAGGC TATTTCATCA CCATCATCCA AGGGCCAGAA
GGAGTTTCCC TCAACTACAC CAGCAATGTC ATGGCTCAGG TGGAAAAAGA AATCCTCAAA
TTACCGGAAG TTACAGGAAC CTTTGCCATT GGTGGCTTTA GTTTTAGTGG TAACAGTGCC
AACAGTGGGG TAATTTTTAC TACCCTCCAG CCTTGGGATG AGCGCCAACA ACCAGGACAA
TCAGCACCAG AAATTATTGG CAAGTTGGCA GGAGTCTTCG CCAACATCAC CGAAGCCAGA
ATTTTCCCCG TTAGTCCTCC ACCAATCAAT GGTTTAGGCA ACTTCGGCGG TTTTCAATTT
GAACTGCAAG ATAGAGCAGG TAACAGTGGC TTAGATAATC TGCTGCAAGT CATGGGTCAG
ATCATGATGA AGGGGAATCA GACTCCTGGT TTGCAAGGTG TATTTAGTAC TTTTAGTGCT
AACACACCCC AGATGTTAAT AGACATCGAC CGCAACAAAG TCAAAGCCTT ACAAGTTAAT
GTAGATGACG TGTTCAACAC CCTGCAATCT TACTTGGGTT CCAGGTACGT CAACGATTTT
AATTTCCAAC AAAGAACTTA CCGGGTTTAC GTGCAAGCAG ATGCGCCATT TCGCTCTAAC
CCTGAGGATA TCGGTAAGTT ATATGTCCGC TCTGCCAACG ATCAAATGAT TTCCCTGAGT
AACCTAGTGA AAATTACTCC GGCGACTGGA GCGCAAACAA TCAACCACTA CAACCTATTC
CGCTCAATTA CCATCAACGG TGCGGCTGCG CCTGGTTATA GTTCTGGTCA AGCAACTACA
GCGATGGAGA AATTAGCAAA AGAAGTTTTA CCCGCCAGTA TGGGTTACGA ATGGTCGGGG
ATAACTGCGG AAGAACAAGA ATCAGGAGGT CAAGCACCTG TAATTTTTGG ATTAGGGTTA
GTCTTTGTTT TTCTCGTGTT GGCAGCTCAA TATGAGAACT ACGTTGACCC ATTAATTATT
ATGCTGTCAG TTCCCCTAGC TATTTTAGGA GCCTTATCAG CTCAGTGGCT GCGGGGTCTG
AGTAATGATG TCTTTTGCCA AGTAGGTTTA GTCATGTTGA TTGGCTTGGC TAGTAAAAAC
GCCATTTTGA TTGTGGAATT TGCCAATCAA TTGCGCGAAC GGGGCTTGCC CATTATCAAA
GCCACAGTGC AAGCATCACA AGAAAGGTTG CGACCTATTC TCATGACTTC CCTATCTTTT
ATTCTTGGTG TCTGGCCATT GATTAATCCT ATAGGAGCAG GTGCAGCGAG TAGAAAATCT
CTCGGTACAG CCGTGGCAGG GGGAATGATT GTCTCAACTG TGTTGAGTCT ATTTGTTGTC
CCCATTCTCT ACATCGTAAT TAGTAAAATC CGCGATCGCT TTAGACCAAA ACCCCCACAC
ATACCACTAG ATTCTACCCG TGATGACAAA ATTCCCTCGA CTAGTCATAG ATGA
 
Protein sequence
MFVDFFIRRP VFTSVCAIII LLVGAISIPT LPTDRYPEIS PTQIVVNANY TGASAEVVEN 
TVTSILERQI NGVQGMKYMT SSSSNDGTST ITVTFDPSRD KDIAAVDVQN RVSIAQPQLP
EAVQRTGVTV SKQSNNILLA MGLYSEQQEY DTVFLSNYAD QYIVDALKRL DGVGEARIFG
ERRYAMRLWL DPNRLTSRKL TAQDVIDAIN EQNIQVGVGQ IGQQPSLPDQ MYQIDLQARG
RLSEATEFAD MVIKTGTDGT LIKLKDVGRA ELGAENYSSF LRFRGNEGVG IGIFPTPGSN
ALEVSKAVKA EMARLAQSFP PGLKYQVAFD TTLFVEESLA EVVKTLIEAL ILVVLVIFLF
LQDWRTTLIP VITIPLTLIG TFAFIKVFGF SINTLTLFGL TLATGLVVDD AIIVVENISR
LIEEEGMSPR QAASESMREL FGAVIATSLV LMAVFVPVAF FPGTTGQIYK QFALTIAFSM
AISTFLALTL TPSLSALLLR RGQRPRGWLG KIFHQINRFL DWMRRGYERS LNILVKVRAI
VVLLFLASIA LTGWVYLSVP TAFIPDEDQG YFITIIQGPE GVSLNYTSNV MAQVEKEILK
LPEVTGTFAI GGFSFSGNSA NSGVIFTTLQ PWDERQQPGQ SAPEIIGKLA GVFANITEAR
IFPVSPPPIN GLGNFGGFQF ELQDRAGNSG LDNLLQVMGQ IMMKGNQTPG LQGVFSTFSA
NTPQMLIDID RNKVKALQVN VDDVFNTLQS YLGSRYVNDF NFQQRTYRVY VQADAPFRSN
PEDIGKLYVR SANDQMISLS NLVKITPATG AQTINHYNLF RSITINGAAA PGYSSGQATT
AMEKLAKEVL PASMGYEWSG ITAEEQESGG QAPVIFGLGL VFVFLVLAAQ YENYVDPLII
MLSVPLAILG ALSAQWLRGL SNDVFCQVGL VMLIGLASKN AILIVEFANQ LRERGLPIIK
ATVQASQERL RPILMTSLSF ILGVWPLINP IGAGAASRKS LGTAVAGGMI VSTVLSLFVV
PILYIVISKI RDRFRPKPPH IPLDSTRDDK IPSTSHR
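
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be read, the code below strips the block spacing from the first line of the gene sequence, translates it, and compares the result with the first two blocks of the protein sequence. The codon table uses the amino acid assignments of NCBI translation table 11 in the usual TCAG order. The helper names are hypothetical, and the GC fraction shown is for this 60-base excerpt only, not for the whole gene.

```python
from itertools import product

# First line of the gene sequence and first two protein blocks, copied from the record.
DNA_LINE = "ATGTTTGTTG ACTTCTTTAT CAGGCGACCT GTATTTACAA GTGTCTGCGC CATTATCATT"
PROTEIN_BLOCKS = "MFVDFFIRRP VFTSVCAIII"

# Amino acid assignments of NCBI translation table 11, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the given sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


dna = strip_blocks(DNA_LINE)
print(len(dna))                                       # 60
print(translate(dna))                                 # MFVDFFIRRPVFTSVCAIII
print(translate(dna) == strip_blocks(PROTEIN_BLOCKS)) # True
print(round(gc_fraction(dna), 3))
```

Applying the same steps to the full gene sequence would be expected to give the listed protein followed by a stop, since the gene sequence ends in TGA.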