Gene PCC8801_3855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3855 
Symbol 
ID7105606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4036970 
End bp4040095 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content43% 
IMG OID643476859 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionYP_002373960 
Protein GI218248589 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00914] heavy metal efflux pump (cobalt-zinc-cadmium)
[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTT TTATCAAACG CCCCGTTTTT GCCACTGTTT GCGCTTTAGT TATCCTACTG 
GTGGGAATAA TTTGTTTATT AACCTTACCA GTCGCTCGCT TTCCTGATAT TAGTCCGACA
ACGATTGAAG TACAAGCCAC CTATAGCGGC GCAAGTGCAG AAGTCGTCGA AAATGCCGTA
ACCAATATTC TAGAACGACA AATTAATGGG GTAGAAGGAC TCAGATACCT GACATCAAGT
AGCAGTAACA ACGGAACCAG TACTATTAGT GCTGTCTTTG ATGCTTCGCG CAATAAAGAC
ATCGCCGCCG TTGATATTCA AAATCAAGTC TCTATCGTTG AATCGCAACT GCCCCAAGAG
GTACAACAGT CAGGTATCTC TGTGACCAAA CAATCGAATA ATTTTTTGTT GGGAATCGCC
TTATATAGTG ACGATAATCG ATACGATAAC GTTTTCTTGA GTAATTACGC CGATCGCTAC
CTCGTTGATA ATTTAAAACG TATTAAAGGG GTCAGTAATA TTCAGATTTT TGGGGAACGT
CGCTATGCTA TGCGTCTTTG GCTTGATGCC AACCGTTTAG CGAGTCGGGG ACTCACCACC
CAAGATGTGG TAGATGCCCT GTCTAAACAA AATCGACAAG TAGGGGCAGG AAAAATTGGC
GGTGAACCTG CCATTGAAGG GCAACAATAT CAACTCGATC TTAGGGCAGT CAGTCAATTA
ACAACCCCTG AAGAATTTAA CAATCTGTTG CTCAAAACCG ATGAAAACGG CGCAATAATT
CGCTTAAAAG ACGTAGGACG CGCAGAATTA GGAGCCCAAG ATTATAACAC CTTCCTGCGG
TACCGTGGCC AAGACGCGAT CGGGATGGGG GTTTTACAGC TAACGGGCTC AAATGCCCTC
GACGTTGCCC ACAAAGTCAA AGAGGAAATA AAAAAAATCG CCCTTTCCTT TCCCCCTGGA
CTGAAATACA CTATCGCTTT TGATACCACG CAATTTGTTG AAGAATCTCT AGCGGAAGTT
GTTAACACGA TTATCATGTC TGTGGTGTTA GTTGTGTTAG TTATCCTCCT ATTTCTGCAA
AACTGGCGCA CGACTCTGAT TCCTTCTTTA ACGATTCCCT TTGCCTTATT CGGAACCTTT
GCTTTTGTTA AATTCTTTGG TTTTTCGATC AATACTTTAA CCCTATTTGG CTTGAGTTTA
GGGACAGGAA TGGTCGTCGA TGATGCGATT ATTGTGGTTG AACAAATCAG TCGCTACATT
GAAGATCAGG GGATGTCTCC CTATGAAGCG ACTATTCAAG CCATGAGAGA ATTATTTAGC
GCAGTCATTG CGACCTCTTT GGTGTTAATG GCAGTATTTA TCCCTGTCGC TTTCTTTCCT
GGGACAACAG GGGCATTATA TCAACAATTT GCCTTAACTA TTGCTTTTTC CATCACTATC
TCCACCTTTC TCGCTGTCAC CTTAACCCCT GCCTTGTGTG CTTTAATTCT CAGGAAAGGA
CAAGGGTTTC CCGGGTGGGT CGGGGTGATG TTCCATCGCT TTAATGAGTT TTTAGACTGG
GTTAGACGAC AATATCGGCG ATCGCTTTTA GGGTTAGTCC AGATTAAACG GTTAATGGTG
GGAGTCTTTA TTCTTTTACT AGGCCTAACC GCTTGGCTTT ATCTGAAAGT TCCTACCGCG
TTCCTCCCCG ATGAAGATCA AGGCTATTTT CTAACCATTA TCAAAGCTCC TGACGGCGTA
TCGCTGCAAT ATACCAATGA GGTGATGCGA AAAGTGGAAA CCGCAATTTT AGAAGAACCC
GAAATTGACG GAACTTTTGC GATCGCCGGG TTTAGCGTCA CCGGAAGTAG TGCCAATGAT
GCGGTCATTT TTAGCCCCCT CAAGCCTTTT TCTGAACGCC GCAGTCCCCA AAATTCAGCC
CAAGCGGTTA TTGGACGACT TTGGGGAAAA TTTCCCCAAA TACCTGAAGC CCAAATTTTC
CCGGTTAACC CCCCATCTAT CCAGGGATTA GGCAGTTTTA GCGGGTTTAA CTTCCAATTA
CAAGATCGCT CAGGGTCCGA AGATATCAGC ACGCTAGTCG AGGCCGCCGG GCAATTATTA
GGATTAGCCA ATCAAAGTCC CCAATTAGCC GGAGCTTTTA GTCGGTTTTC TGCTAATACT
CCCCAATTAT TAATCGAAGT GAACCGCAAT AAAGCTCAAG CCCTCCAGGT TTCTATTGAT
GATATTTTCA GTACTCTCCA AACCGCGTTA GGGTCACGGT ATGTCAATGA TTTCACCCTC
CAACAACGAA CCTATCGGGT TTATGTGCAA GCTGATCGAG AATTTAGATC GAACCCCGAT
GATATTAAAA AGCTCTATGT TCGCTCAGCA ACGGGTCAAA TGATTCCTCT GTCTAACTTA
GTGAAGGTGA CTTCTACGGT CGGACCCCAA ACAATTAATC ATTATAATCT CTACCGCTCT
ACTGAAATTA ACGGCGATGC AGCCCCTGGA GTTAGTTCTG GACAAGCCAT GCAAGCAATG
GCAATGGCAG CCAAACAAGC TCTACCGCAA GGGTTTGGTT ATGAATGGTC AGGAATTTCT
TTAGAAGAAA TCGAATCAGG AGGTTTAGCT CCGATTATTT TCGGATTAGG CTTAATTTTT
GTCTTTTTAG TCCTAGCAGC CCAATATGAA AATTATATCG ATCCATTTAT TATTCTTTTG
GCGGTTCCTT TAGCCATTTT AGGTGCATTA ATTGCTCAAA GTTTACGCGG ATTTCCTAAT
GATATTTACT GTCAAATTGG GTTAGTTATG TTGATTGGAT TAGCTAGTAA AAATTCGATT
CTTATTGTAG AATTTGCTAA CCAATTACGG GAAGAAGGTT TACCTATTGT GAAAGCCGCC
GTAGAAGCTG CCCAAGAACG ACTCAGACCC ATTGTTATGA CAGCTTTTTC AACTTTATTA
GGGACTTTAC CCTTAATGAT TGCAACAGGA GCCGGATCCG GAAGTCGTCA ATCTTTAGGA
ACAGCCGTTT TTGGGGGAAT GTTTATCGCT ACTTTTTTAA GTTTATTTCT CGTTCCTATT
TTGTATATTA TGGTTAAAAT AATGACTGAA AAAATCTCAA ATTCGAGCCT AGAAAATAGG
GAATAA
 
Protein sequence
MDFFIKRPVF ATVCALVILL VGIICLLTLP VARFPDISPT TIEVQATYSG ASAEVVENAV 
TNILERQING VEGLRYLTSS SSNNGTSTIS AVFDASRNKD IAAVDIQNQV SIVESQLPQE
VQQSGISVTK QSNNFLLGIA LYSDDNRYDN VFLSNYADRY LVDNLKRIKG VSNIQIFGER
RYAMRLWLDA NRLASRGLTT QDVVDALSKQ NRQVGAGKIG GEPAIEGQQY QLDLRAVSQL
TTPEEFNNLL LKTDENGAII RLKDVGRAEL GAQDYNTFLR YRGQDAIGMG VLQLTGSNAL
DVAHKVKEEI KKIALSFPPG LKYTIAFDTT QFVEESLAEV VNTIIMSVVL VVLVILLFLQ
NWRTTLIPSL TIPFALFGTF AFVKFFGFSI NTLTLFGLSL GTGMVVDDAI IVVEQISRYI
EDQGMSPYEA TIQAMRELFS AVIATSLVLM AVFIPVAFFP GTTGALYQQF ALTIAFSITI
STFLAVTLTP ALCALILRKG QGFPGWVGVM FHRFNEFLDW VRRQYRRSLL GLVQIKRLMV
GVFILLLGLT AWLYLKVPTA FLPDEDQGYF LTIIKAPDGV SLQYTNEVMR KVETAILEEP
EIDGTFAIAG FSVTGSSAND AVIFSPLKPF SERRSPQNSA QAVIGRLWGK FPQIPEAQIF
PVNPPSIQGL GSFSGFNFQL QDRSGSEDIS TLVEAAGQLL GLANQSPQLA GAFSRFSANT
PQLLIEVNRN KAQALQVSID DIFSTLQTAL GSRYVNDFTL QQRTYRVYVQ ADREFRSNPD
DIKKLYVRSA TGQMIPLSNL VKVTSTVGPQ TINHYNLYRS TEINGDAAPG VSSGQAMQAM
AMAAKQALPQ GFGYEWSGIS LEEIESGGLA PIIFGLGLIF VFLVLAAQYE NYIDPFIILL
AVPLAILGAL IAQSLRGFPN DIYCQIGLVM LIGLASKNSI LIVEFANQLR EEGLPIVKAA
VEAAQERLRP IVMTAFSTLL GTLPLMIATG AGSGSRQSLG TAVFGGMFIA TFLSLFLVPI
LYIMVKIMTE KISNSSLENR E