Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1989 |
Symbol | |
ID | 4710320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2192146 |
End bp | 2195241 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639856462 |
Product | acriflavin resistance protein |
Protein accession | YP_001003555 |
Protein GI | 121998768 |
COG category | [V] Defense mechanisms |
COG ID | [COG0841] Cation/multidrug efflux pump |
TIGRFAM ID | [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTCT CCGAGCTCTC CATACGCCGC CACGTCCTGG CGGTGATGAT CAGCGCGCTG ATCGTGCTCG TCGGCGCGAT CGCCTATCAG GATATCGGCA CCGACCGGAT CCCCAACATC GACTTCCCCA TGGTCAGCGT CACCGTTCAT CAGGACGGCG CCGACCCGGA GCTGATGGAC TCGGCGGTGA CCGACGAGGT CGAGCGAGCA GTGAACACGG TCCCCGGGAT CGACGACATC CAGTCGGTCT CCCTGCCCGG GACCTCGGTG GTTACCGTGC AGTTCGACCT GGACGTCGAC GTCGATGTGG CCTTCAACGA GGTCAACGCC AAGGTCTCCG AGATCGTCCA GGACCTGCCG GAGGACGCCG AGGCGCCGGT GGTCGATAAG GTGGAGATGG ACGCCCAGCC GATCATGTGG CTATCGCTCC AGGGCGACCG CACGGCGCAG GATCTCGACC GCTACGCCCG CAACGAGGTG CGCCCGCAGA TTGAGGGGAT CACCGGCGTC GGCGAAGTGC GGATCGGCGG CTCCCGGGAG CGGACCATGC GCCTGGAGGT CGACCCCGAC CGCATGGCCG CCCACAACGT CGACGCCCAA TCGATCATGC AGGCCCTCGA AGAGGAGCAC GCCCAGATGC CGGGCGGCTT CCTGGTCGGC GGTGACACCG AGGAGATGCT CAAGCTCGAC GTGGAGTACC ACGACGCCGA TGGCCTCGAA GCGATGATCA TCGCCGAGGA CGGCGACGAT CGGGTCCGTT TCCGCGATAT CGGCACCGTC GAGGACGGCC TGGCCGATAT GCGCCAGCTC GCCCGCTTCA ATGGCGAGCC GACTATCGGC CTCGGCGTGG TGAAGGTCTC CGGGGCGAAT ACCGTCGCCA TCATCGACGA GGTCAAGGAA CGCCTGGACG ACCAGATCCG GCCGCAGTTG CCGCCGGGGC TCGACTTGCA CATCTCCACC GACGAGTCGC AGTTCATCCT CGAGCAGATC GACTCGCTCT ATCTGACCAT CGCCCTGGGC ATCCTTTTCG CCGCCCTGGT GATGTGGCTG TTCCTGCGCA ACCTGCGCTC GACGGCCATC GTCTCGCTGG CCATCCCCAT CTCGCTGATG GCCGCGATTG CGGTGATCTA CTTCTTCGGC TACACGCTGA ACTCGATCAC CATGCTGGCC ATGCTCCTGC TCATCGGCGT GGTGGTCGAC GACGCCATCG TCGTGCTGGA GAACATCTAC CGACACCGCC AGCAGTACCG ACAAGGGCCC ATGGCGTCGG CGATCTCCGG CTCCAACGAG GTCTTCTTCG CGGTCATCGC CTCCACCCTC ACCCTGGTCT CGATCTTCGG TTCGGTGATC TTCATGGAGG CGATCATCGG CCGGTTCTTC GAGTCGTTCG CGGTGGTGGT CGCCTTCGGC GTGCTGGCCT CGAGCATCGT CGCCCTGACC CTGATCCCGA TGCTCGCCTC GCGCTTCCTC CACGTCCCGG AGCAGGAGGG GGCCTTCTAC CGGGCTCTCG AGCAAGGCTT CCAGGCCATC GAGCGGACCT ACCGCTGGTC CCTGGGCTAC GTCCTGCGCT TCCGCTGGAC CACCCTGGGC CTGGCCCTGC TCTTTGCCGT GGCGGTGGCC GTGCTCATCG CACCGCGTCT CGGCGGCGAG TTCGCCCCGG ACGAGGACAC CGGCGAGTTC ATGGTCAACT TCCAGACCCC GCTGGGCTCC GGGATCGAGT ACACCGATCG GCAGATGCGC GAGATCGAGG CCATCCTCGA GGAACAGCCC GAGGTGGACC GCTACTTCTC GGCCATCGGC ATCGGTGACC GCGGCCAGGT CAACCGCGGC ATCGCCTTCG TCCGGATGGT CGAGCGCGAT GAGCGGGACG CCTCGCAGCA GGAGGTGGTG CAGCGGATCC GGCCCGAACT GGCCGAACTG CCCGGGGTGC GCGCCTTCGC CTCGGACGTG CCCTTCGTCC CCGGCCAGCG CGGCGAACCC CTGCAGTTCG TGGTCACCGG CCCCAACGTC GAGGAACTGG GCGACCACGC CCGGGAGATC CAGGAGCGCC TGGAGGCCGT CGACGGCATG GGCAGCATCG ACTTGGAGCT CGACCTGGAG CTGCCCCAGG TCGAGGTCGA GGTGGACCGG GAACAGGCCC GGAGCCTCGG CCTCAACGCC CGGGACATCG CCCAGACCAT CAACGTCCTC GCCGGTGGCT TCAACGTCGC CCGCTTCGAC GACGGCCCCG GCGGTCAGGA CGGGCGGCGC TACGACATCC GCCTCAAGGC CCAGGAGGAC ATGATCGGCG ATATCGATGA CCTACAGCGG ATCCAGCTCC TCTCCGAGCA CGGCGAGATG GTCCCGCTAG AGGCGGTGAC CACCATGGAG GAGACCCTAG GACCGGCGGC GATCACCCGC CACAACCTCT CCTACTCGGC GGAGTTCTAC GCCGACCCGG ATCTGCCCCT GGCCGAGGCC GTCGACGAGC TCAATGCGGT GGCCGACGAC GTCCTGCCGC TGAACTTCGA CGTGGAGCTG GTCGGCCAGG CCGAGGAGAT GGAGCGGGCC GTGACGGCGA TGCTCTTCGT GCTCTTTCTG GCGGCGACGC TGGTCTATAT CGTGCTGGCC AGCCAGTTCA ACTCCTTCGT CCAGCCGCTG CTGATCATGG CAGCGCAGCC CCTGGCGCTC ATCGGCGGGC TGCTCGGCCT GTGGGTGGGC GGCTTCACGC TCAACATCTA CTCGATGATC GGCATGGTGC TGCTCATGGG GCTGGTCACC AAGAACGGGA TCCTGCTGGT GGACCTGACC AACCAGTACC GGGAGAAACG GGATCTCTCC ATCAACGAGG CCCTGGCCGA GGCGTGCCCG ATCCGTCTGC GCCCGGTGCT GATGACCTCG CTGACCCTGA TCCTTGCCCT GATCCCGGCG GCCGCAGGGC TGGGCGCCGG ATCGGAGACC AACGCCCCGA TGGCAGCGGC GATCATCGGC GGTATGATCA CCGCCATGCT GCTCACCCTG GCCGTGATCC CGGCGGCCTA CTCCCTGCTC GAGGGCTACC TCTCGCGGCG GGGCTGGGGC CGGCTGAGCA AGATCGCCGA GGACTACTTC GGATGA
|
Protein sequence | MTLSELSIRR HVLAVMISAL IVLVGAIAYQ DIGTDRIPNI DFPMVSVTVH QDGADPELMD SAVTDEVERA VNTVPGIDDI QSVSLPGTSV VTVQFDLDVD VDVAFNEVNA KVSEIVQDLP EDAEAPVVDK VEMDAQPIMW LSLQGDRTAQ DLDRYARNEV RPQIEGITGV GEVRIGGSRE RTMRLEVDPD RMAAHNVDAQ SIMQALEEEH AQMPGGFLVG GDTEEMLKLD VEYHDADGLE AMIIAEDGDD RVRFRDIGTV EDGLADMRQL ARFNGEPTIG LGVVKVSGAN TVAIIDEVKE RLDDQIRPQL PPGLDLHIST DESQFILEQI DSLYLTIALG ILFAALVMWL FLRNLRSTAI VSLAIPISLM AAIAVIYFFG YTLNSITMLA MLLLIGVVVD DAIVVLENIY RHRQQYRQGP MASAISGSNE VFFAVIASTL TLVSIFGSVI FMEAIIGRFF ESFAVVVAFG VLASSIVALT LIPMLASRFL HVPEQEGAFY RALEQGFQAI ERTYRWSLGY VLRFRWTTLG LALLFAVAVA VLIAPRLGGE FAPDEDTGEF MVNFQTPLGS GIEYTDRQMR EIEAILEEQP EVDRYFSAIG IGDRGQVNRG IAFVRMVERD ERDASQQEVV QRIRPELAEL PGVRAFASDV PFVPGQRGEP LQFVVTGPNV EELGDHAREI QERLEAVDGM GSIDLELDLE LPQVEVEVDR EQARSLGLNA RDIAQTINVL AGGFNVARFD DGPGGQDGRR YDIRLKAQED MIGDIDDLQR IQLLSEHGEM VPLEAVTTME ETLGPAAITR HNLSYSAEFY ADPDLPLAEA VDELNAVADD VLPLNFDVEL VGQAEEMERA VTAMLFVLFL AATLVYIVLA SQFNSFVQPL LIMAAQPLAL IGGLLGLWVG GFTLNIYSMI GMVLLMGLVT KNGILLVDLT NQYREKRDLS INEALAEACP IRLRPVLMTS LTLILALIPA AAGLGAGSET NAPMAAAIIG GMITAMLLTL AVIPAAYSLL EGYLSRRGWG RLSKIAEDYF G
|
| |