Gene Hhal_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1989 
Symbol 
ID4710320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2192146 
End bp2195241 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content67% 
IMG OID639856462 
Productacriflavin resistance protein 
Protein accessionYP_001003555 
Protein GI121998768 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTCT CCGAGCTCTC CATACGCCGC CACGTCCTGG CGGTGATGAT CAGCGCGCTG 
ATCGTGCTCG TCGGCGCGAT CGCCTATCAG GATATCGGCA CCGACCGGAT CCCCAACATC
GACTTCCCCA TGGTCAGCGT CACCGTTCAT CAGGACGGCG CCGACCCGGA GCTGATGGAC
TCGGCGGTGA CCGACGAGGT CGAGCGAGCA GTGAACACGG TCCCCGGGAT CGACGACATC
CAGTCGGTCT CCCTGCCCGG GACCTCGGTG GTTACCGTGC AGTTCGACCT GGACGTCGAC
GTCGATGTGG CCTTCAACGA GGTCAACGCC AAGGTCTCCG AGATCGTCCA GGACCTGCCG
GAGGACGCCG AGGCGCCGGT GGTCGATAAG GTGGAGATGG ACGCCCAGCC GATCATGTGG
CTATCGCTCC AGGGCGACCG CACGGCGCAG GATCTCGACC GCTACGCCCG CAACGAGGTG
CGCCCGCAGA TTGAGGGGAT CACCGGCGTC GGCGAAGTGC GGATCGGCGG CTCCCGGGAG
CGGACCATGC GCCTGGAGGT CGACCCCGAC CGCATGGCCG CCCACAACGT CGACGCCCAA
TCGATCATGC AGGCCCTCGA AGAGGAGCAC GCCCAGATGC CGGGCGGCTT CCTGGTCGGC
GGTGACACCG AGGAGATGCT CAAGCTCGAC GTGGAGTACC ACGACGCCGA TGGCCTCGAA
GCGATGATCA TCGCCGAGGA CGGCGACGAT CGGGTCCGTT TCCGCGATAT CGGCACCGTC
GAGGACGGCC TGGCCGATAT GCGCCAGCTC GCCCGCTTCA ATGGCGAGCC GACTATCGGC
CTCGGCGTGG TGAAGGTCTC CGGGGCGAAT ACCGTCGCCA TCATCGACGA GGTCAAGGAA
CGCCTGGACG ACCAGATCCG GCCGCAGTTG CCGCCGGGGC TCGACTTGCA CATCTCCACC
GACGAGTCGC AGTTCATCCT CGAGCAGATC GACTCGCTCT ATCTGACCAT CGCCCTGGGC
ATCCTTTTCG CCGCCCTGGT GATGTGGCTG TTCCTGCGCA ACCTGCGCTC GACGGCCATC
GTCTCGCTGG CCATCCCCAT CTCGCTGATG GCCGCGATTG CGGTGATCTA CTTCTTCGGC
TACACGCTGA ACTCGATCAC CATGCTGGCC ATGCTCCTGC TCATCGGCGT GGTGGTCGAC
GACGCCATCG TCGTGCTGGA GAACATCTAC CGACACCGCC AGCAGTACCG ACAAGGGCCC
ATGGCGTCGG CGATCTCCGG CTCCAACGAG GTCTTCTTCG CGGTCATCGC CTCCACCCTC
ACCCTGGTCT CGATCTTCGG TTCGGTGATC TTCATGGAGG CGATCATCGG CCGGTTCTTC
GAGTCGTTCG CGGTGGTGGT CGCCTTCGGC GTGCTGGCCT CGAGCATCGT CGCCCTGACC
CTGATCCCGA TGCTCGCCTC GCGCTTCCTC CACGTCCCGG AGCAGGAGGG GGCCTTCTAC
CGGGCTCTCG AGCAAGGCTT CCAGGCCATC GAGCGGACCT ACCGCTGGTC CCTGGGCTAC
GTCCTGCGCT TCCGCTGGAC CACCCTGGGC CTGGCCCTGC TCTTTGCCGT GGCGGTGGCC
GTGCTCATCG CACCGCGTCT CGGCGGCGAG TTCGCCCCGG ACGAGGACAC CGGCGAGTTC
ATGGTCAACT TCCAGACCCC GCTGGGCTCC GGGATCGAGT ACACCGATCG GCAGATGCGC
GAGATCGAGG CCATCCTCGA GGAACAGCCC GAGGTGGACC GCTACTTCTC GGCCATCGGC
ATCGGTGACC GCGGCCAGGT CAACCGCGGC ATCGCCTTCG TCCGGATGGT CGAGCGCGAT
GAGCGGGACG CCTCGCAGCA GGAGGTGGTG CAGCGGATCC GGCCCGAACT GGCCGAACTG
CCCGGGGTGC GCGCCTTCGC CTCGGACGTG CCCTTCGTCC CCGGCCAGCG CGGCGAACCC
CTGCAGTTCG TGGTCACCGG CCCCAACGTC GAGGAACTGG GCGACCACGC CCGGGAGATC
CAGGAGCGCC TGGAGGCCGT CGACGGCATG GGCAGCATCG ACTTGGAGCT CGACCTGGAG
CTGCCCCAGG TCGAGGTCGA GGTGGACCGG GAACAGGCCC GGAGCCTCGG CCTCAACGCC
CGGGACATCG CCCAGACCAT CAACGTCCTC GCCGGTGGCT TCAACGTCGC CCGCTTCGAC
GACGGCCCCG GCGGTCAGGA CGGGCGGCGC TACGACATCC GCCTCAAGGC CCAGGAGGAC
ATGATCGGCG ATATCGATGA CCTACAGCGG ATCCAGCTCC TCTCCGAGCA CGGCGAGATG
GTCCCGCTAG AGGCGGTGAC CACCATGGAG GAGACCCTAG GACCGGCGGC GATCACCCGC
CACAACCTCT CCTACTCGGC GGAGTTCTAC GCCGACCCGG ATCTGCCCCT GGCCGAGGCC
GTCGACGAGC TCAATGCGGT GGCCGACGAC GTCCTGCCGC TGAACTTCGA CGTGGAGCTG
GTCGGCCAGG CCGAGGAGAT GGAGCGGGCC GTGACGGCGA TGCTCTTCGT GCTCTTTCTG
GCGGCGACGC TGGTCTATAT CGTGCTGGCC AGCCAGTTCA ACTCCTTCGT CCAGCCGCTG
CTGATCATGG CAGCGCAGCC CCTGGCGCTC ATCGGCGGGC TGCTCGGCCT GTGGGTGGGC
GGCTTCACGC TCAACATCTA CTCGATGATC GGCATGGTGC TGCTCATGGG GCTGGTCACC
AAGAACGGGA TCCTGCTGGT GGACCTGACC AACCAGTACC GGGAGAAACG GGATCTCTCC
ATCAACGAGG CCCTGGCCGA GGCGTGCCCG ATCCGTCTGC GCCCGGTGCT GATGACCTCG
CTGACCCTGA TCCTTGCCCT GATCCCGGCG GCCGCAGGGC TGGGCGCCGG ATCGGAGACC
AACGCCCCGA TGGCAGCGGC GATCATCGGC GGTATGATCA CCGCCATGCT GCTCACCCTG
GCCGTGATCC CGGCGGCCTA CTCCCTGCTC GAGGGCTACC TCTCGCGGCG GGGCTGGGGC
CGGCTGAGCA AGATCGCCGA GGACTACTTC GGATGA
 
Protein sequence
MTLSELSIRR HVLAVMISAL IVLVGAIAYQ DIGTDRIPNI DFPMVSVTVH QDGADPELMD 
SAVTDEVERA VNTVPGIDDI QSVSLPGTSV VTVQFDLDVD VDVAFNEVNA KVSEIVQDLP
EDAEAPVVDK VEMDAQPIMW LSLQGDRTAQ DLDRYARNEV RPQIEGITGV GEVRIGGSRE
RTMRLEVDPD RMAAHNVDAQ SIMQALEEEH AQMPGGFLVG GDTEEMLKLD VEYHDADGLE
AMIIAEDGDD RVRFRDIGTV EDGLADMRQL ARFNGEPTIG LGVVKVSGAN TVAIIDEVKE
RLDDQIRPQL PPGLDLHIST DESQFILEQI DSLYLTIALG ILFAALVMWL FLRNLRSTAI
VSLAIPISLM AAIAVIYFFG YTLNSITMLA MLLLIGVVVD DAIVVLENIY RHRQQYRQGP
MASAISGSNE VFFAVIASTL TLVSIFGSVI FMEAIIGRFF ESFAVVVAFG VLASSIVALT
LIPMLASRFL HVPEQEGAFY RALEQGFQAI ERTYRWSLGY VLRFRWTTLG LALLFAVAVA
VLIAPRLGGE FAPDEDTGEF MVNFQTPLGS GIEYTDRQMR EIEAILEEQP EVDRYFSAIG
IGDRGQVNRG IAFVRMVERD ERDASQQEVV QRIRPELAEL PGVRAFASDV PFVPGQRGEP
LQFVVTGPNV EELGDHAREI QERLEAVDGM GSIDLELDLE LPQVEVEVDR EQARSLGLNA
RDIAQTINVL AGGFNVARFD DGPGGQDGRR YDIRLKAQED MIGDIDDLQR IQLLSEHGEM
VPLEAVTTME ETLGPAAITR HNLSYSAEFY ADPDLPLAEA VDELNAVADD VLPLNFDVEL
VGQAEEMERA VTAMLFVLFL AATLVYIVLA SQFNSFVQPL LIMAAQPLAL IGGLLGLWVG
GFTLNIYSMI GMVLLMGLVT KNGILLVDLT NQYREKRDLS INEALAEACP IRLRPVLMTS
LTLILALIPA AAGLGAGSET NAPMAAAIIG GMITAMLLTL AVIPAAYSLL EGYLSRRGWG
RLSKIAEDYF G