Gene Hhal_1738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1738 
Symbol 
ID4710443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1905560 
End bp1908676 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content68% 
IMG OID639856206 
Productacriflavin resistance protein 
Protein accessionYP_001003304 
Protein GI121998517 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATCCTCT CCGACATTGC GATCCGTCGT CCCGTCCTGG CCACCGTGGC CAGCGTGTTG 
ATCGTCGTCC TCGGCATCGC CTCCCTCGCT CAGTTGCCGG TACGCGAGTA CCCCGACATC
GATCCCCCGC TGGTCTCCAT CGAGACCCGC TATACCGGTG CCGCCCCGGC GGTGATCGAC
ACCGAGATCA CCGAGCGGAT CGAGTCGGCA GTGAGCAGCG TCGACGGCAT CCGTTCGATG
ACCTCGGAGA GCCGCGACGG GCGGGGGGAG ACGAACATCG AATTCGAGCT GGGGCGGGAT
ATCGACGCTG CGGCCAACGA TGTGCGCGAC GCCATCGGGC GGATCCTCGA TGACTTGCCG
GTGGATGCCG ACCCGCCGGT CGTCGCCAAG ACCGAGGCCG ATGCCCGGCC GATGATGTGG
TTGTCGCTAA GCAGCAGCCG GATGACCCCC GAGGAGATGA CCGACTACGC CGATCGACAA
CTGGTCGATC GTCTCTCGGT GCTTGACGGT GTGGCGCGGG TGCGCATTGG TGGCGAGCGC
CGCTACGCCA TGCGCGTCTG GCTCGACAGT GACGCCCTGG CGGCCCGTGA TCTAACCGTC
GAGGACGTGG AGAACGCCCT GCGCCGCGAG AATATCGAGT TGCCTGCCGG CGAGATCGAG
TCGACCGCCC GGCAGCTGAC CGTGCGGACC GACACTCGAT TGCGCGACCG CGACGCGTTC
GCCCGGCTGG TAATCGAGCG CCGCGGTGAC GAGCTGGTCC GTCTGGAGGA AGTGGCGCGA
GTGGAGCGGG GCGTCGAAGA CGACGACACC GCCGTGCGCC TGAACGGTGA GACCGCCGTA
GGCCTGGGCG TCATCCGGCA GTCCCGTGCC AACACCATCG CCGTTGCCGA TGCCGTACTC
GAGGAGATGG CGCGGATCGA GGAGCAGCTG CCCGGCGATC TGCGCCTGGT CGTCGGTTAC
GATGAATCGC AGTTCGTGCG TCAGTCGATC CGCGAGGTTG TGCGCACCCT GCTGATCGCC
GTGGCTCTCG TGGTCCTGAC CATCTTCATC TTCCTCCGCA GCCTGCGGGC GACCCTGATC
CCTTCGGTGA CCATCCCGGT GGCGGTGATT GGTGCCTTCA CCGTCATGGC TCCCCTGGGG
TTCTCACTGA ATGTGCTGAC GCTGCTGGCC CTGATCCTGG CCATCGGGCT GGTGGTCGAC
GACGCCATCG TGGTGCTGGA GAACATCCAG CGGCGCATCG ATGAGGGCGA GCCGCCCTTG
CTGGCGGCCT ACCGCGGCGT GCGGCAGGTG GGGTTCGCGG TCATCGCCAC CACCATCTCC
CTGGTAGCAG TCTTCGTGCC CATCGCCTTC ATGGAGGGCA ATGTGGGGCG GCTGTTCACT
GAGTTCGGGC TGGTGCTGGC GGCGGCGGTG GTCTTCTCCA GCTTCGTCGC CCTGACCCTG
ACCGGCATGC TCTGCTCCAA GTGGCTGCGC CCTCGCGCGG AGCACCCGGG GCGGCTGCAG
GTGGCCACCG AACGCGTCTT GACCGGTCTG ACACTCGGCT ACCGCCGCCT GCTCGGTCGC
GCCCTGGGGA TGCCCATCGC CGTGCTCGGC GTCGGGGTGG CCGCCGCCGT GGGCGCCTAC
GCCCTGTACC AGGCACTGCC TCAGGAGCTG ATCCCCACCG AGGATCGCGG GGTGTTCATC
GTCCCGGCCA GCGCGCCGGA GGGCAGCACC GCGGCGCACA CGGACGCGAG TGTGCGCGAG
CTGGAGGCGA TTCTGCGCCC CCTGCGCGAG GACACCGGCG AGGCCCGTCG CGTGCTGACC
ATCCTCGGTT TTGGTGGGCG CGCCAACAGC GCCTTCATCA TTGTCGGCTT GGAGGACTGG
GCGGAGCGCT CGCGTAGCCA ACAGCAGATC GTCGCCGAGA CCATGCCTCG GCTGATGACT
GTGCCCGGTG TGCGTGCCTT CGCTGTGAAC CCGCCAGGGC TCGGCCAGAG CGGTTTCCAG
CAACCGGTGC AGTTCGTCAT CGGCGGCACC GATTACCACG AGGTGGCCGA TTGGGCCGAG
CGGGTGCTGG ATCGGGCCCG GGCCGACAAC CCGCGGCTGC TCAATCTCGA CAGCAGCTAC
GACGCAACGC GTCCGCAGCT GAACGTGCGG ATCGACCGGG ATCGGGCCGC CGACCTCGGG
GTCAGTGCCG AGGCGCTGGG GCGTACCGTG CAGACCCTAC TGGCGTCGCG CACCGTGACG
GCCTACCTGG ATCGGGGCCG GGAGTACGAC GTCATGCTGC AGGCCGAGGC CGGCGACCGG
CGCTACCCCG CCGACCTCGA TCGCCTGCAC GTGCGCAGCC TCAATGACGG GGCACTGATC
CCGCTCTCGG CCCTGGTGAC GCTGGAGGAG GAGGGGGCGC CTCCGGTGCT CAGTCGCGTC
GACCGCTTGC CTGCGGTGAC GCTCACCGCG TCGCTGGCGC CGGGCTATGA CCTGGGCGCG
GCCCTGGCCT ATCTCGAGCA GGTGGCTGCC GAGGAGTTGC CGGCGACGGC GCGGGTCAGC
TACCTGGGGC TGTCGGATGA GTTCAAGCGC GCCGGCGGCG CTGTGTTCCT CACCTTCGGC
CTGGCCCTGC TGATCGTCTT CCTGGTGCTC TCGGCGCAGT TCGAGAGCTT CATCCACCCG
CTGATCATCA TGACCGCCGT GCCCCTGGCC ATTACCGGCG CCCTCGGCGC GCTCCTGCTC
AGCGGCGGCA GCCTGAACCT CTACAGCCAG ATCGGTATGA TCCTGCTCAT CGGCCTGATG
ACCAAGAACG GGATCCTGAT CGTCGAGTTC GCCAATCAGC TCCGGGACCA GGGCTACACC
GTGCGCGAGG CGATCCACGC CGGGGCCGCC CTGCGCTTCC GCCCGGTGTT GATGACCGCC
GTCTCCACGG TCTTCGGGGC GGTCCCCCTG GTCCTGGCCC TGGGTGCCGG GGCGGAGAGC
CGGGCTGCCA TCGGTACGGT GATCATCGGC GGGATGGGAT TCGCTACCCT GCTGACGCTG
TTCGTCATCC CGGTGCTTTA TGATTGGCTG GCGCGGTATA CCACGCCGGT CAACGTGGTC
GGAGCCGAAT TGGAGCGCCT GGAGCACGCC TCGGGTCCGC CTCGAGGCAG GGTCTGA
 
Protein sequence
MILSDIAIRR PVLATVASVL IVVLGIASLA QLPVREYPDI DPPLVSIETR YTGAAPAVID 
TEITERIESA VSSVDGIRSM TSESRDGRGE TNIEFELGRD IDAAANDVRD AIGRILDDLP
VDADPPVVAK TEADARPMMW LSLSSSRMTP EEMTDYADRQ LVDRLSVLDG VARVRIGGER
RYAMRVWLDS DALAARDLTV EDVENALRRE NIELPAGEIE STARQLTVRT DTRLRDRDAF
ARLVIERRGD ELVRLEEVAR VERGVEDDDT AVRLNGETAV GLGVIRQSRA NTIAVADAVL
EEMARIEEQL PGDLRLVVGY DESQFVRQSI REVVRTLLIA VALVVLTIFI FLRSLRATLI
PSVTIPVAVI GAFTVMAPLG FSLNVLTLLA LILAIGLVVD DAIVVLENIQ RRIDEGEPPL
LAAYRGVRQV GFAVIATTIS LVAVFVPIAF MEGNVGRLFT EFGLVLAAAV VFSSFVALTL
TGMLCSKWLR PRAEHPGRLQ VATERVLTGL TLGYRRLLGR ALGMPIAVLG VGVAAAVGAY
ALYQALPQEL IPTEDRGVFI VPASAPEGST AAHTDASVRE LEAILRPLRE DTGEARRVLT
ILGFGGRANS AFIIVGLEDW AERSRSQQQI VAETMPRLMT VPGVRAFAVN PPGLGQSGFQ
QPVQFVIGGT DYHEVADWAE RVLDRARADN PRLLNLDSSY DATRPQLNVR IDRDRAADLG
VSAEALGRTV QTLLASRTVT AYLDRGREYD VMLQAEAGDR RYPADLDRLH VRSLNDGALI
PLSALVTLEE EGAPPVLSRV DRLPAVTLTA SLAPGYDLGA ALAYLEQVAA EELPATARVS
YLGLSDEFKR AGGAVFLTFG LALLIVFLVL SAQFESFIHP LIIMTAVPLA ITGALGALLL
SGGSLNLYSQ IGMILLIGLM TKNGILIVEF ANQLRDQGYT VREAIHAGAA LRFRPVLMTA
VSTVFGAVPL VLALGAGAES RAAIGTVIIG GMGFATLLTL FVIPVLYDWL ARYTTPVNVV
GAELERLEHA SGPPRGRV