Gene Haur_3692 details

Gene Information

Locus tag: Haur_3692
Symbol:
ID: 5735541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4644463
End bp: 4645848
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 52%
IMG OID: 641280844
Product: MATE efflux family protein
Protein accession: YP_001546456
Protein GI: 159900209
COG category: [V] Defense mechanisms
COG ID: [COG0534] Na+-driven multidrug efflux pump
TIGRFAM ID: [TIGR00797] putative efflux protein, MATE family
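
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that agrees with the reported 1386 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4644463, 4645848)
print(length, protein_length_aa(length))  # 1386 461
```

Both values match the Gene Length and Protein Length fields of this record.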


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000025823
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATTC CATTAGGCGC AGCACAGGGG CGAAGCAGCC AACGGAGTGC TGTATTAAAG 
TTGGGCTTGC CAGCAGTTGG CGAGCAATTA TTGAGTTTGA TGGTTGGGTT GGTTGATACC
TATGTGGTGG GACACTTGAG CTTAGCCGTC GCAACTGCCA ATGGCTACGA TCGCCAAATC
GCCTTGGCAG CGACCGGCAT TTCGAGCCAA GTCACATGGA CATTAATCAC CTTTTTTATG
GCAGTAGCCC TCGGTAGCAC GGTGGTTATT GCGCGGTTTG TGGGGGCAGG CGAGCGCGAA
CAAGCCAACC AAACCCTGCG CCAAGCCCTG CTAATTGGGC TAGCCATGGG CCTGCTGAGT
TTATTTTTGG CCTATAGCTT TGCCCCTCAA CTGATGGATT TACTCGGCGC AAACGAGCAA
GTGCGCCAAT ATGGGGCTGG CTATTTGCGT ATTTCCGCCT TATCAATGCC CTTAATGGCC
ATGCTTTACG TGGGCAATGC CGCCTTACGT GGCTCCGGCG ATACGCGCAC CCCACTCAAG
GTTATGCTGG TCGTCAATGG GATCAACGCA GGGTTATCGT TGCTCTTGGT CAATGGCTAT
TTTGGTTTTC CGGCGATGGG GATTAATGGG GCAGCATTTG CCGCGATGAG TGGGCAAGGC
ATCGGTGGCT TAATGGTGCT TGCAACACTG ATTCGTGGCC GTTCAGGCTT GAAGCTTGAT
CAAATTCCAC GCCCAGATGG CAATTTGATC TGGCGGATTT TACGCCAAGG GCTGCCATAT
GGGGCTGAGC AATTTATTTT TCAGGCCGCA TTATTAATTT TTATCCATTT GATCAACGAT
ATTGGCACGG CGGCTTATGC TGCGCATAAC ACCATTATCA CGATTGAAAG TATTTCGTTT
TTGCCAGGCA TGGGCTTGGC GGTAGCCGCC ACAACCTTAG TCGGCCAACA TATGGGAGCA
AATCAGCCAC AACAAGCTAG CGAAAGTGGC TTTGAGGCAT TTCGGCTGGG AGCACTCTTC
ATGGGGGCAA TTGGCTTATT GTTTGTAGTT GCGCCAGAAG TCTTTTTGCG CTTTTTCGTT
GCTGACGAAG AGGTAGTGCA ACTCGCCGCC TTACCGTTGC GCATGGTTGG GTTTGCTCAG
CCCGCTTTGG CCGCTAATTT CATCTTCAGC GGCAGTTTAC GTGGTGGTGG CGAGCCAAAA
TGGCCACTGA TTAGCAAAAT GCTGAGTGTT TGGTGTGTCC GCTTACCGCT GGCATGGCTG
CTTGTCAAGC ACTTCGACCT TGGCTTGAAT GGCATTTGGC TGGCAATGTG TACCGATTTT
GCCGTCCAAG GCAGCTTGGC ATGGTGGCGC TTCCGACAAG GCAAATGGCA AAGTGCAAAA
GTTTAG
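
The gene sequence is laid out in blocks of 10 bases, so it has to be stripped of whitespace before any computation. As a minimal sketch, the helpers below (hypothetical names, standard library only) clean such a block and compute its GC percentage, which can be compared against the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed (6 blocks of 10 bases).
first_row = "ATGGCAATTC CATTAGGCGC AGCACAGGGG CGAAGCAGCC AACGGAGTGC TGTATTAAAG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full 1386-base sequence to `gc_percent` gives the value to compare with the reported 52%.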
 
Protein sequence
MAIPLGAAQG RSSQRSAVLK LGLPAVGEQL LSLMVGLVDT YVVGHLSLAV ATANGYDRQI 
ALAATGISSQ VTWTLITFFM AVALGSTVVI ARFVGAGERE QANQTLRQAL LIGLAMGLLS
LFLAYSFAPQ LMDLLGANEQ VRQYGAGYLR ISALSMPLMA MLYVGNAALR GSGDTRTPLK
VMLVVNGINA GLSLLLVNGY FGFPAMGING AAFAAMSGQG IGGLMVLATL IRGRSGLKLD
QIPRPDGNLI WRILRQGLPY GAEQFIFQAA LLIFIHLIND IGTAAYAAHN TIITIESISF
LPGMGLAVAA TTLVGQHMGA NQPQQASESG FEAFRLGALF MGAIGLLFVV APEVFLRFFV
ADEEVVQLAA LPLRMVGFAQ PALAANFIFS GSLRGGGEPK WPLISKMLSV WCVRLPLAWL
LVKHFDLGLN GIWLAMCTDF AVQGSLAWWR FRQGKWQSAK V
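
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the NCBI ordering of bases (`TCAG`) and amino acids. Translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as starts. This gene begins with ATG, so no special start handling is included here, which is a simplifying assumption. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and the last two codons of the gene sequence.
print(translate("ATGGCAATTC CATTAGGCGC A"))  # MAIPLGA
print(translate("GTTTAG"))                   # V
```

The two printed fragments match the start (`MAIPLGA`) and the final residue (`V`) of the protein sequence listed above.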