Gene Hlac_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2100 
Symbol 
ID7400620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2090787 
End bp2092262 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content73% 
IMG OID643709170 
ProductMATE efflux family protein 
Protein accessionYP_002566747 
Protein GI222480510 
COG category[V] Defense mechanisms 
COG ID[COG0534] Na+-driven multidrug efflux pump 
TIGRFAM ID[TIGR00797] putative efflux protein, MATE family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.114433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACG TCGACCGCGA GGCGATCACG GACGGGCCGG TGGGGCGAGT GCTCCTGCTA 
CTCGCGGCCC CGCTCGTCGC GCAGAACCTC GTGTACGTCG CGAACGCCCT CGTCGACACG
TTCTGGCTCG GCCGGGTGAG CGAGGACGCG GTCGCCGCGG TCGGGCTCAG CCTCCCGATC
CAGTCGCTTT TGGGCGCGGC GGTGGTCATC GCCGCCGTCG GGACGCAGAT CCTCGTCGCG
CAGCGCGCCG GCGGCGAGGA TGGTTCGGGC GCGCGACGGG TCGCGGTCAA CGGCGCGCTC
GTCGCGCTCG CGGTCACCGG AGCGATCGCG GTCCCCGTCG TCGCGTATCC GGAGGAGCTG
GTTCGACTCC TCGGCGCGGA CCCGGCGCTC GCCGGCACCA CCGCGACGTA CCTCGCGATT
ATCGTCGCCG TGCTCCCGGT CGGCGCGGTC GGCGACACGG TGGAGAACTG CTTCACCGCC
TACGGCGACA CGCGGGCCGT CCTCCACGTG AGCGTCGTCA GCGTCCTCGT CAACCTCGTT
GCGGCGCCCG CGCTCATCTT CGGCGTCGGA CCGGTCCCGG AACTCGGCGT CGCGGGCGCG
GCCCTCGGTA CCGTACTCTC CGGCATTGTC GGATTCGTTC ACATACTCGC GTACGCCGCG
GGGATCGGGC GGGACACGTT CCGGCTCACC CGCGACGCGT TCGCCGTCGA TCTCTCTGTC
GTGCGCGAGG TCGTGGCAGT CGGGCTCCCC CTCGGTGGCC AGCGCGGAGT GAGCGAACTG
GTCCGCGTGC TCGTGGTGAG CCTCGTAGCG ATCGCCGGCG GCGCGGCCGG GGTCGCGGCG
TACACGGTCG GCGCGCGCGT CGCCACGCTC GCGGTCGTCC CCGCACTCGG GATGCAGCAG
GCGGCCCAGT CGATGATCGG CCAGAACCTC GGCGCGGACG CCCCGCACCG CGCCCGGCGG
ACGACGATGG TCGGCACCAA ACTGGTCGTC GTCGGCTTCC TCGCGCTCGG CGCCGTCCAG
TTCCTGTTCG CCGGCGCGAT CGCCGACCTC CTCGCGCCCG ACCTCACCGC GACCGGCCGG
TCGCTATCGG TGTTGTATCT ACGGATTCTG GCGGTCACCT ACTGGGCGCT CGGCGGGACC
TACACCCTGC TCGCCGGCTT CAACGGCGCT TCGCGCACCC GAACCTCGTT CGTCGCGGAT
CTGATCAAGT ACTGGGCGAT TCGGTTCCCC ATCGCCGTCG CCGCGGTGCC CGCCACCGCG
ACGTTCGGGG CGTTCGGGGC GTTCGGCGTC GCGGTCGCGC CCGGACTCGG CTGGGGAGTT
GAGGCCGTCT TCTGGGCGGT CGCCGCCTCG AACGTCGTCG GCTTCCTCGG ACTGGGCGCG
TACTTCTGGT ACACGACGCG GCGGGGGATG TTCGCGAACG CCGCCGAGCG CGCGAGCGGC
GGGGACGGTG CGGGCGCGGA CCCGGCGGAC GACTGA
 
Protein sequence
MLDVDREAIT DGPVGRVLLL LAAPLVAQNL VYVANALVDT FWLGRVSEDA VAAVGLSLPI 
QSLLGAAVVI AAVGTQILVA QRAGGEDGSG ARRVAVNGAL VALAVTGAIA VPVVAYPEEL
VRLLGADPAL AGTTATYLAI IVAVLPVGAV GDTVENCFTA YGDTRAVLHV SVVSVLVNLV
AAPALIFGVG PVPELGVAGA ALGTVLSGIV GFVHILAYAA GIGRDTFRLT RDAFAVDLSV
VREVVAVGLP LGGQRGVSEL VRVLVVSLVA IAGGAAGVAA YTVGARVATL AVVPALGMQQ
AAQSMIGQNL GADAPHRARR TTMVGTKLVV VGFLALGAVQ FLFAGAIADL LAPDLTATGR
SLSVLYLRIL AVTYWALGGT YTLLAGFNGA SRTRTSFVAD LIKYWAIRFP IAVAAVPATA
TFGAFGAFGV AVAPGLGWGV EAVFWAVAAS NVVGFLGLGA YFWYTTRRGM FANAAERASG
GDGAGADPAD D