Gene Hlac_0078 details


Gene Information

Locus tag: Hlac_0078
Symbol:
ID: 7401433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 82260
End bp: 84206
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 64%
IMG OID: 643707139
Product: ABC transporter related
Protein accession: YP_002564754
Protein GI: 222478517
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
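
The Gene Length and Protein Length values agree with the Start bp and End bp coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(82260, 84206)
print(length_bp)                  # 1947
print(protein_length(length_bp))  # 648
```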


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.213516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGCG CCGAATGGGA GGACGACGAC CCCTTCGAGG AACAGCGGGA CAAGATCGAG 
AACCCGATGA AACGACTGTT CCTCGCGTAC GGTCGCGACT ACCTCCCGCA GGTCTCCGTC
GGTATCTTCG CCAGCGTCTT CGCGCGGCTC CTCGACCTCC TCCCGCCGCT CATGCTCGGG
ATCGCCATCG ACGCCGTGTT CTACGAGGAC GCGCTGTTCA GCGAGCAGAT CCCGCTCGTG
ATCCTCCCCG ACGCGTGGCT CCCGACGGGA CAGACCGAAC AGTTCTGGTT CACTATCGCC
GTCATCGGGG CCGCGTTCGG TATCGGGGCC GGCTTCCACT GGATCCGGAA CTGGGGGTTC
AACGCCTTCG CGCAGAACAT CCAGCACGAC GTTCGGACCG ACACCTACGA CAAGATGCAG
CGGCTCAACA TGGAGTTCTT CTCCGACAAG CAGACGGGGG AGATGATGTC CATTCTCTCG
AACGACGTGA ACCGGCTCGA ACGATTCTTA AACGACGGAA TGAACTCGCT GTTCCGCCTC
TCCGTGATGG TCGTCGGGAT CGGCGTGCTG CTGTTCTGGA TAAACTGGCA GCTCGCGTTG
GTCGCGCTCC TCCCGGTCCC GATCATCGGC GGATTCACCT ATCTGTTCAT CAAGATCATC
CAGCCGAAGT ACGCTGAGGT GCGCTCGTCG GTCGGGAAGA TGAACTCTCG GCTGGAGAAC
AACCTCGGCG GCATTCAGGT GATCAAGTCG TCGAACACCG AACCGTACGA GTCCGACCGC
GTCGACGACG TCTCGATGGA CTACTTCGAC GCCAACTGGG ACGCGATCAC GACCCGGATC
AAGTTCTTCC CGGCGCTCCG CGTGCTCGCC GGTATCGGCT TCGTGCTCAC GTTTGTCATC
GGTGGGCTCT GGGTGTTTCA GGACACTCCG CCGGGCCCGT TCACCGGCGA CCTCTCGGTC
GGGATGTTCG TCGTCTTCAT CCTCTACACC CAGCGGTTCA TCTGGCCGAT GGCGCAGTTC
GGGCAGATCA TCAACATGTA CCAGCGCGCC CGCGCCTCCT CCGCGCGGAT CTTCGGGCTG
ATGGACGAGC CGTCGCGACT CGCCGAGGAC CCCGACGCCG AGGATCTGAC CGTCGGCGAC
GGCGACGTGG TCTACGACGA CGTGAGCTTC GGCTACGACG AGGAGACCAT CGTCTCCGAC
ATCGACTTCG CGGTCGAGGG CGGCGAGACG CTCGCCCTCG TCGGTCCGAC CGGTGCCGGG
AAATCGACGG TCCTCAAGCT GCTGCTCCGG ATGTACGATG TCGATGAGGG ATCGATCCGG
ATCGACGGGC AGGACGTGCG CGACGTGACG CTCAAATCGC TCCGCCGCTC GATCGGCTAC
GTCGGGCAGT CGTCGTACCT CTTCTACGGC ACCATCCGCG AGAACATCAC CTACGGCACC
TTCGAGGCGA CCGACGAGGA GGTCCGCGAG GCTGCGGAGG CCGCGGAGGC CCACGAGTTC
ATCAAGAACC TCCCGGAGGG CTACGACACC ATGGTCGGCG AGCGCGGCGT GAAGCTCTCC
GGCGGGCAGC GCCAGCGCGT CACCATCGCG CGAGCGGTGC TGAAGGACCC GGACCTCCTC
ATCCTCGACG AGGCGACCTC GGACGTGGAC ACCGAGACGG AGATGCTGAT CCAGCGCTCG
CTCGACCGGC TCACCGCCGA CCGTACCACG TTCGCGATCG CGCACCGCCT CTCGACGATC
AAAGACGCCG ACACCATCCT CGTGTTAGAG GGCGGCGAGA TCGCCGAGCG CGGCACCCAC
GACGAGCTGT TAGACAACAG CGGACTGTAC GCGCATCTCT GGGGCGTGCA GGCCGGCGAG
ATCGACGAAC TGCCACAGGA GTTCATCGAC CGCGCGCAGG AGCGCACTGC GCGCCTGGTC
GAGGACGCCG AGAGCGACGA CGACTGA
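
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is supplied as text in the space-separated 10-base blocks used here; the helper name `gc_content` is hypothetical. It is shown on the first two blocks only, so its output is not the whole-gene value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAGTAGCG CCGAATGGGA"))  # 55.0
```

Passing the full gene sequence instead of the two blocks gives a value to compare with the reported GC content.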
 
Protein sequence
MSSAEWEDDD PFEEQRDKIE NPMKRLFLAY GRDYLPQVSV GIFASVFARL LDLLPPLMLG 
IAIDAVFYED ALFSEQIPLV ILPDAWLPTG QTEQFWFTIA VIGAAFGIGA GFHWIRNWGF
NAFAQNIQHD VRTDTYDKMQ RLNMEFFSDK QTGEMMSILS NDVNRLERFL NDGMNSLFRL
SVMVVGIGVL LFWINWQLAL VALLPVPIIG GFTYLFIKII QPKYAEVRSS VGKMNSRLEN
NLGGIQVIKS SNTEPYESDR VDDVSMDYFD ANWDAITTRI KFFPALRVLA GIGFVLTFVI
GGLWVFQDTP PGPFTGDLSV GMFVVFILYT QRFIWPMAQF GQIINMYQRA RASSARIFGL
MDEPSRLAED PDAEDLTVGD GDVVYDDVSF GYDEETIVSD IDFAVEGGET LALVGPTGAG
KSTVLKLLLR MYDVDEGSIR IDGQDVRDVT LKSLRRSIGY VGQSSYLFYG TIRENITYGT
FEATDEEVRE AAEAAEAHEF IKNLPEGYDT MVGERGVKLS GGQRQRVTIA RAVLKDPDLL
ILDEATSDVD TETEMLIQRS LDRLTADRTT FAIAHRLSTI KDADTILVLE GGEIAERGTH
DELLDNSGLY AHLWGVQAGE IDELPQEFID RAQERTARLV EDAESDDD
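
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below assumes the sense-codon assignments of translation table 11, which are the same as those of the standard genetic code; the two tables differ only in which start codons they allow, and this sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGTAGCG CCGAATGGGA GGACGACGAC"))  # MSSAEWEDDD
```

The ten residues produced match the start of the protein sequence listed above.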