Gene Hlac_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1038 
Symbol 
ID7400109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1030622 
End bp1032919 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content67% 
IMG OID643708105 
ProductATP-dependent protease Lon 
Protein accessionYP_002565705 
Protein GI222479468 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG AAAAGGACGC GAACGACCCG CCGACCGAGG ACCCCGAGGA GCCGGACGAG 
TCCGGCGTCG GGACCCCGGA TCCGACGGGG AACGGCGAGG ACGCCCGCGA CGACGGCGCC
CGCGCGCCGG CGGACGACGG CGTCGACGAG CCCGACGATA CCGGCGACCC GGCCTCCGAT
GAGGAGGCGG TCGCCGACGA TCCCACCGAC CCCGACAATT CTGACGAAGT CTGGGACGAC
GGCATCGTCG TCGACGACGG ACCCGGGCCG GACGACGCCG GGGCGGGCCG CGAGGAAGCG
ACCGGCGCCA CCTCCGAGGG TGTCACCGAA GCAAACGAGG GGGGCGACAT CGACGAGCTC
GGCAGCTCAG TCGAGGTCGA AGGCGCCGAT ATTGACGAAG ATCCCGACGA GGACGACCTG
CTTGGCGGAC TCAAGATCGA TACGACTGCC GAGCTTGAGA TCCCCGACCG ACTCGTCGAC
CAAGTCATCG GACAGGAGCA CGCCCGCGAC GTGATCATCA AGGCGGCCAA ACAGCGCCGC
CACGTGATGA TGATCGGTTC GCCCGGGACC GGGAAGTCGA TGCTCGCGAA GGCGATGTCC
GAGCTGCTCC CGAAAGAGGA GCTCCAGGAC GTGCTGGTCT ACCACAACCC GGACGACGGT
AACAAACCGA AGGTCCGGAC GGTGCCCGCC GGCAAGGGCG ACCAGATCGT CGACGCGCAC
CGCGAAGAGG CGCGCAAGCG CAACCAGATG CGGTCGCTGT TGATGTGGAT CATCATCGCC
GTCGTGTTGG GCTACGCGCT CATCATCGTC GGCCAGATCC TCGTCGGCAT CATCGCCGCC
GGGGTCGTCT ACCTCGTCTT CCGCTACCTG AACCGCGGGT CGGACGCGAT GATCCCGAAC
CTGCTGGTGA ACAACGGCGA CACGAAGACT GCGCCGTTCC GCGACGCGAC CGGCGCGCAC
GCCGGCGCGC TGCTCGGCGA CGTCCGGCAC GACCCGTTCC AGTCCGGTGG GATGGAGACG
CCCAGCCACG ACCGCGTCGA GGCGGGCGCC ATCCACAAGG CGAACAAGGG CGTGCTGTTC
ATCGACGAGA TCAACACGCT CGACATCCGG AGCCAGCAGC ACCTCATGAC GGCGATCCAG
GAGGGCGAAT TCTCGATCAC GGGCCAGTCC GAGCGCTCCT CGGGTGCGAT GGTCCAGACC
GAGCCCGTCC CGACCGACTT CGTCATGATC GCGGCCGGGA ACCTCGACGC GATGGAGAAC
ATGCACCCGG CGCTGCGGAG CCGTATCAAG GGGTACGGTT ACGAGGTGTA CATGGAGGAC
ACCATCGAGG ACACCCCAGA GATGCGTCGG AAGTACGTTC GCTTCATCGC TCAGGAGGTC
GCGAAGGACG GTCGCCTGCC GGAGTTCTCG GCCGACGCTA TCGAGGAGGT CATCCTCGAA
GCCAAGCGTC GCTCCGGCCG GAAGGGCCAC CTCACCCTCC TGTTCCGGAA CCTCGGCGGA
CTCGTCCGCG TCGCCGGCGA CATTGCCCGC GGCGAGGACG CAGAGCTGAC CACCCGCGAG
CACGTGCTGC AGGCGAAGGG ACGCTCGCGC TCCATCGAAC AGCAGCTCGC GGACGACTTC
ATCGAGCGCC GGAAGGACTA CGAGCTGCAG GTCTCCGACG GCTACGTCGT CGGCCGTGTC
AACGGCCTCG CCGTGATGGG CGAGGACTCC GGGATCATGC TCCCGGTGAT GGCCGAGGTC
GCGCCCTCGC AGGGGCCCGG CGAGGTCATC GCCACGGGTC AGCTGAAGGA GATGGCCCAA
GAGTCGGTGT CGAACGTCTC CGCCATCATC AAGAAGTTCT CCGACGAGAA CATCTCGGAG
ATGGACATTC ACATCCAGTT CGTGCAGGCG GGTCAGCAGG GCGTCGACGG CGACTCCGCG
TCCATCACGG TCGCGACCGC CGTCATCTCT GCGCTGGAGA ACGTGGGCGT CGACCAGAGC
CTCGCGATGA CGGGATCGCT GTCGGTGCGG GGCGACGTGC TCCCCGTCGG CGGCGTCACC
CACAAGATCG AGGCGGCCGC GAAGGCCGGC TGCACCCGAG TCATCATCCC GCAGGCGAAC
GAGCAGGACG TGATGATCGA AGACGAGTAC AAAGACATGA TCGAGGTCAT TCCGGTCTCG
CACATCAGCG AGGTCCTCGA CATCGCCCTG GAGGGCGAAG CCGAGAAGGA CTCGCTCGTC
TCCCGGCTCA AGTCGATCAC CGGCTCGGCG CTGAAGGAGG GAGGCGTCTC CGGTCCCTCC
AGCCCGAGCC CGCAGTAA
 
Protein sequence
MSNEKDANDP PTEDPEEPDE SGVGTPDPTG NGEDARDDGA RAPADDGVDE PDDTGDPASD 
EEAVADDPTD PDNSDEVWDD GIVVDDGPGP DDAGAGREEA TGATSEGVTE ANEGGDIDEL
GSSVEVEGAD IDEDPDEDDL LGGLKIDTTA ELEIPDRLVD QVIGQEHARD VIIKAAKQRR
HVMMIGSPGT GKSMLAKAMS ELLPKEELQD VLVYHNPDDG NKPKVRTVPA GKGDQIVDAH
REEARKRNQM RSLLMWIIIA VVLGYALIIV GQILVGIIAA GVVYLVFRYL NRGSDAMIPN
LLVNNGDTKT APFRDATGAH AGALLGDVRH DPFQSGGMET PSHDRVEAGA IHKANKGVLF
IDEINTLDIR SQQHLMTAIQ EGEFSITGQS ERSSGAMVQT EPVPTDFVMI AAGNLDAMEN
MHPALRSRIK GYGYEVYMED TIEDTPEMRR KYVRFIAQEV AKDGRLPEFS ADAIEEVILE
AKRRSGRKGH LTLLFRNLGG LVRVAGDIAR GEDAELTTRE HVLQAKGRSR SIEQQLADDF
IERRKDYELQ VSDGYVVGRV NGLAVMGEDS GIMLPVMAEV APSQGPGEVI ATGQLKEMAQ
ESVSNVSAII KKFSDENISE MDIHIQFVQA GQQGVDGDSA SITVATAVIS ALENVGVDQS
LAMTGSLSVR GDVLPVGGVT HKIEAAAKAG CTRVIIPQAN EQDVMIEDEY KDMIEVIPVS
HISEVLDIAL EGEAEKDSLV SRLKSITGSA LKEGGVSGPS SPSPQ