Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1038 |
Symbol | |
ID | 7400109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1030622 |
End bp | 1032919 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643708105 |
Product | ATP-dependent protease Lon |
Protein accession | YP_002565705 |
Protein GI | 222479468 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG AAAAGGACGC GAACGACCCG CCGACCGAGG ACCCCGAGGA GCCGGACGAG TCCGGCGTCG GGACCCCGGA TCCGACGGGG AACGGCGAGG ACGCCCGCGA CGACGGCGCC CGCGCGCCGG CGGACGACGG CGTCGACGAG CCCGACGATA CCGGCGACCC GGCCTCCGAT GAGGAGGCGG TCGCCGACGA TCCCACCGAC CCCGACAATT CTGACGAAGT CTGGGACGAC GGCATCGTCG TCGACGACGG ACCCGGGCCG GACGACGCCG GGGCGGGCCG CGAGGAAGCG ACCGGCGCCA CCTCCGAGGG TGTCACCGAA GCAAACGAGG GGGGCGACAT CGACGAGCTC GGCAGCTCAG TCGAGGTCGA AGGCGCCGAT ATTGACGAAG ATCCCGACGA GGACGACCTG CTTGGCGGAC TCAAGATCGA TACGACTGCC GAGCTTGAGA TCCCCGACCG ACTCGTCGAC CAAGTCATCG GACAGGAGCA CGCCCGCGAC GTGATCATCA AGGCGGCCAA ACAGCGCCGC CACGTGATGA TGATCGGTTC GCCCGGGACC GGGAAGTCGA TGCTCGCGAA GGCGATGTCC GAGCTGCTCC CGAAAGAGGA GCTCCAGGAC GTGCTGGTCT ACCACAACCC GGACGACGGT AACAAACCGA AGGTCCGGAC GGTGCCCGCC GGCAAGGGCG ACCAGATCGT CGACGCGCAC CGCGAAGAGG CGCGCAAGCG CAACCAGATG CGGTCGCTGT TGATGTGGAT CATCATCGCC GTCGTGTTGG GCTACGCGCT CATCATCGTC GGCCAGATCC TCGTCGGCAT CATCGCCGCC GGGGTCGTCT ACCTCGTCTT CCGCTACCTG AACCGCGGGT CGGACGCGAT GATCCCGAAC CTGCTGGTGA ACAACGGCGA CACGAAGACT GCGCCGTTCC GCGACGCGAC CGGCGCGCAC GCCGGCGCGC TGCTCGGCGA CGTCCGGCAC GACCCGTTCC AGTCCGGTGG GATGGAGACG CCCAGCCACG ACCGCGTCGA GGCGGGCGCC ATCCACAAGG CGAACAAGGG CGTGCTGTTC ATCGACGAGA TCAACACGCT CGACATCCGG AGCCAGCAGC ACCTCATGAC GGCGATCCAG GAGGGCGAAT TCTCGATCAC GGGCCAGTCC GAGCGCTCCT CGGGTGCGAT GGTCCAGACC GAGCCCGTCC CGACCGACTT CGTCATGATC GCGGCCGGGA ACCTCGACGC GATGGAGAAC ATGCACCCGG CGCTGCGGAG CCGTATCAAG GGGTACGGTT ACGAGGTGTA CATGGAGGAC ACCATCGAGG ACACCCCAGA GATGCGTCGG AAGTACGTTC GCTTCATCGC TCAGGAGGTC GCGAAGGACG GTCGCCTGCC GGAGTTCTCG GCCGACGCTA TCGAGGAGGT CATCCTCGAA GCCAAGCGTC GCTCCGGCCG GAAGGGCCAC CTCACCCTCC TGTTCCGGAA CCTCGGCGGA CTCGTCCGCG TCGCCGGCGA CATTGCCCGC GGCGAGGACG CAGAGCTGAC CACCCGCGAG CACGTGCTGC AGGCGAAGGG ACGCTCGCGC TCCATCGAAC AGCAGCTCGC GGACGACTTC ATCGAGCGCC GGAAGGACTA CGAGCTGCAG GTCTCCGACG GCTACGTCGT CGGCCGTGTC AACGGCCTCG CCGTGATGGG CGAGGACTCC GGGATCATGC TCCCGGTGAT GGCCGAGGTC GCGCCCTCGC AGGGGCCCGG CGAGGTCATC GCCACGGGTC AGCTGAAGGA GATGGCCCAA GAGTCGGTGT CGAACGTCTC CGCCATCATC AAGAAGTTCT CCGACGAGAA CATCTCGGAG ATGGACATTC ACATCCAGTT CGTGCAGGCG GGTCAGCAGG GCGTCGACGG CGACTCCGCG TCCATCACGG TCGCGACCGC CGTCATCTCT GCGCTGGAGA ACGTGGGCGT CGACCAGAGC CTCGCGATGA CGGGATCGCT GTCGGTGCGG GGCGACGTGC TCCCCGTCGG CGGCGTCACC CACAAGATCG AGGCGGCCGC GAAGGCCGGC TGCACCCGAG TCATCATCCC GCAGGCGAAC GAGCAGGACG TGATGATCGA AGACGAGTAC AAAGACATGA TCGAGGTCAT TCCGGTCTCG CACATCAGCG AGGTCCTCGA CATCGCCCTG GAGGGCGAAG CCGAGAAGGA CTCGCTCGTC TCCCGGCTCA AGTCGATCAC CGGCTCGGCG CTGAAGGAGG GAGGCGTCTC CGGTCCCTCC AGCCCGAGCC CGCAGTAA
|
Protein sequence | MSNEKDANDP PTEDPEEPDE SGVGTPDPTG NGEDARDDGA RAPADDGVDE PDDTGDPASD EEAVADDPTD PDNSDEVWDD GIVVDDGPGP DDAGAGREEA TGATSEGVTE ANEGGDIDEL GSSVEVEGAD IDEDPDEDDL LGGLKIDTTA ELEIPDRLVD QVIGQEHARD VIIKAAKQRR HVMMIGSPGT GKSMLAKAMS ELLPKEELQD VLVYHNPDDG NKPKVRTVPA GKGDQIVDAH REEARKRNQM RSLLMWIIIA VVLGYALIIV GQILVGIIAA GVVYLVFRYL NRGSDAMIPN LLVNNGDTKT APFRDATGAH AGALLGDVRH DPFQSGGMET PSHDRVEAGA IHKANKGVLF IDEINTLDIR SQQHLMTAIQ EGEFSITGQS ERSSGAMVQT EPVPTDFVMI AAGNLDAMEN MHPALRSRIK GYGYEVYMED TIEDTPEMRR KYVRFIAQEV AKDGRLPEFS ADAIEEVILE AKRRSGRKGH LTLLFRNLGG LVRVAGDIAR GEDAELTTRE HVLQAKGRSR SIEQQLADDF IERRKDYELQ VSDGYVVGRV NGLAVMGEDS GIMLPVMAEV APSQGPGEVI ATGQLKEMAQ ESVSNVSAII KKFSDENISE MDIHIQFVQA GQQGVDGDSA SITVATAVIS ALENVGVDQS LAMTGSLSVR GDVLPVGGVT HKIEAAAKAG CTRVIIPQAN EQDVMIEDEY KDMIEVIPVS HISEVLDIAL EGEAEKDSLV SRLKSITGSA LKEGGVSGPS SPSPQ
|
| |