Gene Information |
Locus tag | Hlac_1628 |
Symbol | |
ID | 7399577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1648534 |
End bp | 1652181 |
Gene Length | 3648 bp |
Protein Length | 1215 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643708694 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002566283 |
Protein GI | 222480046 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
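The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and are not fields of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 1648534
end_bp = 1652181
gene_length_bp = 3648
protein_length_aa = 1215

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 3648 0 1215
```

Under these assumptions the computed span matches the listed gene length, the span is a whole number of codons, and the expected protein length matches the listed 1215 aa.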
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.279329 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCCCT CGTCTCCCTT TTCTACCGCG ACTGCCGCCA CGCTCGTTCT TGTCGCCTGT CTGATCGTGG CCGCCGTCGC ACCCGGTGCG GTCGCTGGCG GTTTGGTTAC GAGCAGCACT GAGGCGGTTG ATTCCGGCAG AACCGATCCC GCCGCGGTCG GTTCCGTTTC GACGGCCGAC GGCGATGTGA TCGCCTCCGA CCTGCGCTCG GCGAACGGCA CCGTCAAGCT CGTGGTGCGG TTCGCGGGCG GCACTCGGAT CGGAACCGAT GACGGCGAAG TGGGGTATTC GAAGGGGTCG TCGACGCTCT CGACGAACGA CCTCAAAACG AACGCCGCGA GCGCGCAGGC CGACTTCGAA TCGTTCGCTG AGGGACGGTC CGCGGTCACG GTCGAACGAA GCTTCTGGCT CGCGAACGCG ATGCTCGTCA CGGTCGACAC CGACCGCGTC CCGCTGGATC GGCTCGTTGA TGTGCCGGGC GTCGAGGGCG TCCACGAGAA CTTCGAAGTC GAACTCGACT CGGCAACAAC GACGACTCCC GGCGACGGTG GTTCGCAGGC GATCGGGACC TCCGGTCTAC CGCCCGCGCC TACTCCCGAG GACGTCTCGA CCGCTTCGAC CGACACCGAC GCCACCTACG GCGTCGAGAT GGTGCGCGCG CCGGAGGTCT GGGAGACGTT CGGGACCCGC GGGAAGGGCG CGACCGTCGC CGTGATCGAC ACGGGAATTG ACCCGGACCA TCCTGACCTA ACGGTGAGCG GGTGGGCGGA GTACGACGCG GACGGGAATC TGGTGAGCGA CGACGTGTCC GATGCGTCGG ACGGAGACGG GCACGGAACT CACGTCGCCG GAACCGTCGC GGGCGGAAAC GCGAGTGGGA CCGCAATCGG CGTCGCACCG AACGCGTCGC TCCACGGGAT CAAGGTGTTC GACGACGACG GGACCAACGC GACGTTCGTC CGCGTCGTCG CGGGAATGGA ACACGCGACG CAGGATCCGG ACGTCGACGT GCTTCAGATG AGCCTCGGTG CGGACGGGCA CTTACCATAC TTCATCGAAC CGGTTCGGAA CACTCGCAGT GCCGGGAAGA TCGCGGTCGT TTCGGCCGGG AACATCGGTC AAGGAACGTC GAGTTCTCCC GGGAACGTCT ACGACTCGCT CGCGGTCGGA GCGGTCAACG ACAGCCGCGG CGTCGCCGAC TTCTCCAGCG GAGAGACGAT CAACACGTCA AGCGCGTGGG GAAGTGATGC CCCCGCGGAC TGGCCCGACG AGTACGTGGT TCCGGACGTG TCGGCGCCAG GCGTGAGCGT ATACTCGGCG GAACCGGGCG GTACCTACAT CCGAAAGGAC GGCACCTCCA TGGCCGCACC GCACGTCAGC GGCGTCGCGG CGTTGATGCT GTCGGCGTCG TCGGGCGATG TTTCTGACGA CAAGCTCTAC GACACGCTCC GTAATACCGC GAACCACCCG TCGAACGCGG CCGATCCCGA CGACCGCTAC GGGGCCGGAA TTGTGGACGG ATTCGAAGCC GTCTCTTCGG TCACGAGCGA CGAGTTCACG GTGACTGAGT TTGATGGCCC CAAAGAAACG GACCCCGGTG CGACGGTTGA AGCCTCCGCA ACGGTCACCA ACACTGGTGG CGGACCGGGA ACCGGAACTG TCGAGTACCA ATTTAACGGA ACCGTCGGTA ACGACACAAC CGTCTCTCTC GATCCCGGCG AAGCGACGAC CGTCTCGTTC ACCTACGTCG TCCCCCATGA GACCGATCCG GACGAATACT CGCACGGCGT CTACACTGAG 
GATTCGAGTC AAGCGTCGAC AATTACTGTC CGTGAACCTC CCTCGTACAC GGTTACGGAT CTCACCACGC CCGAAACCGT CGAACGCGGT GGACCGCTTA CCACCACGGT GAACGTGACG AACGACGGCG AGGTGGCCGG CGATAACCGC ACCGTCGAAC TCCGCCTCGT CGACCCCGGG AATGCGAGCA ACGCCAGCGC CCTCGGCGCG GAGAACGTCT CGCTCGGCCC CGGAAACACG ACGACGGTTT CGCTCAGCGG AACCGTGCCG AGCGGGTTCG GGACCGGCGA GACGACCGTG ACGGTCGTCT CCCCCGAGGA CGACGCCTCG GCGTCGATCC GGATCGCGGA GGCCGTCGGG ACGATCAGCG GTACCGTCAC CGACGCGGAG ACGAACGCGT CCCTCGCCGA GATCGACGTG GTTGTGAAGA ACGGTACAGA AGTGGTAGGA ACGACGATAA CGGGATCTGA GGGCACCTAC GCGATCGACG TTCCGGCTAC GGATCTCACC GTCATCGCGA GCAACGCGAC GTACGCCCCG GCGGATCAGA CGGTGGCCCT GGCCGGGTCG GGGGACACCG CGACCGCGAA CATCTCGCTC GCGCTCCGGA ACGGCACGCT GAGCGGCGTC GTTGGCGCGA GTGACGGACT CGATCTCCCG TCGAACGCGA CCGTGACGGT CACGGACGAG ACGGGCGGAG CGGTCGCGGC CGTCGACGCG GCCAGCGACG GGACCTATTC CGTCGATCTC CGTCCCGGCA CCTACAACGC CACCGCGGAC GCGCCGGACT TCGATACCCA GACGGTAGCC GACATCGCGG TTTCGCCGAA CGCGACGGCA AGCGAGAGCT TCGCACTAAT CCCACAGCGA GGGTCTCTCT CCGGGATGGT GACGAACGCG ACCGACAGCG AGGCGATCCC GGACGCGACG ATCACGGTAG ACGGGGAACG GACGGCCACG ACCGACGCGA ACGGAACCTA CGCGCTCGAC GCTCTCGAAC GCGGCGAGCG GGAGATAACT GTCTCCGCGG ACGGATACGC ACAGAAGAGT CAGACGCTCA CGTTCACCGC GAACGACACT CGAGAGCTGA ACGTCTCGCT GTCTCCTCGT GGCGCGTTCG TTATCACCCA ACTCTCGGGA GATGACGAAA TCGAACAGGG ATCGAGCGGC GGCTTCGGGC TCACAGTGCG AAACGACGGG CGCGTTACCG ACGACGCGTC CGTGGACGTC ACGCTGAATC GGAGCGGCTC CGTGAGTCCC AATCCCGTGG TGATCGGCGA CGTTGCGGTC GGCGATACCG AGTCGGAATC GGTCTCTGTG TCGCTCGGAT CGAGTGCAGC GACCGGGACA TACGCCGTAA CCGCCACGAC TCCCGACAGC GAAAAGACAC TCACGTTCGT TGCAGTCAGC AGCGATGACG CCGAATCCGA CGGCGGCGGC ACAGCAGGCG GTGGCGGCGG CACAGCAGGC GCCGGTCCCG CGCCACCGCC GAGCACGGAG GACCCTGACG AGCCCGAAAA CGCGACCGAC GATCCAACGA ACGAGACCGA CGACCCGACG AACGCGACCG ACGACCCGAC GAACGCGACC GACGACCCGA CAGACGGCGA AGACGATCCG GTCGACGGTG AGTTAGCGGA CGGCGATGAT GAGAACAACG CGGAAGACGA CGGGATCGAT ACCGAATCTG GCGTGGCGAC CGACGGGGAG TCCGGAGGCG ACGACACCAG CGACGAAGAC GAGTCGGCCG ACGGAACGCC CGGCTTCGGT CCCGCGATCG GCGTCCTCGC GATTTTCGGG GCGGCGTTCC 
TCGTCGGGCG GCGCGGGGGA CGCGGCGGTC GCGAGTAA
|
Protein sequence | MIPSSPFSTA TAATLVLVAC LIVAAVAPGA VAGGLVTSST EAVDSGRTDP AAVGSVSTAD GDVIASDLRS ANGTVKLVVR FAGGTRIGTD DGEVGYSKGS STLSTNDLKT NAASAQADFE SFAEGRSAVT VERSFWLANA MLVTVDTDRV PLDRLVDVPG VEGVHENFEV ELDSATTTTP GDGGSQAIGT SGLPPAPTPE DVSTASTDTD ATYGVEMVRA PEVWETFGTR GKGATVAVID TGIDPDHPDL TVSGWAEYDA DGNLVSDDVS DASDGDGHGT HVAGTVAGGN ASGTAIGVAP NASLHGIKVF DDDGTNATFV RVVAGMEHAT QDPDVDVLQM SLGADGHLPY FIEPVRNTRS AGKIAVVSAG NIGQGTSSSP GNVYDSLAVG AVNDSRGVAD FSSGETINTS SAWGSDAPAD WPDEYVVPDV SAPGVSVYSA EPGGTYIRKD GTSMAAPHVS GVAALMLSAS SGDVSDDKLY DTLRNTANHP SNAADPDDRY GAGIVDGFEA VSSVTSDEFT VTEFDGPKET DPGATVEASA TVTNTGGGPG TGTVEYQFNG TVGNDTTVSL DPGEATTVSF TYVVPHETDP DEYSHGVYTE DSSQASTITV REPPSYTVTD LTTPETVERG GPLTTTVNVT NDGEVAGDNR TVELRLVDPG NASNASALGA ENVSLGPGNT TTVSLSGTVP SGFGTGETTV TVVSPEDDAS ASIRIAEAVG TISGTVTDAE TNASLAEIDV VVKNGTEVVG TTITGSEGTY AIDVPATDLT VIASNATYAP ADQTVALAGS GDTATANISL ALRNGTLSGV VGASDGLDLP SNATVTVTDE TGGAVAAVDA ASDGTYSVDL RPGTYNATAD APDFDTQTVA DIAVSPNATA SESFALIPQR GSLSGMVTNA TDSEAIPDAT ITVDGERTAT TDANGTYALD ALERGEREIT VSADGYAQKS QTLTFTANDT RELNVSLSPR GAFVITQLSG DDEIEQGSSG GFGLTVRNDG RVTDDASVDV TLNRSGSVSP NPVVIGDVAV GDTESESVSV SLGSSAATGT YAVTATTPDS EKTLTFVAVS SDDAESDGGG TAGGGGGTAG AGPAPPPSTE DPDEPENATD DPTNETDDPT NATDDPTNAT DDPTDGEDDP VDGELADGDD ENNAEDDGID TESGVATDGE SGGDDTSDED ESADGTPGFG PAIGVLAIFG AAFLVGRRGG RGGRE
|