Gene Hlac_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1628 
Symbol 
ID7399577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1648534 
End bp1652181 
Gene Length3648 bp 
Protein Length1215 aa 
Translation table11 
GC content66% 
IMG OID643708694 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002566283 
Protein GI222480046 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.279329 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCCCT CGTCTCCCTT TTCTACCGCG ACTGCCGCCA CGCTCGTTCT TGTCGCCTGT 
CTGATCGTGG CCGCCGTCGC ACCCGGTGCG GTCGCTGGCG GTTTGGTTAC GAGCAGCACT
GAGGCGGTTG ATTCCGGCAG AACCGATCCC GCCGCGGTCG GTTCCGTTTC GACGGCCGAC
GGCGATGTGA TCGCCTCCGA CCTGCGCTCG GCGAACGGCA CCGTCAAGCT CGTGGTGCGG
TTCGCGGGCG GCACTCGGAT CGGAACCGAT GACGGCGAAG TGGGGTATTC GAAGGGGTCG
TCGACGCTCT CGACGAACGA CCTCAAAACG AACGCCGCGA GCGCGCAGGC CGACTTCGAA
TCGTTCGCTG AGGGACGGTC CGCGGTCACG GTCGAACGAA GCTTCTGGCT CGCGAACGCG
ATGCTCGTCA CGGTCGACAC CGACCGCGTC CCGCTGGATC GGCTCGTTGA TGTGCCGGGC
GTCGAGGGCG TCCACGAGAA CTTCGAAGTC GAACTCGACT CGGCAACAAC GACGACTCCC
GGCGACGGTG GTTCGCAGGC GATCGGGACC TCCGGTCTAC CGCCCGCGCC TACTCCCGAG
GACGTCTCGA CCGCTTCGAC CGACACCGAC GCCACCTACG GCGTCGAGAT GGTGCGCGCG
CCGGAGGTCT GGGAGACGTT CGGGACCCGC GGGAAGGGCG CGACCGTCGC CGTGATCGAC
ACGGGAATTG ACCCGGACCA TCCTGACCTA ACGGTGAGCG GGTGGGCGGA GTACGACGCG
GACGGGAATC TGGTGAGCGA CGACGTGTCC GATGCGTCGG ACGGAGACGG GCACGGAACT
CACGTCGCCG GAACCGTCGC GGGCGGAAAC GCGAGTGGGA CCGCAATCGG CGTCGCACCG
AACGCGTCGC TCCACGGGAT CAAGGTGTTC GACGACGACG GGACCAACGC GACGTTCGTC
CGCGTCGTCG CGGGAATGGA ACACGCGACG CAGGATCCGG ACGTCGACGT GCTTCAGATG
AGCCTCGGTG CGGACGGGCA CTTACCATAC TTCATCGAAC CGGTTCGGAA CACTCGCAGT
GCCGGGAAGA TCGCGGTCGT TTCGGCCGGG AACATCGGTC AAGGAACGTC GAGTTCTCCC
GGGAACGTCT ACGACTCGCT CGCGGTCGGA GCGGTCAACG ACAGCCGCGG CGTCGCCGAC
TTCTCCAGCG GAGAGACGAT CAACACGTCA AGCGCGTGGG GAAGTGATGC CCCCGCGGAC
TGGCCCGACG AGTACGTGGT TCCGGACGTG TCGGCGCCAG GCGTGAGCGT ATACTCGGCG
GAACCGGGCG GTACCTACAT CCGAAAGGAC GGCACCTCCA TGGCCGCACC GCACGTCAGC
GGCGTCGCGG CGTTGATGCT GTCGGCGTCG TCGGGCGATG TTTCTGACGA CAAGCTCTAC
GACACGCTCC GTAATACCGC GAACCACCCG TCGAACGCGG CCGATCCCGA CGACCGCTAC
GGGGCCGGAA TTGTGGACGG ATTCGAAGCC GTCTCTTCGG TCACGAGCGA CGAGTTCACG
GTGACTGAGT TTGATGGCCC CAAAGAAACG GACCCCGGTG CGACGGTTGA AGCCTCCGCA
ACGGTCACCA ACACTGGTGG CGGACCGGGA ACCGGAACTG TCGAGTACCA ATTTAACGGA
ACCGTCGGTA ACGACACAAC CGTCTCTCTC GATCCCGGCG AAGCGACGAC CGTCTCGTTC
ACCTACGTCG TCCCCCATGA GACCGATCCG GACGAATACT CGCACGGCGT CTACACTGAG
GATTCGAGTC AAGCGTCGAC AATTACTGTC CGTGAACCTC CCTCGTACAC GGTTACGGAT
CTCACCACGC CCGAAACCGT CGAACGCGGT GGACCGCTTA CCACCACGGT GAACGTGACG
AACGACGGCG AGGTGGCCGG CGATAACCGC ACCGTCGAAC TCCGCCTCGT CGACCCCGGG
AATGCGAGCA ACGCCAGCGC CCTCGGCGCG GAGAACGTCT CGCTCGGCCC CGGAAACACG
ACGACGGTTT CGCTCAGCGG AACCGTGCCG AGCGGGTTCG GGACCGGCGA GACGACCGTG
ACGGTCGTCT CCCCCGAGGA CGACGCCTCG GCGTCGATCC GGATCGCGGA GGCCGTCGGG
ACGATCAGCG GTACCGTCAC CGACGCGGAG ACGAACGCGT CCCTCGCCGA GATCGACGTG
GTTGTGAAGA ACGGTACAGA AGTGGTAGGA ACGACGATAA CGGGATCTGA GGGCACCTAC
GCGATCGACG TTCCGGCTAC GGATCTCACC GTCATCGCGA GCAACGCGAC GTACGCCCCG
GCGGATCAGA CGGTGGCCCT GGCCGGGTCG GGGGACACCG CGACCGCGAA CATCTCGCTC
GCGCTCCGGA ACGGCACGCT GAGCGGCGTC GTTGGCGCGA GTGACGGACT CGATCTCCCG
TCGAACGCGA CCGTGACGGT CACGGACGAG ACGGGCGGAG CGGTCGCGGC CGTCGACGCG
GCCAGCGACG GGACCTATTC CGTCGATCTC CGTCCCGGCA CCTACAACGC CACCGCGGAC
GCGCCGGACT TCGATACCCA GACGGTAGCC GACATCGCGG TTTCGCCGAA CGCGACGGCA
AGCGAGAGCT TCGCACTAAT CCCACAGCGA GGGTCTCTCT CCGGGATGGT GACGAACGCG
ACCGACAGCG AGGCGATCCC GGACGCGACG ATCACGGTAG ACGGGGAACG GACGGCCACG
ACCGACGCGA ACGGAACCTA CGCGCTCGAC GCTCTCGAAC GCGGCGAGCG GGAGATAACT
GTCTCCGCGG ACGGATACGC ACAGAAGAGT CAGACGCTCA CGTTCACCGC GAACGACACT
CGAGAGCTGA ACGTCTCGCT GTCTCCTCGT GGCGCGTTCG TTATCACCCA ACTCTCGGGA
GATGACGAAA TCGAACAGGG ATCGAGCGGC GGCTTCGGGC TCACAGTGCG AAACGACGGG
CGCGTTACCG ACGACGCGTC CGTGGACGTC ACGCTGAATC GGAGCGGCTC CGTGAGTCCC
AATCCCGTGG TGATCGGCGA CGTTGCGGTC GGCGATACCG AGTCGGAATC GGTCTCTGTG
TCGCTCGGAT CGAGTGCAGC GACCGGGACA TACGCCGTAA CCGCCACGAC TCCCGACAGC
GAAAAGACAC TCACGTTCGT TGCAGTCAGC AGCGATGACG CCGAATCCGA CGGCGGCGGC
ACAGCAGGCG GTGGCGGCGG CACAGCAGGC GCCGGTCCCG CGCCACCGCC GAGCACGGAG
GACCCTGACG AGCCCGAAAA CGCGACCGAC GATCCAACGA ACGAGACCGA CGACCCGACG
AACGCGACCG ACGACCCGAC GAACGCGACC GACGACCCGA CAGACGGCGA AGACGATCCG
GTCGACGGTG AGTTAGCGGA CGGCGATGAT GAGAACAACG CGGAAGACGA CGGGATCGAT
ACCGAATCTG GCGTGGCGAC CGACGGGGAG TCCGGAGGCG ACGACACCAG CGACGAAGAC
GAGTCGGCCG ACGGAACGCC CGGCTTCGGT CCCGCGATCG GCGTCCTCGC GATTTTCGGG
GCGGCGTTCC TCGTCGGGCG GCGCGGGGGA CGCGGCGGTC GCGAGTAA
 
Protein sequence
MIPSSPFSTA TAATLVLVAC LIVAAVAPGA VAGGLVTSST EAVDSGRTDP AAVGSVSTAD 
GDVIASDLRS ANGTVKLVVR FAGGTRIGTD DGEVGYSKGS STLSTNDLKT NAASAQADFE
SFAEGRSAVT VERSFWLANA MLVTVDTDRV PLDRLVDVPG VEGVHENFEV ELDSATTTTP
GDGGSQAIGT SGLPPAPTPE DVSTASTDTD ATYGVEMVRA PEVWETFGTR GKGATVAVID
TGIDPDHPDL TVSGWAEYDA DGNLVSDDVS DASDGDGHGT HVAGTVAGGN ASGTAIGVAP
NASLHGIKVF DDDGTNATFV RVVAGMEHAT QDPDVDVLQM SLGADGHLPY FIEPVRNTRS
AGKIAVVSAG NIGQGTSSSP GNVYDSLAVG AVNDSRGVAD FSSGETINTS SAWGSDAPAD
WPDEYVVPDV SAPGVSVYSA EPGGTYIRKD GTSMAAPHVS GVAALMLSAS SGDVSDDKLY
DTLRNTANHP SNAADPDDRY GAGIVDGFEA VSSVTSDEFT VTEFDGPKET DPGATVEASA
TVTNTGGGPG TGTVEYQFNG TVGNDTTVSL DPGEATTVSF TYVVPHETDP DEYSHGVYTE
DSSQASTITV REPPSYTVTD LTTPETVERG GPLTTTVNVT NDGEVAGDNR TVELRLVDPG
NASNASALGA ENVSLGPGNT TTVSLSGTVP SGFGTGETTV TVVSPEDDAS ASIRIAEAVG
TISGTVTDAE TNASLAEIDV VVKNGTEVVG TTITGSEGTY AIDVPATDLT VIASNATYAP
ADQTVALAGS GDTATANISL ALRNGTLSGV VGASDGLDLP SNATVTVTDE TGGAVAAVDA
ASDGTYSVDL RPGTYNATAD APDFDTQTVA DIAVSPNATA SESFALIPQR GSLSGMVTNA
TDSEAIPDAT ITVDGERTAT TDANGTYALD ALERGEREIT VSADGYAQKS QTLTFTANDT
RELNVSLSPR GAFVITQLSG DDEIEQGSSG GFGLTVRNDG RVTDDASVDV TLNRSGSVSP
NPVVIGDVAV GDTESESVSV SLGSSAATGT YAVTATTPDS EKTLTFVAVS SDDAESDGGG
TAGGGGGTAG AGPAPPPSTE DPDEPENATD DPTNETDDPT NATDDPTNAT DDPTDGEDDP
VDGELADGDD ENNAEDDGID TESGVATDGE SGGDDTSDED ESADGTPGFG PAIGVLAIFG
AAFLVGRRGG RGGRE