Gene Hlac_2392 details

Gene Information

Locus tag: Hlac_2392
Symbol:
ID: 7400510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2385697
End bp: 2387412
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 69%
IMG OID: 643709465
Product: O-sialoglycoprotein endopeptidase/protein kinase
Protein accession: YP_002567037
Protein GI: 222480800
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
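
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another. A minimal Python sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS whose last codon is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS that includes its terminal stop codon.

    Assumption: the CDS is unspliced and its length is a multiple of 3;
    the final codon is a stop and does not code for an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2385697, 2387412)
print(length)                     # 1716
print(protein_length_aa(length))  # 571
```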


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.890416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTTC TCGGCATCGA GGGAACCGCG TGGTGCGCGA GCGCCGCCCT GTACGACGCC 
GAGACCGACT CCGTTCTCAT CGAATCGAAC CCGTACGAGC CGGACAGCGG CGGCATTCAC
CCTCGCGAGG CTGCCGAGCA CATGTCCGAG GCGATCCCCG AGGTTGTCGA CGCGGTTCTC
ACCACGGCTG AGGCCGAGCA TGGTCCCGAC GCAATCGATG CGGTCGCGTT CTCGCGAGGG
CCGGGACTCG GCCCGTGTCT CCGCATCGTC GGGACCGCCG CTCGGTCGCT CGCGGGGACG
CTCGACGTAC CCCTCGTCGG CGTCAATCAC ATGGTGGCGC ACTTAGAGAT CGGTCGCCAT
CAGTCCGGCT TCGAAAACCC GGTGTGTCTG AACACCTCCG GCGCGAACGC CCACCTGCTC
GGCTACCACG ACGGGCGCTA CCGCGTGCTC GGGGAGACGA TGGACGCCGG CGTCGGCAAC
GCGATCGATA AGTTCACCCG CCACGTCGGC TGGGACCACC CCGGCGGCCC GAAGGTCGAG
GCGGCGGCGC GGCGGTACGC CGAGGGAAAC GACGGCCCGG AGGACCTCCT CGACCTCCCG
TACGTCGTCA AAGGGATGGA CTTCTCCTTC TCCGGGATCA GCTCCGCCGC CAACGACGCG
TACGACGACG GAGTCCCAGT CGAGGAAATC TGTTTTTCGC TCCAAGAGCA CGTGTTCGCG
ATGCTGACGG AGGTTTCGGA GCGTGCCCTC TCACTGACCG GCGCCGACGA GCTCGTGTTG
GGTGGGGGGG TCGCACAGAA CGATCGGCTT CGCGAGATGC TGGCGTCGAT GTGTGCGGCC
CGCGGCGCAC GCTTCCACGC CCCAGACTCG CGGTTCCTCC GCGACAACGC CGGGATGATC
GCCGTGTTGG GCGCGAAGAT GGCACAGGCT GGCGACACAG TCCCCATCTC GGAGTCCGCG
ATCGATCCCA ACTTTCGCCC GGATCAGGTG CCCGTGACGT GGCGGAGCGG TGAGTCGGTC
GCTCGCGGCC GTGCTCCCGG GAGCGACGAC AAGACGGACG CGGATCGGCG AGGCGCCGAA
GCGACGGTAG AGATTGTCCC GAGCGGCGAG CCCGACGCCG CCGACCGTCG GGTGATCAAA
CGCCGGGTTC CGAAGGAGTA CCGTCACCCC GGCCTCGACC GGACGCTCAG GCGTGACCGG
ACCGTCGCGG AGGCCCGGCT GACGAGCGAG GCCCGGCAGG CGGGGGTGAC GACCCCGTTG
GTGTACGACG CCGACGTGCC GAATGCGACG CTGACGCTCC AGTACGTCGG CGACCGCGAC
CTTGCCGCCG CCCTTGACGG GGGAACCGAG CGCGTGGCGG CAGTCGGGCG ATACCTCGCG
CGGCTCCACG ACGCGGGGAT CGTTCACGGC GATCCGACTA CGCGGAACGT GCGGGTGGGC
GTGGGAGATT CCGATACACA AACAGGGGAT GGCGAAGCGA ACGGTACGAC GGCCGTCGAC
GATCGGACCG CACTCATCGA CTTCGGGCTC GCCTATCACA CCGGCCACGT TGAGGACCAC
GCGATGGATC TCCACGTGTT CGAGGGGTCC GTCCGGGCGA CCGCGATCGA TCCGGATCCG
CTGATCGAGG CGTTCGAGGA CGGATACGCG ACGGTCGGCG ACGACGACGT TCTCGACCGG
CTCCGCGACG TAGAGGGCCG CGGGCGGTAC CGGTGA
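
The record reports a GC content of 69% for this gene. As a rough sketch of how such a figure could be computed from a sequence printed in space-separated blocks like the one above, the Python below strips whitespace and counts G and C bases. The function name `gc_percent` is hypothetical, and the way the database itself computes or rounds the value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to print the sequence
    in blocks of ten) is ignored; case is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence in this record (20 bases, 12 of them G or C).
print(gc_percent("ATGCGCGTTC TCGGCATCGA"))  # 60.0
```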
 
Protein sequence
MRVLGIEGTA WCASAALYDA ETDSVLIESN PYEPDSGGIH PREAAEHMSE AIPEVVDAVL 
TTAEAEHGPD AIDAVAFSRG PGLGPCLRIV GTAARSLAGT LDVPLVGVNH MVAHLEIGRH
QSGFENPVCL NTSGANAHLL GYHDGRYRVL GETMDAGVGN AIDKFTRHVG WDHPGGPKVE
AAARRYAEGN DGPEDLLDLP YVVKGMDFSF SGISSAANDA YDDGVPVEEI CFSLQEHVFA
MLTEVSERAL SLTGADELVL GGGVAQNDRL REMLASMCAA RGARFHAPDS RFLRDNAGMI
AVLGAKMAQA GDTVPISESA IDPNFRPDQV PVTWRSGESV ARGRAPGSDD KTDADRRGAE
ATVEIVPSGE PDAADRRVIK RRVPKEYRHP GLDRTLRRDR TVAEARLTSE ARQAGVTTPL
VYDADVPNAT LTLQYVGDRD LAAALDGGTE RVAAVGRYLA RLHDAGIVHG DPTTRNVRVG
VGDSDTQTGD GEANGTTAVD DRTALIDFGL AYHTGHVEDH AMDLHVFEGS VRATAIDPDP
LIEAFEDGYA TVGDDDVLDR LRDVEGRGRY R
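
The protein sequence corresponds to the gene sequence translated codon by codon. The Python sketch below illustrates this with a codon table built from the standard 64-letter amino acid string in TCAG order; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Amino acids for all 64 codons, ordered by first, second, third base over T, C, A, G.
# This codon-to-amino-acid assignment is shared by translation tables 1 and 11.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace between printed blocks is ignored. Any trailing bases
    that do not form a complete codon are ignored.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCGCGTTC TCGGCATCGA GGGAACCGCG"))  # MRVLGIEGTA
```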