Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2392 |
Symbol | |
ID | 7400510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 2385697 |
End bp | 2387412 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643709465 |
Product | O-sialoglycoprotein endopeptidase/protein kinase |
Protein accession | YP_002567037 |
Protein GI | 222480800 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.890416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTTC TCGGCATCGA GGGAACCGCG TGGTGCGCGA GCGCCGCCCT GTACGACGCC GAGACCGACT CCGTTCTCAT CGAATCGAAC CCGTACGAGC CGGACAGCGG CGGCATTCAC CCTCGCGAGG CTGCCGAGCA CATGTCCGAG GCGATCCCCG AGGTTGTCGA CGCGGTTCTC ACCACGGCTG AGGCCGAGCA TGGTCCCGAC GCAATCGATG CGGTCGCGTT CTCGCGAGGG CCGGGACTCG GCCCGTGTCT CCGCATCGTC GGGACCGCCG CTCGGTCGCT CGCGGGGACG CTCGACGTAC CCCTCGTCGG CGTCAATCAC ATGGTGGCGC ACTTAGAGAT CGGTCGCCAT CAGTCCGGCT TCGAAAACCC GGTGTGTCTG AACACCTCCG GCGCGAACGC CCACCTGCTC GGCTACCACG ACGGGCGCTA CCGCGTGCTC GGGGAGACGA TGGACGCCGG CGTCGGCAAC GCGATCGATA AGTTCACCCG CCACGTCGGC TGGGACCACC CCGGCGGCCC GAAGGTCGAG GCGGCGGCGC GGCGGTACGC CGAGGGAAAC GACGGCCCGG AGGACCTCCT CGACCTCCCG TACGTCGTCA AAGGGATGGA CTTCTCCTTC TCCGGGATCA GCTCCGCCGC CAACGACGCG TACGACGACG GAGTCCCAGT CGAGGAAATC TGTTTTTCGC TCCAAGAGCA CGTGTTCGCG ATGCTGACGG AGGTTTCGGA GCGTGCCCTC TCACTGACCG GCGCCGACGA GCTCGTGTTG GGTGGGGGGG TCGCACAGAA CGATCGGCTT CGCGAGATGC TGGCGTCGAT GTGTGCGGCC CGCGGCGCAC GCTTCCACGC CCCAGACTCG CGGTTCCTCC GCGACAACGC CGGGATGATC GCCGTGTTGG GCGCGAAGAT GGCACAGGCT GGCGACACAG TCCCCATCTC GGAGTCCGCG ATCGATCCCA ACTTTCGCCC GGATCAGGTG CCCGTGACGT GGCGGAGCGG TGAGTCGGTC GCTCGCGGCC GTGCTCCCGG GAGCGACGAC AAGACGGACG CGGATCGGCG AGGCGCCGAA GCGACGGTAG AGATTGTCCC GAGCGGCGAG CCCGACGCCG CCGACCGTCG GGTGATCAAA CGCCGGGTTC CGAAGGAGTA CCGTCACCCC GGCCTCGACC GGACGCTCAG GCGTGACCGG ACCGTCGCGG AGGCCCGGCT GACGAGCGAG GCCCGGCAGG CGGGGGTGAC GACCCCGTTG GTGTACGACG CCGACGTGCC GAATGCGACG CTGACGCTCC AGTACGTCGG CGACCGCGAC CTTGCCGCCG CCCTTGACGG GGGAACCGAG CGCGTGGCGG CAGTCGGGCG ATACCTCGCG CGGCTCCACG ACGCGGGGAT CGTTCACGGC GATCCGACTA CGCGGAACGT GCGGGTGGGC GTGGGAGATT CCGATACACA AACAGGGGAT GGCGAAGCGA ACGGTACGAC GGCCGTCGAC GATCGGACCG CACTCATCGA CTTCGGGCTC GCCTATCACA CCGGCCACGT TGAGGACCAC GCGATGGATC TCCACGTGTT CGAGGGGTCC GTCCGGGCGA CCGCGATCGA TCCGGATCCG CTGATCGAGG CGTTCGAGGA CGGATACGCG ACGGTCGGCG ACGACGACGT TCTCGACCGG CTCCGCGACG TAGAGGGCCG CGGGCGGTAC CGGTGA
|
Protein sequence | MRVLGIEGTA WCASAALYDA ETDSVLIESN PYEPDSGGIH PREAAEHMSE AIPEVVDAVL TTAEAEHGPD AIDAVAFSRG PGLGPCLRIV GTAARSLAGT LDVPLVGVNH MVAHLEIGRH QSGFENPVCL NTSGANAHLL GYHDGRYRVL GETMDAGVGN AIDKFTRHVG WDHPGGPKVE AAARRYAEGN DGPEDLLDLP YVVKGMDFSF SGISSAANDA YDDGVPVEEI CFSLQEHVFA MLTEVSERAL SLTGADELVL GGGVAQNDRL REMLASMCAA RGARFHAPDS RFLRDNAGMI AVLGAKMAQA GDTVPISESA IDPNFRPDQV PVTWRSGESV ARGRAPGSDD KTDADRRGAE ATVEIVPSGE PDAADRRVIK RRVPKEYRHP GLDRTLRRDR TVAEARLTSE ARQAGVTTPL VYDADVPNAT LTLQYVGDRD LAAALDGGTE RVAAVGRYLA RLHDAGIVHG DPTTRNVRVG VGDSDTQTGD GEANGTTAVD DRTALIDFGL AYHTGHVEDH AMDLHVFEGS VRATAIDPDP LIEAFEDGYA TVGDDDVLDR LRDVEGRGRY R
|
| |