Gene Hlac_1110 details


Gene Information

Locus tag: Hlac_1110
Symbol:
ID: 7400919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1117050
End bp: 1118126
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 70%
IMG OID: 643708175
Product: peptidase M48 Ste24p
Protein accession: YP_002565774
Protein GI: 222479537
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0501] Zn-dependent protease with chaperone function
TIGRFAM ID:
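
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the CDS ends with one stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information record above.
START_BP = 1117050
END_BP = 1118126


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```

Both results agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.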


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCCCT CCATCGCCGC CGCCCTTCGG TCCGTCCCGC CGTTCGATCG GCTCGCTCGC 
GTCGATCAAC AGCAGTGGCT CCGAATCCGG ATCGCGGTCG CCACCGGGCT GGTGATCGCC
CTCCCGTTCG CGTTCGCGTA CACGTTCGTC TTCCTGATCA ACACCATCGG ACTCCCCCTC
TTGGAGTGGG CGAGCGAGCG CCCCTACACC GGGGAGTTCT ACGTCGACCC CGTCGTCCTC
GCGGTCGTCG TGCTCGGCGG GCTGGCGGTG CAGTACCGGT ACGGCCCGCG GACCGTGGTG
CGCTCCGTCG GCGGGCGTCG CGTCTCCGCG GACGAGTACC CGGAACTCCA CGCCGCGGTC
ACCCGGCTAG CGGCCCAGAC CGACGTGCCG AAGCCCGACG TGGCGGTCGC GCGGACGGAC
CTCCCGAACG CTTTCGCGGT CGGCCGACGA GAGAGCGGCA CCGTCGTGGT CACGACCGCG
CTGTTGGAGA CGCTCGACGA CGACGAGCGC GACGCGGTGT TGGCCCACGA GCTTGCACAC
CTCAAAAACC GGGACGCGAG CCTGATGACG GTTGCGTGGG TGTTGCCGAC GGTCACCTAC
TACCTCGCGG CGCTCGCGTT CTACGTGCTG TACGGCCTGT TCAAACTCCT CAGCTTCGGC
GGCGGATCGG GCGGCGACCG CGACGGGCGA GCGCTCGCGG TCGGGATCGT CGTGATCACC
GTAAGCGCGC TCGTCACGCT CACCGTCTCG GCGATGTTCT GGTGCGGAAG CGTCCTGATC
CACCGCGTAC TCTCGCGATA CCGCGAGTAC GCGGCCGACC GCGCGGCCGC CGAGATCACC
GGGTCGCCGG CGGCGCTCGC GAGCGCGCTC GACGCCCTCG ACGAGTCGAT GCCGGAGGTG
CCCGACCGCG ATCTCCGCGA GTTCGACGGC GGCGCGGAGG CGCTGTACGT CGCGCCCTTA
GAGAGCCGCG CGTTCGGCGA CAAGGAACTC GTGAGTACCG ACGTGTTCCC GGAGACGCAC
CCGCCGACGC GCGAGCGGAT CGAGCGGCTC CGCGAGCTGG CGGGTGAGAC CGCGTGA
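
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as text with the same grouping into blocks of ten bases; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example call uses only the first ten-base block of the gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence: 3 G + 3 C out of 10 bases.
first_block = "ATGTGGCCCT"
print(gc_percent(first_block))  # 60.0
```

Passing the full 1077-base sequence to `gc_percent` gives the value to compare against the GC content field.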
 
Protein sequence
MWPSIAAALR SVPPFDRLAR VDQQQWLRIR IAVATGLVIA LPFAFAYTFV FLINTIGLPL 
LEWASERPYT GEFYVDPVVL AVVVLGGLAV QYRYGPRTVV RSVGGRRVSA DEYPELHAAV
TRLAAQTDVP KPDVAVARTD LPNAFAVGRR ESGTVVVTTA LLETLDDDER DAVLAHELAH
LKNRDASLMT VAWVLPTVTY YLAALAFYVL YGLFKLLSFG GGSGGDRDGR ALAVGIVVIT
VSALVTLTVS AMFWCGSVLI HRVLSRYREY AADRAAAEIT GSPAALASAL DALDESMPEV
PDRDLREFDG GAEALYVAPL ESRAFGDKEL VSTDVFPETH PPTRERIERL RELAGETA
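
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the usual compact 64-letter layout; the amino-acid assignments are the same for the standard code and for translation table 11, which differs in its set of permitted start codons. Since this gene starts with ATG, no special start-codon handling is included (an assumption of this sketch). `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); "*" is stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGTGGCCCT CCATCGCCGC CGCCCTTCGG"))  # MWPSIAAALR
# Last 57 bases of the gene sequence (in frame, ending with the TGA stop).
print(translate(
    "CCGCCGACGC GCGAGCGGAT CGAGCGGCTC CGCGAGCTGG CGGGTGAGAC CGCGTGA"
))  # PPTRERIERLRELAGETA
```

Both fragments match the corresponding start and end of the protein sequence listed above.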