Gene Hlac_0454 details

Gene Information

Locus tag: Hlac_0454
Symbol:
ID: 7401072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 472018
End bp: 472953
Gene Length: 936 bp
Protein Length: 311 aa
Translation table: 11
GC content: 70%
IMG OID: 643707518
Product: signal peptide peptidase SppA, 36K type
Protein accession: YP_002565126
Protein GI: 222478889
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
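
The length fields above agree with the coordinates: an inclusive interval from 472018 to 472953 spans 936 bp, and 936 bp is 312 codons, one of which is the stop codon, leaving 311 residues. A minimal Python sketch of that check follows. The function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the record does not state either convention explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(472018, 472953)
print(length, protein_length(length))  # prints "936 311"
```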


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.496953
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGCA ACCCCCGCTC GAACGGGAAG AATCTTCTCG CGGCTGCGGT GATCGGCGCG 
ACGGCGACAG TGGCGGGCAA GCGGCTCGTC GCTCGACTCA CCCGCGGGCG GTTCGGCGAG
ACAGAGGAGT ACAACACGGC GAAGGTGACG GTTTCTGGAC CGATCCGGCG CGACACCGGC
CGCCCGTCCC CGCTTTCGAG AGCGGGTGGG GCGACCGCCG ACGACGTGGT CGAGCAGATC
GAGGCGGCCG ACGAGGACGA GGACGTGGAG GCGCTCGTCG TCGAGCTCAA TACCCCGGGG
GGCGAGGTGC TCCCGAGCGA CGATATCCGG CGCGCGGCGG CCGACTTCGA CGGCCCCACG
CTCGCGTACG CTACCGACAC CTGCGCGTCC GGCGGCTACT GGATCGCGAG CGGCTGCGAC
GAGCTGTGGG CGCGCGACGC GAGCCTCGTC GGCTCGATCG GCGTCGTCGG CTCCCGCCCG
AACGCGGCCG GACTGGCCGA CAAGCTCGGG ATCTCCTACG AGCAGTTCAC CGCCGGCGAG
TACAAGGACG CCGGCGTCCC GCTGCGGGAG ATCGAGGAGG ACGAACGCGA GTACCTGCAG
GGGATCATCG ACGGCTACTA CGAGCAGTTC GTCGAGACGG TCAGCGAGGG CCGAGACATG
GACCCGGAGG CGATCCGGGA GACGGAAGCG CGGGTCTACC TCGGCAGCGA CGCCGCCGAG
ATCGGACTCG TCGACGAGCT CGGCACCGAA GACGACGTTG AAGACCGGCT CGAAGAGCTG
ATCGACACGG AACCAGAGAT CCACGAGTTC ACGCCCAAGC GGGGACTCGC GGAGCGACTC
GGTATCGGGG CCGAGCGCGT CGCGTTCGCG GCCGGCAGCG GCGTCGCGAG CGTGTTCACG
AACGAGGGCG GCGACGTGGA CGTCGAGCTA CGGTAA
 
Protein sequence
MERNPRSNGK NLLAAAVIGA TATVAGKRLV ARLTRGRFGE TEEYNTAKVT VSGPIRRDTG 
RPSPLSRAGG ATADDVVEQI EAADEDEDVE ALVVELNTPG GEVLPSDDIR RAAADFDGPT
LAYATDTCAS GGYWIASGCD ELWARDASLV GSIGVVGSRP NAAGLADKLG ISYEQFTAGE
YKDAGVPLRE IEEDEREYLQ GIIDGYYEQF VETVSEGRDM DPEAIRETEA RVYLGSDAAE
IGLVDELGTE DDVEDRLEEL IDTEPEIHEF TPKRGLAERL GIGAERVAFA AGSGVASVFT
NEGGDVDVEL R
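
Both sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below shows one way to strip that layout and compute a GC fraction. The helper names are hypothetical, and the sample input is only the first printed line of the gene sequence; running it on the full gene sequence would give a value to compare against the listed GC content of 70%.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence (60 bases), used as sample input.
first_line = "ATGGAGCGCA ACCCCCGCTC GAACGGGAAG AATCTTCTCG CGGCTGCGGT GATCGGCGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```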