Gene Hlac_2196 details

Gene Information

Locus tag: Hlac_2196
Symbol:
ID: 7401131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2179943
End bp: 2181214
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 70%
IMG OID: 643709268
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_002566843
Protein GI: 222480606
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
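
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which matches the stated length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2179943, 2181214)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields of the record.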


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAGC CAGCCTACAC CCGTCGTGGC GTGCTCCGAA CGATCGGGAG TGGATCCGCG 
GTCGGATCGC TCGCGACGCT CGGGACCGGA ACGGGCGCGG CGACGCGAAC CGCCGACGAA
ACCGTGACCG TCAACGTTGG ATACGCGTCG GAGAGCGGCC GGAGCGCCGC GCTGTCTCGC
GCGTCGGCGG TGGACCACGA GTTCGGCTTC GACGCGCTCA CAGCCCAGAT GCCCGAGTCG
GAGACCGACG CGCTGGCGGC CCAACGCGAG ATCCGGTACG TCGAGGAGGA CCGCGCTCTC
GGCGGTATCG TTACGCCGCG GAGGCAACCC CGCCGTCAGG TCGACGGACA GCGCGTCCCG
TACGGCGTCG CCGTGACGGG GGCCGACGTG GCCGCGGAAC ACGGCTACAC CGGTAGCGGA
GCGAACGTCG CGGTACTCGA CACCGGTATC GACAGCACGC ACCCCGACCT CGCGACGAAC
CGGGGGAGAG GGGCCGCGTT CGTCCCGAGC GTCAGCGACA TCGAGGAGTT CGACTTCCCG
GCCGGCCAGG ACGACGACAT CCTCGTCTCG CACGGCACGC ACGTCGCCGG CGTGGTCGGC
GCGAACGACA ACGACACCGG TATCGTCGGT GTCAGTCCCG CGGCGACTCT CCACGCCGTG
AAAGTCCTTA TCGGCGTGCT CGGCGGCGGC TCGGCGGCCG GCATCGCGGC GGGCGTGGAG
TTCGTGGCTG ATCAGGGATG GGATGTCGCA AACCTCAGCT TAGGGGAGAC CGGACGGGTC
GACGTGCTCG CGGACGCGGT AGCGGACGCC TACGAGCGCG GAGTCCTCCT CGTGGCGGCG
GCCGGAAACG ATGGCCTCCT CGAAAACGAC CCGCCCGCGA GTGACGGAGA ATCGGCGGTT
TCCTACCCGG CGGCCTTCGA GGAAGTCATC GCCGTTGGCG CGACCGACCG GAACGACGAC
CTCGCGGCGT TCTCCTCGGT CGGTCCGGAG GTCGAACTCG CCGCCCCCGG CACGGACGTG
CTGTCCACGG CCCTGCCGAT CAACATCTAC TCGAACGTGG AGGAGCCGTT CAACCGCTAC
ATCGAACTGT CTGGGACCTC TTTCGCGGCC CCACACGTCG CCGGCGCGGG CGCACTGTTA
ATGAGCGATG TCGGCCTCTC GAACGTTGAG GCCCGCGAGC GTCTCCGGGA GACCGCGGCC
GACGTGGGCC TGCGCGAGGA CGAACAGGGC TACGGCCGCC TCGACGTCGC GGCCGCACTC
GGTGTCGAGT GA
 
Protein sequence
MTEPAYTRRG VLRTIGSGSA VGSLATLGTG TGAATRTADE TVTVNVGYAS ESGRSAALSR 
ASAVDHEFGF DALTAQMPES ETDALAAQRE IRYVEEDRAL GGIVTPRRQP RRQVDGQRVP
YGVAVTGADV AAEHGYTGSG ANVAVLDTGI DSTHPDLATN RGRGAAFVPS VSDIEEFDFP
AGQDDDILVS HGTHVAGVVG ANDNDTGIVG VSPAATLHAV KVLIGVLGGG SAAGIAAGVE
FVADQGWDVA NLSLGETGRV DVLADAVADA YERGVLLVAA AGNDGLLEND PPASDGESAV
SYPAAFEEVI AVGATDRNDD LAAFSSVGPE VELAAPGTDV LSTALPINIY SNVEEPFNRY
IELSGTSFAA PHVAGAGALL MSDVGLSNVE ARERLRETAA DVGLREDEQG YGRLDVAAAL
GVE
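
The sequences above are printed in groups of ten characters separated by spaces and line breaks, so they need to be joined before any computation. The following is a minimal sketch, assuming plain-text input like the blocks above, of how one might clean such a block and compute a GC fraction to compare against the GC content field. The helper names are hypothetical, and the fragment used here is only the first two groups of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two groups of the gene sequence, used only as a short demonstration.
fragment = "ATGACAGAGC CAGCCTACAC"
print(len(clean_sequence(fragment)))
print(round(gc_fraction(fragment), 2))
```

Passing the complete gene sequence block to `gc_fraction` would give the value to compare with the reported GC content.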