Gene Hhal_0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0102 
Symbol 
ID4710595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp117829 
End bp118836 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID639854559 
Productputative periplasmic protease 
Protein accessionYP_001001698 
Protein GI121996911 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAACG AATACCTGCT GTTCCTTGCC CAGACCGCCA CGGTGGTACT GGCCATTCTG 
CTCGTGCTTA CCGCGGTGGT CCGGCTGCGC CAGGAAGGGG GCAGTGCGCC GGGGCGGCTG
CAGGTGCGTC CGCTCAACGG CGTCTACCGC CAACGGGCGC AGGCCCTGCG TCGTGCCGGT
GAGCAGGCCT CCTGGCGCGG CCGGGTGCGC AAGACGCTGC GGCGCCAGGC GTCGGAGACG
CCGCCTGCGG AGTTGCCGGA CAAGCGGATC TACGTCCTGG AGTTCCGCGG CGATATCCGG
GCGCGCGCTG TGGAAGGGCT TCGGGAGGAG ATCACGGCGG TCATTGCCGC GGCCCGCCCT
GGGCAGGACG AGGTCATCCT GCGTCTGGAG AGCCCCGGCG GGGGCGTGCC CGCGTACGGA
CTGGCGGCCT CGCAACTGGC GCGCCTGCGT GAGGCGGGGA TCCATCTGAC AGTATGCGTT
GACCGCGTGG CCGCCAGCGG CGGTTATCTC ATGGCGGTGG TCGGGGATCG GATCGTGGCG
GCCCCCTTCG CGCTGATCGG ATCCATCGGC GTGGTCGGGA GCCTGCCCAA CTTCCACCGC
TGGTTGCGCA ACCGCGACAT CGATTTCGAG CAGCACACGG CGGGTCCCTA CAAGCGGACC
CTGACAGTCT TCGGGGAGAA CACCGAGGCG GATCGAGAGC GCTTTCGCGA GGACCTGGGC
CATATCCACG AGCAGTTCAA GGGATTCCTG CGGCGCTACC GTCCGCAGCT GGATGTCGAG
ACGGTGGCAA CCGGGGAGTT CTGGCTGGCT GAGCGAGCCC TGGAAGCGGG GCTGATCGAC
GCCCTGCAGA CCAGCGACGA CTGCATCATG GCCCAGCGCG AGCAGGCGCA CCTGCTGGAG
GTCGATTATC GTCAGCGGGA GGGCTGGTCC CAGCGCCTGA CCCAGGTCAC CGAGCGGTTG
CTGGGGCAGC GCAGCGGCAT CGACCGGCTG GGGCCAGATC TGGAGTAA
 
Protein sequence
MLNEYLLFLA QTATVVLAIL LVLTAVVRLR QEGGSAPGRL QVRPLNGVYR QRAQALRRAG 
EQASWRGRVR KTLRRQASET PPAELPDKRI YVLEFRGDIR ARAVEGLREE ITAVIAAARP
GQDEVILRLE SPGGGVPAYG LAASQLARLR EAGIHLTVCV DRVAASGGYL MAVVGDRIVA
APFALIGSIG VVGSLPNFHR WLRNRDIDFE QHTAGPYKRT LTVFGENTEA DRERFREDLG
HIHEQFKGFL RRYRPQLDVE TVATGEFWLA ERALEAGLID ALQTSDDCIM AQREQAHLLE
VDYRQREGWS QRLTQVTERL LGQRSGIDRL GPDLE