Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0102 |
Symbol | |
ID | 4710595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 117829 |
End bp | 118836 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639854559 |
Product | putative periplasmic protease |
Protein accession | YP_001001698 |
Protein GI | 121996911 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAACG AATACCTGCT GTTCCTTGCC CAGACCGCCA CGGTGGTACT GGCCATTCTG CTCGTGCTTA CCGCGGTGGT CCGGCTGCGC CAGGAAGGGG GCAGTGCGCC GGGGCGGCTG CAGGTGCGTC CGCTCAACGG CGTCTACCGC CAACGGGCGC AGGCCCTGCG TCGTGCCGGT GAGCAGGCCT CCTGGCGCGG CCGGGTGCGC AAGACGCTGC GGCGCCAGGC GTCGGAGACG CCGCCTGCGG AGTTGCCGGA CAAGCGGATC TACGTCCTGG AGTTCCGCGG CGATATCCGG GCGCGCGCTG TGGAAGGGCT TCGGGAGGAG ATCACGGCGG TCATTGCCGC GGCCCGCCCT GGGCAGGACG AGGTCATCCT GCGTCTGGAG AGCCCCGGCG GGGGCGTGCC CGCGTACGGA CTGGCGGCCT CGCAACTGGC GCGCCTGCGT GAGGCGGGGA TCCATCTGAC AGTATGCGTT GACCGCGTGG CCGCCAGCGG CGGTTATCTC ATGGCGGTGG TCGGGGATCG GATCGTGGCG GCCCCCTTCG CGCTGATCGG ATCCATCGGC GTGGTCGGGA GCCTGCCCAA CTTCCACCGC TGGTTGCGCA ACCGCGACAT CGATTTCGAG CAGCACACGG CGGGTCCCTA CAAGCGGACC CTGACAGTCT TCGGGGAGAA CACCGAGGCG GATCGAGAGC GCTTTCGCGA GGACCTGGGC CATATCCACG AGCAGTTCAA GGGATTCCTG CGGCGCTACC GTCCGCAGCT GGATGTCGAG ACGGTGGCAA CCGGGGAGTT CTGGCTGGCT GAGCGAGCCC TGGAAGCGGG GCTGATCGAC GCCCTGCAGA CCAGCGACGA CTGCATCATG GCCCAGCGCG AGCAGGCGCA CCTGCTGGAG GTCGATTATC GTCAGCGGGA GGGCTGGTCC CAGCGCCTGA CCCAGGTCAC CGAGCGGTTG CTGGGGCAGC GCAGCGGCAT CGACCGGCTG GGGCCAGATC TGGAGTAA
|
Protein sequence | MLNEYLLFLA QTATVVLAIL LVLTAVVRLR QEGGSAPGRL QVRPLNGVYR QRAQALRRAG EQASWRGRVR KTLRRQASET PPAELPDKRI YVLEFRGDIR ARAVEGLREE ITAVIAAARP GQDEVILRLE SPGGGVPAYG LAASQLARLR EAGIHLTVCV DRVAASGGYL MAVVGDRIVA APFALIGSIG VVGSLPNFHR WLRNRDIDFE QHTAGPYKRT LTVFGENTEA DRERFREDLG HIHEQFKGFL RRYRPQLDVE TVATGEFWLA ERALEAGLID ALQTSDDCIM AQREQAHLLE VDYRQREGWS QRLTQVTERL LGQRSGIDRL GPDLE
|
| |