Gene Information |
Locus tag | Haur_4090 |
Symbol | |
ID | 5735949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5223030 |
End bp | 5223905 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641281242 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001546850 |
Protein GI | 159900603 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
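The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(5223030, 5223905)
    print(length)                  # 876, matching the Gene Length field
    print(protein_length(length))  # 291, matching the Protein Length field
```

Passing the gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the GC content field.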
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0153565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTAC AAGAATTTTC GAGCGAACTA GCTGATGCCG TGGAACAGAG CGGGGCAGGC ATCGTGACGA TCTATGCTCG CCGTCGCCAA AGTGCCAGCG GGATCGTTTG GCAAGCTGAT TTGGTGCTGA CCGCTGATCA TGTCATCCAA CGCGATGAGC ATATCAAGGT GGTTGGGCCA GATGGCACGG AGTATCAAGC CCACGTCGTT GGTCGCGACC CGAGCAGCGA TGTGGCGTTG CTGCGCGTGC CCAACGCCAA TTTTCCCCCC GCAACTTTGG CCAAAACCGA GCCACGGGTT GGTCAGTTGG CCTTGGCAGT CGGTCGTCCA AGCACGGTAC AAGCCAGTTT TGGTATTATT AATGCGATTG GCGGCCCAGT CCCAACCCGC CGTGGCACAT TAGCTCAATA TTTGCGCACC GATGCCACCC CATATCCAGG CTTCTCGGGG GGCGGCTTGG TCAATGTTAA AGGTGAAGTC GTTGGTTTGT TTACTTCCGG CTTTGCAGGC GGCGAGCCAA TTGCAATTCC AGTGGCTGTG CTGACCAGCG TCGCCGATAC CTTGCTCAAC CATGGCCGCG TGCGCCGTGG CTTTATCGGG ATTGCCAGCC AAACTGTCAA TTTGCCCGAT AACCAACGCG CTGGCCGCAA CCAAGCGACT GGTCTGTTGG TGATTAGCGT CGAAGCCGAT AGCCCAGCTC AACACGCTGG CTTGTTGGTT GGCGATATTT TGGTGGGCTT AGATGGCCAT GAATTGGGTG AACCACGCGA TTTGCAAATG CTCTTGGCAA GCGATCGAGC GGGCAAGACT GTAACGCTTG ATGTGCTACG AGCTGGCCAA TTGCAAAACC TGAGCATCAC AATTGGAGCA AAATAG
|
Protein sequence | MSLQEFSSEL ADAVEQSGAG IVTIYARRRQ SASGIVWQAD LVLTADHVIQ RDEHIKVVGP DGTEYQAHVV GRDPSSDVAL LRVPNANFPP ATLAKTEPRV GQLALAVGRP STVQASFGII NAIGGPVPTR RGTLAQYLRT DATPYPGFSG GGLVNVKGEV VGLFTSGFAG GEPIAIPVAV LTSVADTLLN HGRVRRGFIG IASQTVNLPD NQRAGRNQAT GLLVISVEAD SPAQHAGLLV GDILVGLDGH ELGEPRDLQM LLASDRAGKT VTLDVLRAGQ LQNLSITIGA K
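The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted. The names below are hypothetical, and the sketch deliberately ignores alternative start codons because this gene begins with ATG.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed.
    Alternative start codons are NOT remapped to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the gene sequence in this record.
    print(translate_cds("ATGAGCTTAC AAGAATTTTC G"))  # MSLQEFS
```

Applied to the full gene sequence, the same function should produce a string that can be compared, after removing the spaces, with the protein sequence listed in the record.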