Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0718 |
Symbol | |
ID | 4711059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 802972 |
End bp | 804246 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855181 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001002302 |
Protein GI | 121997515 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTAG CACACGCGAT GCAGCATCGA CGCGCGCCGC GGATGGTGGT GGCGGCGCTG GCGACCGCTC TGATCGGCCT GGTGGCCACC CCGGCGCTGG CCGACGACCT GGAGCACCTC CAGCCGGATG AGCGCAACAC GGTGGAGATC TTCCAGCGCT ACGGCCCCTC GGTGGTGGCC ATCGAGGTCG AGGTCCGCGG CGAGCGGGTC GACCCCTTCG ACCGCATCCC CGAAGGGATG CTCCCGCGGG AATTCCGCGA GTTCTTCGAG CGCCGGCAGC AGCCCCGCGA GGACTCGCCC CGCCGCCAGG GCGCCGGGAG TGGCTTTCTG GTCGATGATG CGGGGCATAT CGTCACCAAC TACCACGTCA TCCGCAACGC CCTGGAGGAG GAGAGCGTGG ATCTGCGCGA GGGCGCTTCC CTGAAGCTCA GCTTCGCCGA GCACGAAGCG GTGCCGGCCC GAGTGGTGGG CGCCAACGCG CTCTACGACC TGGCCCTGCT CAAGCCCGAG GATCCGGACA GCATCCCGGA CGGCGCCGAG CCGCTGCCGC TGGCCGACTC GGATCAGACC CTGGTGGGCC AGAAGACCAT CGCCATCGGC AACCCCTTCG GGCTGAGCTC TACGGTGACC ACGGGGATTG TCTCCGGTGT CGGCCGGGAT CTGCCGGGGA TCGGGCAGAT CGAGATCCCC ATGATCCAGA CCGATGCGGC GATCAATCCG GGCAACTCCG GCGGGCCGCT GCTCAACTCC GCCGGAGAGG TGATCGGTGT GAACACCGCC ATCGTGCCGG GCGGCGGCGG GCTGACCGGC CGGGGCGGCT CGGTGGGCGT GGGCTTTGCC GTGCCCAGCA ATCTGCTCCA GGAGAGCCTG GACGAGATGG AGGAGGGCGG TCTGACGGAT CTGACCTCCC GGGCCCGTCT GGGGGTGATG GTGGCCGGTC TGCAGGGCTA TCCGGAGGGG GTCCGCGAGC GGCTGAATCT CCCGGAGCGC GGGGTGATGG TGGTGGATGT CGAGTCGGGC AGTCCGGCCG AGGAGGCGGG GCTGCAGGGG GCGTCGTTCG AGGTGAGTGT CGAAGGGCGT GCCATGCCGG CCGACGGGGA TGTCATCACC CACGTGAACG GTGAGGCGGT GTCGGAGCCG CGGGAGTTGC AGCGGTTGGT CTTCGCCCGT CGGGCGGGGG ATGCGGTGAC GCTGACGGTC CTGCGCGACG GTGAGGAGCG GAAGTTCGAG GTGGAGCTCC GTGAGGTGCC GCGGGAGCAG CAGCGGCGCC GGTAG
|
Protein sequence | MTVAHAMQHR RAPRMVVAAL ATALIGLVAT PALADDLEHL QPDERNTVEI FQRYGPSVVA IEVEVRGERV DPFDRIPEGM LPREFREFFE RRQQPREDSP RRQGAGSGFL VDDAGHIVTN YHVIRNALEE ESVDLREGAS LKLSFAEHEA VPARVVGANA LYDLALLKPE DPDSIPDGAE PLPLADSDQT LVGQKTIAIG NPFGLSSTVT TGIVSGVGRD LPGIGQIEIP MIQTDAAINP GNSGGPLLNS AGEVIGVNTA IVPGGGGLTG RGGSVGVGFA VPSNLLQESL DEMEEGGLTD LTSRARLGVM VAGLQGYPEG VRERLNLPER GVMVVDVESG SPAEEAGLQG ASFEVSVEGR AMPADGDVIT HVNGEAVSEP RELQRLVFAR RAGDAVTLTV LRDGEERKFE VELREVPREQ QRRR
|
| |