Gene Information |
Locus tag | Hhal_0037 |
Symbol | |
ID | 4710926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 38779 |
End bp | 40248 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854495 |
Product | protease Do |
Protein accession | YP_001001634 |
Protein GI | 121996847 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTACAG TCAAACGGGG GCCGGCGCGG ATCACGCTGA CGGCCGGGCT AGGGCTGGGG GTCAGCGCCC CCGTCGGTGC GCTCGAACTG CCGGACTTCA CCGAACTGGT TCGCGAGAAC AGCGCGGCTG TGGTCAACAT CAGTACCCGC CAGGAGGCTC CGGGTCAGTC GGCCCCGGAG GAGATGCTGC CCGAAGGGAT CGAGCCCCAG CAGGGGCCGC CGCCGGCCTC TGGCCTGCGT CCCGTTTCTG CGCCGCTGGA CCCGCTGCAG ACCCCCGACC TGGAAGGATC CCGGCCCGGA CCGGATCCGT TCGGTGACGA CGGCCAATCC CTCGGCTCGG GCTTCCTGAT CAGCGACGAT GGAGTGATCC TGACTAATCA CCACGTCGTG GCACGGGCCG ACGAGGTCAT TGTCCGCCTC TCGGATGGTC GCGAACACGA CGCCGACGTC GTAGGCTCGG ATGAGCGCAC CGACCTGGCG GTGGTCGAGA TCGATACGGA CGACGAGCTG CCGACGGTGT CCGTGGGCAG TGCCGAAAAG CTCGAGGTTG GCGAATGGGT GCTGGCCATC GGCTCGCCCT TCGGCTTCGA ACATTCGGTC ACCGCCGGCA TTGTCTCCGC CAAGGGGCGC TCGTTGCCCC ACGGGAACTA CGTGCCCTAC ATCCAGACCG ATGTGGCCAT CAACCCGGGG AACTCCGGCG GGCCGCTGTT CAATCTGGAG GGCGATGTGG TCGGGGTCAA TTCGCAAATC TACAGCCGCA CCGGCGGCTT CATGGGGCTG TCGTTCGCGA TCCCCATCGA GTTGGCCATC GATGTGGCGG AGCAGCTCCA GGCCACCGGC GAGGTCGAGC GCGGCTGGCT CGGCGTGCTC ATCCAGGATC TCACCCGAGA CCTTGCCGAG GGGTTCGGCC TTGAGCGTCC ACGCGGGGCT CTGGTTTCGG AACTGCTCGA CCATAGCCCG GCTGCCGAGG CCGGCATCGA GAGCGGCGAC GTCATCCTCG AGTTTGACGG CGAGGTGGTG GAGAACTCTG CGACCCTGCC GCCCATGGTG GGGCGTACCA GCATTGGGCG GACCGTCGAG TTGCTGATCC TGCGGGATGG TGAGGAGAAG ACCCTGGAGG TCGAGGTGGC GGAGTTGCCG GGCGAGGATG AACTCGCCGC CCCGTCGGAC GCCGTAGAGG GCGGCGGTGG CAGTGATCTG GGCCTGCAGG TGGAGCCGGT GGACGACACC ACCCGACAGC AGTTGGAGCT AGACGACGAG GGCGGGGTGC TGATCACGTC CGTCGAGGAG GGGCCGGCGG CCGACGCAGG CTTGCAGGTC GGCGACGTGT TGGTGAGCTT CGATCGCCAG CCGGTTCACT CCGCGGAGGA CCTCAATGAC GGTGCGGCTG CCGCCGACCC CGGTAGCACC GTGCCGGTGC TGGTCATTCG CGACGGGCAC CCGAGTTTTC TGGCATTGCA AATGCCCTGA
|
Protein sequence | MGTVKRGPAR ITLTAGLGLG VSAPVGALEL PDFTELVREN SAAVVNISTR QEAPGQSAPE EMLPEGIEPQ QGPPPASGLR PVSAPLDPLQ TPDLEGSRPG PDPFGDDGQS LGSGFLISDD GVILTNHHVV ARADEVIVRL SDGREHDADV VGSDERTDLA VVEIDTDDEL PTVSVGSAEK LEVGEWVLAI GSPFGFEHSV TAGIVSAKGR SLPHGNYVPY IQTDVAINPG NSGGPLFNLE GDVVGVNSQI YSRTGGFMGL SFAIPIELAI DVAEQLQATG EVERGWLGVL IQDLTRDLAE GFGLERPRGA LVSELLDHSP AAEAGIESGD VILEFDGEVV ENSATLPPMV GRTSIGRTVE LLILRDGEEK TLEVEVAELP GEDELAAPSD AVEGGGGSDL GLQVEPVDDT TRQQLELDDE GGVLITSVEE GPAADAGLQV GDVLVSFDRQ PVHSAEDLND GAAAADPGST VPVLVIRDGH PSFLALQMP
|
| |
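The coordinate, length, GC content, and sequence fields in this record can be checked against each other. The following is a minimal sketch of such a check, not part of the original record. The helper names (`clean`, `gene_length`, `gc_fraction`, `translate`) are hypothetical, and it assumes the gene sequence is given in coding orientation, as it is here (it begins with ATG and ends with TGA even though the gene lies on the minus strand). NCBI translation table 11 assigns the same amino acid to each codon as the standard code, so a standard codon table is used.

```python
# Standard codon-to-amino-acid assignments. NCBI translation table 11
# shares these assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spacing used in the record (blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Coordinates from the record: Start bp 38779, End bp 40248.
length = gene_length(38779, 40248)
# One codon is the stop codon, so it contributes no amino acid.
expected_protein_length = length // 3 - 1
```

With the record's coordinates, `length` is 1470 and `expected_protein_length` is 489, matching the Gene Length and Protein Length fields. Passing the Gene sequence field to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.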