Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD0833 |
Symbol | htrA |
ID | 2737731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | + |
Start bp | 798286 |
End bp | 799779 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637173009 |
Product | protease DO |
Protein accession | NP_966586 |
Protein GI | 42520671 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0653734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTA AGGCATTTAT TTTATCTATA TTTGCATATT TTCTAATTGC GTTTTCTTCG TATGCTAATA TGTTTGATTG GAATGCAAAA AAGGTCGTTG ATGCTAGCAC TACTGCATGC AACTGTAATC AAGGACTTGC CGATTTAGTG GAAGAACTCA TTCCTGCGGT TGTAAATATT TCAAGCGAAC AAATCATCAA ACAAGAAAAT AACAGCAGAA CTAGAGTTCC ATCTATGCCA GGAAATAATT TTTTTGATGA TTTTAGAGAG TTTTTTGAGC ATTTTGATCA GTTTTTTATG GATAGGGGCC CTAGTGTTAA CAAAGAGGTG GTGTTGCTTG GTTCTGGGTT TATTATAGAT AAAGGTGGAA CTATAGTAAC CAATTATCAC GTTATTAAAA ACGCCAAAGA TATCACAGTT ACTATGAACG ATAATACTTA TTTCAAAGCA GAAGTTTTAG GCTATGATGC AAGAACTGAT CTTGCTGTGC TTAAGATTAA TTCTGATAAA GATCTTTCTT CTGTTGCATT TGGTGATTCT GATAAAGCAA GGGTTGGTGA TACGGTTATG GCAATAGGTA ACCCATTTGG TTTGGGTGGC TCTGTAAGCA CAGGAATTAT ATCTGCAAGG TCCAGGGACA TTAGTATTGG CACTATGAAT GAATTTATTC AAACTGATGC TGCAATTAAT AGGGGTAATT CTGGAGGACC GTTATTTGAT TTAAATGGAA AAGTTATAGG TATTAATACT GCTATCTATT CTCCATCTGA GTCTGGCGGC AACGTGGGTA TAGGTTTTGC CATACCATCT AATTTAGCTA TGTCAATTAT TGACACATTA AAAAGTGGCA AAAAAATAAA ACATGGTTGG CTTGGTGTGC AAGTTCAGCC TATAACGAAA GAATTTGCTG AGTCCTTGGG TTTAAAAGAT ATAAAAGGCG CATTGGTTGC AAGTATAGTA AAGGATAGTC CTGCAGAAAA AGGTGGGATT AAAGTAGGTG ATATATTATT AGAATTTGAC GGTAAAAAAA TCGATAGAAT GACACAATTA CCTCAAATGG TTTCAAGAGC TGGACCTGAG AAAAAAGTGC AAGTTAAGTT ACTTAGAAAG AGCAAAGAGG TTAATATTAA AGTTGTGATC GGAGAATCTA CAAATGATGG CCAAGATAAC AATCAAGAAG AAAATAAATC AACATCTGAT TATGTAACCG GTTTAACTGT TTCAAATCTG CCAAAAGAAT CAAAAGAAAG TAAAAATAAT GTACCTACAA AAGGTGTGAT AGTTACCAAT GTAGATAGTA ACAGTAATGC CACGTTGCGT GGTATTAAAA AAGGAGACAT TATTATCCAA TTAGATGGAA CCGATATAGA AAATACTAAT GATTTTCAAA AACAAGTTGA TTCAGCAGTA AAGAAAAACG GTAAAGATTC AATAATGTTG CTCATTTACC GCAATGGAAA TCAATTCTTT ACTTCGATAA AGTTGAAGAA ATAG
|
Protein sequence | MKSKAFILSI FAYFLIAFSS YANMFDWNAK KVVDASTTAC NCNQGLADLV EELIPAVVNI SSEQIIKQEN NSRTRVPSMP GNNFFDDFRE FFEHFDQFFM DRGPSVNKEV VLLGSGFIID KGGTIVTNYH VIKNAKDITV TMNDNTYFKA EVLGYDARTD LAVLKINSDK DLSSVAFGDS DKARVGDTVM AIGNPFGLGG SVSTGIISAR SRDISIGTMN EFIQTDAAIN RGNSGGPLFD LNGKVIGINT AIYSPSESGG NVGIGFAIPS NLAMSIIDTL KSGKKIKHGW LGVQVQPITK EFAESLGLKD IKGALVASIV KDSPAEKGGI KVGDILLEFD GKKIDRMTQL PQMVSRAGPE KKVQVKLLRK SKEVNIKVVI GESTNDGQDN NQEENKSTSD YVTGLTVSNL PKESKESKNN VPTKGVIVTN VDSNSNATLR GIKKGDIIIQ LDGTDIENTN DFQKQVDSAV KKNGKDSIML LIYRNGNQFF TSIKLKK
|
| |