Gene WD0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0833 
SymbolhtrA 
ID2737731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp798286 
End bp799779 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content34% 
IMG OID637173009 
Productprotease DO 
Protein accessionNP_966586 
Protein GI42520671 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0653734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTA AGGCATTTAT TTTATCTATA TTTGCATATT TTCTAATTGC GTTTTCTTCG 
TATGCTAATA TGTTTGATTG GAATGCAAAA AAGGTCGTTG ATGCTAGCAC TACTGCATGC
AACTGTAATC AAGGACTTGC CGATTTAGTG GAAGAACTCA TTCCTGCGGT TGTAAATATT
TCAAGCGAAC AAATCATCAA ACAAGAAAAT AACAGCAGAA CTAGAGTTCC ATCTATGCCA
GGAAATAATT TTTTTGATGA TTTTAGAGAG TTTTTTGAGC ATTTTGATCA GTTTTTTATG
GATAGGGGCC CTAGTGTTAA CAAAGAGGTG GTGTTGCTTG GTTCTGGGTT TATTATAGAT
AAAGGTGGAA CTATAGTAAC CAATTATCAC GTTATTAAAA ACGCCAAAGA TATCACAGTT
ACTATGAACG ATAATACTTA TTTCAAAGCA GAAGTTTTAG GCTATGATGC AAGAACTGAT
CTTGCTGTGC TTAAGATTAA TTCTGATAAA GATCTTTCTT CTGTTGCATT TGGTGATTCT
GATAAAGCAA GGGTTGGTGA TACGGTTATG GCAATAGGTA ACCCATTTGG TTTGGGTGGC
TCTGTAAGCA CAGGAATTAT ATCTGCAAGG TCCAGGGACA TTAGTATTGG CACTATGAAT
GAATTTATTC AAACTGATGC TGCAATTAAT AGGGGTAATT CTGGAGGACC GTTATTTGAT
TTAAATGGAA AAGTTATAGG TATTAATACT GCTATCTATT CTCCATCTGA GTCTGGCGGC
AACGTGGGTA TAGGTTTTGC CATACCATCT AATTTAGCTA TGTCAATTAT TGACACATTA
AAAAGTGGCA AAAAAATAAA ACATGGTTGG CTTGGTGTGC AAGTTCAGCC TATAACGAAA
GAATTTGCTG AGTCCTTGGG TTTAAAAGAT ATAAAAGGCG CATTGGTTGC AAGTATAGTA
AAGGATAGTC CTGCAGAAAA AGGTGGGATT AAAGTAGGTG ATATATTATT AGAATTTGAC
GGTAAAAAAA TCGATAGAAT GACACAATTA CCTCAAATGG TTTCAAGAGC TGGACCTGAG
AAAAAAGTGC AAGTTAAGTT ACTTAGAAAG AGCAAAGAGG TTAATATTAA AGTTGTGATC
GGAGAATCTA CAAATGATGG CCAAGATAAC AATCAAGAAG AAAATAAATC AACATCTGAT
TATGTAACCG GTTTAACTGT TTCAAATCTG CCAAAAGAAT CAAAAGAAAG TAAAAATAAT
GTACCTACAA AAGGTGTGAT AGTTACCAAT GTAGATAGTA ACAGTAATGC CACGTTGCGT
GGTATTAAAA AAGGAGACAT TATTATCCAA TTAGATGGAA CCGATATAGA AAATACTAAT
GATTTTCAAA AACAAGTTGA TTCAGCAGTA AAGAAAAACG GTAAAGATTC AATAATGTTG
CTCATTTACC GCAATGGAAA TCAATTCTTT ACTTCGATAA AGTTGAAGAA ATAG
 
Protein sequence
MKSKAFILSI FAYFLIAFSS YANMFDWNAK KVVDASTTAC NCNQGLADLV EELIPAVVNI 
SSEQIIKQEN NSRTRVPSMP GNNFFDDFRE FFEHFDQFFM DRGPSVNKEV VLLGSGFIID
KGGTIVTNYH VIKNAKDITV TMNDNTYFKA EVLGYDARTD LAVLKINSDK DLSSVAFGDS
DKARVGDTVM AIGNPFGLGG SVSTGIISAR SRDISIGTMN EFIQTDAAIN RGNSGGPLFD
LNGKVIGINT AIYSPSESGG NVGIGFAIPS NLAMSIIDTL KSGKKIKHGW LGVQVQPITK
EFAESLGLKD IKGALVASIV KDSPAEKGGI KVGDILLEFD GKKIDRMTQL PQMVSRAGPE
KKVQVKLLRK SKEVNIKVVI GESTNDGQDN NQEENKSTSD YVTGLTVSNL PKESKESKNN
VPTKGVIVTN VDSNSNATLR GIKKGDIIIQ LDGTDIENTN DFQKQVDSAV KKNGKDSIML
LIYRNGNQFF TSIKLKK