Gene WD1237 details


Gene Information

Locus tag: WD1237
Symbol: clpA
ID: 2738148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 1183023
End bp: 1185329
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 11
GC content: 34%
IMG OID: 637173388
Product: ATP-dependent Clp protease, ATP-binding subunit ClpA
Protein accession: NP_966949
Protein GI: 42521034
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID:
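
The coordinates and lengths listed above agree with one another under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(1183023, 1185329)
print(length_bp)                  # 2307
print(protein_length(length_bp))  # 768
```

Both results match the Gene Length and Protein Length fields, which suggests the assumed conventions hold for this record.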


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTCCA AAAACTTAGA GGCAAGTTTG AATAGAGCAC TATTCATTGC TTCTAATTTT 
AATCTTAAAT ATGCGAAAGT AGAACATTTA TTGCTGGCGT TAACTAAAGA CGTGGATGTA
AATTACGTTT TATCAAGATG TAATATCAGA GCCGATGAAA TCATCAGTAT GGATAACATT
CTATCAAGAT GCAATATTAA GGCTAATGAT TATAATAATA ATATAAAAGG GTTTTTACAA
AGCAGTTCTG AATTGATTAT CAGTGGAGTT AAACCTAGCT CAATGTTTCA ATGCATAATA
CACAGAGCAA TAATACGGGC TCATAGCTTG GGGAAAAAAG AAATAAATGG AGCAAGTGTT
CTAGTAGAAA TTTTGTCTGG GCAAGATTTA TACGTTGAAG ATCTATTGCA GAAACAAAGT
GCGAAAGATT CCAATTTGAT TTACCACATA TCTAACATGA AATACTCTAA CGATATAGAT
GAATACACCA CTAATCATAA AGTAAAACTT GATAAAAATA ATGATGCTCC TACTACAGTA
AATAAAGGTG AGCTACTAAA AGATGAGGAA ATTCTACAAA GTTATTGTAA AAATTTAAAT
GACTATGCAA GAAGCAAAAA AATAGATTAT GTTATTGGTC GTGATTATGA ATTAAATCGC
ACTATAGAAA TATTATTGAG GCGTAGAAAA AACAACCCTT TATATGTTGG AGATCCAGGT
GTTGGCAAAA CCACAATAGT TGAGGGTTTG GTACTCAAAA TCATTGAAAA CAGTGTTCCA
AGTGCGCTCA GGTCCAGTAT AATTTATGCT TTGGATTTAG GGTCACTCCT TGCAGGGACA
CGCTATAGAG GTGATTTTGA AGAGAGAATA AAATCTATAA TAAAAGCGAT TGAGGCAAAA
CCAGGTGCTA TCCTTTTTAT TGACGAAATA CACACTATCA TTGGAGCAGG TTCTACAAGT
GGTAGCTTTC TTGATGCAGG TAACCTACTC AAACCTGCGC TTGCAAGAGG TACACTGCGT
TGTATAGGTG CAACTACATA TAGAGAATAC AGCAATAGTT TTGAAAAAGA TAAAGCATTA
GCGAGGAGGT TTCAAAAAAT TAATGTTAAG GAATCTTCTG TTAACGAAAC GATAAAGATA
CTAGATGGTA TAAAGCACTA CTATGAAGGG TATCATGGAG TATATTATAC AAAGAATGCC
ATTAGGTCTG CGGCTGAACT TTCGCATAAG TATATTACTG GGCGAATATT ACCTGATAAA
GCGGTTGATG TTATTGATGA GGCAGGGGCA TATTGTAAAT TGCTAAGAAA CAGGGGTAAA
ATTGTAAATA GTAAAGATAT TAAGAATACC ATTACTAGAA TTACAAATGT GCCTTGCGGA
TCTGAATCTG ATGATTTGCA GAAAGTAAAG TCTTTAAAAG CTAATCTAGA GAAGGTGATT
TTTGGCCAAG AGCAGGCAAT AGAATCTCTT GTTAATTCTA TTAAAATTGC TAAATCTGGG
TTGAGAAATT ACAATAAGCC TTTAGCAAAT TATCTTTTTG CAGGGCCAAC CGGTGTTGGT
AAAACCGAGC TAGCAAAACA ATTAGCAGAA AGCATGGGCA TGAATCTTAT ACGCTTTGAT
ATGTCCGAAT ATATTGAGTC TCATACGATA TCAAGGATGA TTGGTTCTCC TCCTGGATAT
GTAGGTTATG ATCAGGGTGG ATTACTCACA GAGTCTGTAT CTAATAATCA ATATAGTGTT
GTGCTGCTTG ATGAAATTGA AAAGGCTCAT AGCGATATCT ATAATATATT GCTACAAATT
ATGGATTATG GTTGCGTTAC AGACACTTAC GGACGTAAAG TTAACTTTTC CAATATAATC
TTAATTATGA CAACTAATGC AGGAGCAGCT GAACGCAGCA AAAGTTTTGT TGGCTTTGGG
CATAAAAAAT TTAACACTGG TGATAGTGAA AAAGCGATAG AACAGGTTTT TAGCCCTGAA
TTTCGTAATC GTCTTGATGC GATTATTTCT TTTTCTGACT TAAATGCGGA TATAATTCTG
CATATTGTAG ATAAATTTAT TCAGGAACTG AAAAAGCAGC TGACGCAAAA AGGCATAAAC
TGCTTGGTAG AAGATGAAGT GAAATCTTAT CTTGCACAAA CAGGTTATAG TAAGGAAATG
GGAGCACGTC CGATAGAGAG ACTTATTGAA AAAGAGATAA AAAGTTACTT AGCGGAAGAA
ATACTGAATC GTAAATTAAT AAAAGGAAAG AAATTAAGAA TCTATATGAA TAAAGTAGAA
AATAAAATTG CTTTTGATAT AGTTTAA
 
Protein sequence
MISKNLEASL NRALFIASNF NLKYAKVEHL LLALTKDVDV NYVLSRCNIR ADEIISMDNI 
LSRCNIKAND YNNNIKGFLQ SSSELIISGV KPSSMFQCII HRAIIRAHSL GKKEINGASV
LVEILSGQDL YVEDLLQKQS AKDSNLIYHI SNMKYSNDID EYTTNHKVKL DKNNDAPTTV
NKGELLKDEE ILQSYCKNLN DYARSKKIDY VIGRDYELNR TIEILLRRRK NNPLYVGDPG
VGKTTIVEGL VLKIIENSVP SALRSSIIYA LDLGSLLAGT RYRGDFEERI KSIIKAIEAK
PGAILFIDEI HTIIGAGSTS GSFLDAGNLL KPALARGTLR CIGATTYREY SNSFEKDKAL
ARRFQKINVK ESSVNETIKI LDGIKHYYEG YHGVYYTKNA IRSAAELSHK YITGRILPDK
AVDVIDEAGA YCKLLRNRGK IVNSKDIKNT ITRITNVPCG SESDDLQKVK SLKANLEKVI
FGQEQAIESL VNSIKIAKSG LRNYNKPLAN YLFAGPTGVG KTELAKQLAE SMGMNLIRFD
MSEYIESHTI SRMIGSPPGY VGYDQGGLLT ESVSNNQYSV VLLDEIEKAH SDIYNILLQI
MDYGCVTDTY GRKVNFSNII LIMTTNAGAA ERSKSFVGFG HKKFNTGDSE KAIEQVFSPE
FRNRLDAIIS FSDLNADIIL HIVDKFIQEL KKQLTQKGIN CLVEDEVKSY LAQTGYSKEM
GARPIERLIE KEIKSYLAEE ILNRKLIKGK KLRIYMNKVE NKIAFDIV
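
The gene and protein sequences above can be checked against each other, and against the GC content field, with a few lines of Python. The sketch below is illustrative only: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may serve as starts, which does not matter here because the gene begins with ATG). Only the first line of the gene sequence is embedded to keep the example short; pasting in the full sequence works the same way.

```python
from itertools import product

# Codon -> amino acid map for translation table 11 (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "ATGATTTCCA AAAACTTAGA GGCAAGTTTG AATAGAGCAC TATTCATTGC TTCTAATTTT"
print(translate(first_line))  # MISKNLEASLNRALFIASNF
```

The printed residues match the first twenty residues of the protein sequence (MISKNLEASL NRALFIASNF). Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.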