Gene WD0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0472 
Symbol 
ID2737952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp454779 
End bp455879 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content36% 
IMG OID637172673 
ProductAAA family ATPase 
Protein accessionNP_966258 
Protein GI42520343 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0845041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGAAATTTC AGAATTACTA AAAAGAATTT CAATCTGTTT ACTATGCGTG 
GCAACTATTG TCTCTATCGC TTTTTTGTTT GCCTATATGC ATACAATGTT GCTTAGTAAA
GGGTATGAGT TAGCTGCACA TTATGTTAGT GAAGCTATGC CTTGTGTGTT ACTTGTCGTT
CTCCTTTTTA TTGTTTGGTT TGTATATGAT CAAGTATTCA GCAAACTAAA GGAAGTAGAG
CTGCCTATAT CGATAGAATT AGCTAATTCA GATGATAAAA GAATAACATT TGCTGATGCT
ATAATTGATG ACTCCTTAAA ACAACGGTTG CAGATGATTT GCTGTGACCA AATGACAGAA
GAAATGCGCA AGCTGTTTGG AAATAAAAGT ATTAATTCAC TAAGAGGTTA CATATTATAT
GGCCCTCCTG GAAATGGTAA AACACTTATC GCTCGTGCAA TTGCAGGTGA ATCAAATATG
AATTTCATAA GCATCTCAGG TCCTGAACTT ATTGGAGTAT ATATTGGTCA TGGTGCACAT
GCTGTACGCG AGCTTTTTAA AATAGCAAAA AAATATTCTC CTTGTATAGT CTTTATAGAT
GAAATAGATG CAGTTGCACA AAAAAGAAGT ACCGCTAATA ACTCAGCTTA CCATTGTCGA
GAGAGTTTAA CACAGTTGTT AACTGAAATA GATGGATTTA AGAGCAGAAA AGATATAATA
GTAATTGGTG CAACTAACCT TATCGGTGGT ATAGATCCAG CACTTATTAG ACCTGGACGT
TTAGGTCAGA AGGTTTATGT CCCTAATCCA AACATAGAGG TGAGACAAAA GATATTGGCA
CTTTATATGC GAGGTACAAA AACTGATGAA AAGCTAAGCC TCCAAAACAT TGCAGACAAG
ACTGAAGGCT ATTCTGGGGC TGAGTTAGAG CAATTAGTGA ATGAAGCAAA AATTAGCGCT
GGTGCGCAAA GACGGCTTAT AGTGAGTGAG GAAGATTTCA GTTACGCGCT TCACAGGTTA
AGTCCAGAAC AAGAAAGAGA TAGAATAAAG TTAGTTTCAA ACGCGAAGAC ACAGATAGAA
GAAACTTCTT TTATACGGTA A
 
Protein sequence
MKKIEISELL KRISICLLCV ATIVSIAFLF AYMHTMLLSK GYELAAHYVS EAMPCVLLVV 
LLFIVWFVYD QVFSKLKEVE LPISIELANS DDKRITFADA IIDDSLKQRL QMICCDQMTE
EMRKLFGNKS INSLRGYILY GPPGNGKTLI ARAIAGESNM NFISISGPEL IGVYIGHGAH
AVRELFKIAK KYSPCIVFID EIDAVAQKRS TANNSAYHCR ESLTQLLTEI DGFKSRKDII
VIGATNLIGG IDPALIRPGR LGQKVYVPNP NIEVRQKILA LYMRGTKTDE KLSLQNIADK
TEGYSGAELE QLVNEAKISA GAQRRLIVSE EDFSYALHRL SPEQERDRIK LVSNAKTQIE
ETSFIR