Gene WD0441 details


Gene Information

Locus tag: WD0441
Symbol:
ID: 2738017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 424704
End bp: 425912
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 33%
IMG OID: 637172645
Product: ankyrin repeat-containing protein
Protein accession: NP_966230
Protein GI: 42520315
COG category:
COG ID:
TIGRFAM ID:
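
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed length) and that the CDS ends in a single stop codon. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 424704
end_bp = 425912


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1209 402
```

Both results agree with the Gene Length and Protein Length values listed in the record.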


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGGGA TATTTATGAA GAATATTTTA TATTTTATAT TATTAGCTGT CGTATTCGGA 
TCACCCATCC TGTTTGCCGT AGAGCAGCTT GAAGAAAAGA AAATTGAATC AGTTCATAAA
AACGAAAATG TATGTGTAAG GAAGCAAGAT AGCACAAATC AAAAAGATGA GTTAAAAAGC
AACGTCGATG CTAAACAAAC ACAAGAGGCT GAGAGCAAAA GATTAGACGG CACTACCGAT
AAAGAGAAAC TGCAACACAA CGAGAACAAA GCTTTCGTGG TTGAAAAAAC TATAAGTGAA
GGTGAAAGGA TTGATAAAGA TTTGCCAAAT GACCAGCTAG AAGGAAGCAC AGACAAATTT
GCTCAAAATT TACCAAATGT GGCCAATAAA GAAGTGAATA AAGATTTGAA ACCAGAGCCT
TTGCCTTTAA GTGCTGATTT AAATGAGAAT ACAGCTAACC CGCAAAAAAA TCTACAAGCT
GATCAGAAGA TTGATATAAA AGATAATGAA CTTTCAAAAA GTGATGCGAG TCAATTATTA
GAGGGAAAAA AAGAAAAAGT AGAGAATCAG TCTGAAGAAA AAAAAGTGAA AGAAACAAAC
AGTAATTCCA AGGACCGCAA TAGAGTAAAA CCTATAACTA AAAAAGATGA AGAAGAGCAA
AGTGAAAAGA AAAGTTTACA AAAATGGACA AAGCTAAACA GAGAACCAAT AAAAGAATGG
GGTCATAAAG ACATACAAAG CAAGTCAATA TATAAACGAC AATATGATAG CCTTAATGAG
CATCTTCCTA CAACTGTGTT TATTGATGAT TACAGTAAGC AATTTTTTTA CTGCATTAAG
AAGAACAACT TAACTTGCTT AAGAGGAGTA ATAAGTAAGC TAGAAAAAAT TGGATTAACA
ATTCAAGAGA TACTAAGGTT TAGAAACAAA TTGGGTGATA CTCCTCTCAT TTATTCAGTT
AAACAAGGTG AGGTAGACAT AGTGCGCTTT CTCTTATTAC AAGGTGCTGA TCTTAGAGTA
GTTAACAATA ATTTTCAATC CCCAATTGAT ATAGCAATCG AAAAAAAGCA GATCAATATA
ATAAATGCGA TTGCCGAAAT GATGCCACAT CTTTTGGAGG ATAGAAAAAT AGACAATAAA
GAAAGCTCAG CAATGTACGA TTGGGCTGTG AAAACGAAAG AAATACAGTG CGATAAGCAA
GATGATTAG
 
Protein sequence
MLGIFMKNIL YFILLAVVFG SPILFAVEQL EEKKIESVHK NENVCVRKQD STNQKDELKS 
NVDAKQTQEA ESKRLDGTTD KEKLQHNENK AFVVEKTISE GERIDKDLPN DQLEGSTDKF
AQNLPNVANK EVNKDLKPEP LPLSADLNEN TANPQKNLQA DQKIDIKDNE LSKSDASQLL
EGKKEKVENQ SEEKKVKETN SNSKDRNRVK PITKKDEEEQ SEKKSLQKWT KLNREPIKEW
GHKDIQSKSI YKRQYDSLNE HLPTTVFIDD YSKQFFYCIK KNNLTCLRGV ISKLEKIGLT
IQEILRFRNK LGDTPLIYSV KQGEVDIVRF LLLQGADLRV VNNNFQSPID IAIEKKQINI
INAIAEMMPH LLEDRKIDNK ESSAMYDWAV KTKEIQCDKQ DD
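
The protein sequence can be re-derived from the gene sequence using the listed translation table. The sketch below is one hedged way to do that in plain Python. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE`, `clean`, and `translate` are illustrative. The sketch does not handle ambiguous bases or alternative start codons; this gene begins with ATG, so the latter does not arise here.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTTGGGA TATTTATGAA GAATATTTTA"))  # MLGIFMKNIL
print(translate("GATGATTAG"))                         # DD
```

The two outputs match the first ten and last two residues of the protein sequence listed in the record.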