Gene WD0766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0766 
Symbol 
ID2738857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp738997 
End bp740421 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content36% 
IMG OID637172942 
Productankyrin repeat-containing protein 
Protein accessionNP_966522 
Protein GI42520607 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.705844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATG ACAAATTTAT GGAAATATTA AAAAAAATAA ATGACCTTTC AGATTTGAGC 
AAGGACAATA TAGTTGAAAA AATAAAAGCT AAGTTACAAG AAGAAGATCC AGATTTATGT
CAAAAGTGGG AAAAGAGTAA ACCTGACAAC GATAGTGGAT CTGGCATAAA TTACATATTT
ACTATATCTC GTGGTCAAAA TTCTCAGGAA GTCAAATTGT TACATTTTGC TTCTTATTGG
AACTGTGCAA ATGTAGCAAA AGCTCTGATT GAAAACGGGG CAGATATTAA TGCAGAACAT
GATAATAAAA TTACTCCTTT ACATCTTGCT GCTCACTATG GCCACAAAGA GATAGTACAA
GTTCTATCAA AAGCAGAAGG AATCAACGTT GATGCAAAAG ATAGTGATGG GTTGACTCCT
TTACATCTTG CTACTGCAAA TAGCCATAAG GATGTAGTAG AAACTCTAAT TGCAAACAAA
GTAAATGTTA ATGCAGAAGA TGATGATAGA TGTACACCTT TACATCTTGC TGCTGAAGCG
AACCACATAG AGGTAGTAAA AATTCTAGTT GAGAAAGCAG ATGTTAATAT AAAGGATGCT
GATAGATGGA CTCCTTTGCA TGTTGCTGCT GCAAATGGCC ATAAGGATGT AGTAGAAACT
CTAATTGCAA ACAAAGTAAA TGTTAATGCA GAAGATGATG ATAGATGTAC ACCTTTACAT
CTTGCTGCTG AAGCGAACCA CATAGAGGTA GTAAAAATTC TAGTTGAGAA AGCAGATGTT
AATATAAAGG ATGCTGATAG ATGGACTCCT TTGCATGTTG CTGCTGCAAA TGGCCACGAA
GATGTAGTAA AAACTCTAAT CGCAAAAGGA GCAAAGGTTA AGGCAAAAAA TGGTGATAGA
CATACTCCTT TACATTTTGC TGCTCAAAAT GGCCACGAAG GTATAGTAAA AGTTCTGCTA
GAAGCTGGAG CAGACCCTTC ATTAAAAGAT GTTGATGGAA AAACGCCAAG AGACCTCACT
AAAGATCAAG GTATAATTCA GCTTTTAGAG GAAGCGGAAA AAAAGCAAAC GTTAAAAAAT
GAGAATAAAA AAACGCCAAA GGATCTTACT GAAAATAAAG ATGTAATGCA GCTTCCAGAG
AAAAAGGAAG AAAAACAAAT TGGAAAAAAT GCAATTGTGA AAGAAAAAGA ACAATCTGCA
AAAAATGCAA TTGTAAAAGG TGTTATTGTG TGTTTTGTAA CTGCAGTGAT AGTTGGTGTT
GCACTTGCAT TTGCTACTGC CCTATCTGTA CCAGCAATAA TTGGACTAGC TGCAGGATCT
GCGCTCATAG TTGGTGCTGG TCAATATATA ATGTCAAAGC CTAAACCTGA AATGAAAGAA
GTAAAGGAAC CTGTGCCTAG AGAGACAGAA AAAGCACTTA CTTGA
 
Protein sequence
MKYDKFMEIL KKINDLSDLS KDNIVEKIKA KLQEEDPDLC QKWEKSKPDN DSGSGINYIF 
TISRGQNSQE VKLLHFASYW NCANVAKALI ENGADINAEH DNKITPLHLA AHYGHKEIVQ
VLSKAEGINV DAKDSDGLTP LHLATANSHK DVVETLIANK VNVNAEDDDR CTPLHLAAEA
NHIEVVKILV EKADVNIKDA DRWTPLHVAA ANGHKDVVET LIANKVNVNA EDDDRCTPLH
LAAEANHIEV VKILVEKADV NIKDADRWTP LHVAAANGHE DVVKTLIAKG AKVKAKNGDR
HTPLHFAAQN GHEGIVKVLL EAGADPSLKD VDGKTPRDLT KDQGIIQLLE EAEKKQTLKN
ENKKTPKDLT ENKDVMQLPE KKEEKQIGKN AIVKEKEQSA KNAIVKGVIV CFVTAVIVGV
ALAFATALSV PAIIGLAAGS ALIVGAGQYI MSKPKPEMKE VKEPVPRETE KALT