Gene WD1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD1012 
Symbol 
ID2738223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp974993 
End bp976186 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content39% 
IMG OID637173169 
ProductHK97 family phage portal protein 
Protein accessionNP_966738 
Protein GI42520823 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.982443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTCA ATATTTTTCA AAGAAAAAAA AGCGCAGTAT TTACTAAATA TTCTGCTTTG 
CAGCTAATGA TGGAGCCAAG TTGGAGTAAG CGTGATTATG TAAGTTTTGC TGAGGAAGGT
TACATAAAAA ATGTTATTGC CTTTCGAGCA ATTAATATGA TTGCAAGTGC TGCATCTTCG
GTACCTTTTA CTCTCTGCCA GCTTACTGAA CAGGGAAAAT CGCAATTAAA AGCCCATCCA
TTACTGAAAT TACTTTATTC TCCTAATCCA ATGACATCAA AATCGGAATT TATTGAGGGG
ATTGTAACTT ATCGGTTAGT TAACGGCAAT TCTTATATAT TGATGGTTGA ATCACAGAAT
AGTAGAAAAC CACCAACAGA GCTTTATCTT CTGCGCCCCG ATAGGGTTGA AATTGTTCCA
GGGAGAAATA ACGTTCCTTA TATCTATCGT TATACCATAA ATAACAACAG TTATGACTTT
AAAGTTGATA AACTAACTGG ATGTTCAGCA GTGTTGCACT TAAAGACCTT TAATCCTTTG
AATGATTGGT ATGGGTTATC ACCAATTGAG GCAGCTGCAT ATAGCATAGA TCAACATAAT
CAGGCGGGTG CTTGGAATCA AGCGATGTTG CAAAATGGGG CAAGACCAAG TGGTGCAATA
GTTGTAAAAT CAGCAAAAGA TGGAAGTGGT GGAAGTTTAA GTCAAGAGCA GTACCAACGC
TTAAAAGCGC AGATAAATGA TCATTATTCA GGTTCTATAA ACGCTGGAAG ACCGATATTG
CTTGAAGGAG GCTTGGAGTG GAAAGAAATG AGCTTATCGC CAAGGGATAT GGATTTTATT
GAGTCCAAAC ACAGCTCAGC TCGTGATATT GCGCTAGCTT TTGGCGTTCC ACCGCAATTG
CTTGGCATAC CGGGTGATAA CACTTATAGC AATTTAGTCG AAGCACGCTT ATCCCTCTGG
GAGCAGACGG TTTTACCAAC GCTAGAAAAT ATTATCTGTC ACCTGAATTC TTGGCTGACA
CCAAGATTTG GTGAAGATCT GTGCTTGTCG TATGACAAAG ATGCAATAGA AATTCTCATG
GAAAAAAGAC AAAAGTTGTG GAAATACGTA GAAAACGCAA GCTTCATGAC GCTCAACGAA
AAGAGAGAAG CATTTGGCTT GCCACCACTG CCAGGTGGTG ATGAGTTAGG TTAA
 
Protein sequence
MNLNIFQRKK SAVFTKYSAL QLMMEPSWSK RDYVSFAEEG YIKNVIAFRA INMIASAASS 
VPFTLCQLTE QGKSQLKAHP LLKLLYSPNP MTSKSEFIEG IVTYRLVNGN SYILMVESQN
SRKPPTELYL LRPDRVEIVP GRNNVPYIYR YTINNNSYDF KVDKLTGCSA VLHLKTFNPL
NDWYGLSPIE AAAYSIDQHN QAGAWNQAML QNGARPSGAI VVKSAKDGSG GSLSQEQYQR
LKAQINDHYS GSINAGRPIL LEGGLEWKEM SLSPRDMDFI ESKHSSARDI ALAFGVPPQL
LGIPGDNTYS NLVEARLSLW EQTVLPTLEN IICHLNSWLT PRFGEDLCLS YDKDAIEILM
EKRQKLWKYV ENASFMTLNE KREAFGLPPL PGGDELG