Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD1012 |
Symbol | |
ID | 2738223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | - |
Start bp | 974993 |
End bp | 976186 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637173169 |
Product | HK97 family phage portal protein |
Protein accession | NP_966738 |
Protein GI | 42520823 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.982443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTCA ATATTTTTCA AAGAAAAAAA AGCGCAGTAT TTACTAAATA TTCTGCTTTG CAGCTAATGA TGGAGCCAAG TTGGAGTAAG CGTGATTATG TAAGTTTTGC TGAGGAAGGT TACATAAAAA ATGTTATTGC CTTTCGAGCA ATTAATATGA TTGCAAGTGC TGCATCTTCG GTACCTTTTA CTCTCTGCCA GCTTACTGAA CAGGGAAAAT CGCAATTAAA AGCCCATCCA TTACTGAAAT TACTTTATTC TCCTAATCCA ATGACATCAA AATCGGAATT TATTGAGGGG ATTGTAACTT ATCGGTTAGT TAACGGCAAT TCTTATATAT TGATGGTTGA ATCACAGAAT AGTAGAAAAC CACCAACAGA GCTTTATCTT CTGCGCCCCG ATAGGGTTGA AATTGTTCCA GGGAGAAATA ACGTTCCTTA TATCTATCGT TATACCATAA ATAACAACAG TTATGACTTT AAAGTTGATA AACTAACTGG ATGTTCAGCA GTGTTGCACT TAAAGACCTT TAATCCTTTG AATGATTGGT ATGGGTTATC ACCAATTGAG GCAGCTGCAT ATAGCATAGA TCAACATAAT CAGGCGGGTG CTTGGAATCA AGCGATGTTG CAAAATGGGG CAAGACCAAG TGGTGCAATA GTTGTAAAAT CAGCAAAAGA TGGAAGTGGT GGAAGTTTAA GTCAAGAGCA GTACCAACGC TTAAAAGCGC AGATAAATGA TCATTATTCA GGTTCTATAA ACGCTGGAAG ACCGATATTG CTTGAAGGAG GCTTGGAGTG GAAAGAAATG AGCTTATCGC CAAGGGATAT GGATTTTATT GAGTCCAAAC ACAGCTCAGC TCGTGATATT GCGCTAGCTT TTGGCGTTCC ACCGCAATTG CTTGGCATAC CGGGTGATAA CACTTATAGC AATTTAGTCG AAGCACGCTT ATCCCTCTGG GAGCAGACGG TTTTACCAAC GCTAGAAAAT ATTATCTGTC ACCTGAATTC TTGGCTGACA CCAAGATTTG GTGAAGATCT GTGCTTGTCG TATGACAAAG ATGCAATAGA AATTCTCATG GAAAAAAGAC AAAAGTTGTG GAAATACGTA GAAAACGCAA GCTTCATGAC GCTCAACGAA AAGAGAGAAG CATTTGGCTT GCCACCACTG CCAGGTGGTG ATGAGTTAGG TTAA
|
Protein sequence | MNLNIFQRKK SAVFTKYSAL QLMMEPSWSK RDYVSFAEEG YIKNVIAFRA INMIASAASS VPFTLCQLTE QGKSQLKAHP LLKLLYSPNP MTSKSEFIEG IVTYRLVNGN SYILMVESQN SRKPPTELYL LRPDRVEIVP GRNNVPYIYR YTINNNSYDF KVDKLTGCSA VLHLKTFNPL NDWYGLSPIE AAAYSIDQHN QAGAWNQAML QNGARPSGAI VVKSAKDGSG GSLSQEQYQR LKAQINDHYS GSINAGRPIL LEGGLEWKEM SLSPRDMDFI ESKHSSARDI ALAFGVPPQL LGIPGDNTYS NLVEARLSLW EQTVLPTLEN IICHLNSWLT PRFGEDLCLS YDKDAIEILM EKRQKLWKYV ENASFMTLNE KREAFGLPPL PGGDELG
|
| |