Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD0458 |
Symbol | |
ID | 2738126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | - |
Start bp | 439020 |
End bp | 440219 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637172659 |
Product | HK97 family phage major capsid protein |
Protein accession | NP_966244 |
Protein GI | 42520329 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.830326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTTA CCGATATCGC TCACCGTATC AATGAACTCG CTTCATCATG GGAGCAATTT AAATTAATAA ATGATCGCAA ACTAAAAGAA ATTGAAAGCA AAGGGCGTGC TGATTCTGCA ACAATTGAGC AGCTATGCAA GGTAAATAAT GCCATTGATA GTTGCAAAGA GCGTTTAGAC TTGATCGAAA CTGCAGCTCA ACGCCCAGAA GTAAATACAG ATTTTAGCAC AAGCGATAAA TATTTTTCTG ATTATATCCG CAAGGGAATG GAAAGCGGTT TATCACACAA AACCCTCAGT GGAGATGACA GTGATATCGG GGGGTATCTA GTTACTCCGC ATATTGTAAA ACGCATAAAC AAGCGCGTAA CTGATTCGTC CCCAATGCGA CAAATATGCT CCAGTCAAAG AATCTCTACT GAAACATTGG ATTACATTAT AGAAGATTTT GACCGCGCCG GTGCAGGTTG GAGCAGTGAA ACAGTAGATG ATGAGGACGG TGGCAATAAG TCTAAGTATG ATTTTGCAAA AGATACGGAC ACGCCCAAAA TCCAAAAGAT TTCCGTCACA ACTTACGAGT TATATGCTCA ACCACAAATA TCACAAAAGT TACTCGATGA TGCGTTTGTC GATGTTGAGA GTTGGCTGGT GGAAAAGATT GCCGAGACTT TTAGTAAGGA AGAAAGCGAA GCCTTCATTA AGGGTGAGGG CACTTTTCAA CCAAAAGGAA TTTTAGCTTA TGAAAATGGA AACAGTTATA ATAAAATAGA GCAAGTTAAA ACTGAAAAAT TAGATAGTGA TTCAATAATG ATGTTGTATT ACTCTCTGGA CGAATATTAT TCCAAAAATG CGTCATTTTT GATGAACAGG AGTACGTTGA AGGATATCAG GCTGCTAAAA TCTCAAACAG GTCAGTATCT CTGGCAGCCA AGTCTGTCGC TTGAAGCTCC AGATACTTTA ATGGGAATAC CAGTATATCA ATCTGCCGAT ATGCCACCAG CGCCAAACAA TCAGCTACCA GTAATTGCGA TGGCAGATTT CAAACAAGCT TATAAGATTG TAGATAACAG AGGAATGAGA ATATTAAGAG ACCCTTATAC GAATAAACCT TATGTGAGGT TTTTTGTCAC TAAGCGTGTC GGCGGAGAGG TTGTAAACAC CAGTGCTATT AAATTGTTGA AAATTGCGAG CAAGTACTAA
|
Protein sequence | MSLTDIAHRI NELASSWEQF KLINDRKLKE IESKGRADSA TIEQLCKVNN AIDSCKERLD LIETAAQRPE VNTDFSTSDK YFSDYIRKGM ESGLSHKTLS GDDSDIGGYL VTPHIVKRIN KRVTDSSPMR QICSSQRIST ETLDYIIEDF DRAGAGWSSE TVDDEDGGNK SKYDFAKDTD TPKIQKISVT TYELYAQPQI SQKLLDDAFV DVESWLVEKI AETFSKEESE AFIKGEGTFQ PKGILAYENG NSYNKIEQVK TEKLDSDSIM MLYYSLDEYY SKNASFLMNR STLKDIRLLK SQTGQYLWQP SLSLEAPDTL MGIPVYQSAD MPPAPNNQLP VIAMADFKQA YKIVDNRGMR ILRDPYTNKP YVRFFVTKRV GGEVVNTSAI KLLKIASKY
|
| |