Gene WD0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0458 
Symbol 
ID2738126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp439020 
End bp440219 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content39% 
IMG OID637172659 
ProductHK97 family phage major capsid protein 
Protein accessionNP_966244 
Protein GI42520329 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.830326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTTA CCGATATCGC TCACCGTATC AATGAACTCG CTTCATCATG GGAGCAATTT 
AAATTAATAA ATGATCGCAA ACTAAAAGAA ATTGAAAGCA AAGGGCGTGC TGATTCTGCA
ACAATTGAGC AGCTATGCAA GGTAAATAAT GCCATTGATA GTTGCAAAGA GCGTTTAGAC
TTGATCGAAA CTGCAGCTCA ACGCCCAGAA GTAAATACAG ATTTTAGCAC AAGCGATAAA
TATTTTTCTG ATTATATCCG CAAGGGAATG GAAAGCGGTT TATCACACAA AACCCTCAGT
GGAGATGACA GTGATATCGG GGGGTATCTA GTTACTCCGC ATATTGTAAA ACGCATAAAC
AAGCGCGTAA CTGATTCGTC CCCAATGCGA CAAATATGCT CCAGTCAAAG AATCTCTACT
GAAACATTGG ATTACATTAT AGAAGATTTT GACCGCGCCG GTGCAGGTTG GAGCAGTGAA
ACAGTAGATG ATGAGGACGG TGGCAATAAG TCTAAGTATG ATTTTGCAAA AGATACGGAC
ACGCCCAAAA TCCAAAAGAT TTCCGTCACA ACTTACGAGT TATATGCTCA ACCACAAATA
TCACAAAAGT TACTCGATGA TGCGTTTGTC GATGTTGAGA GTTGGCTGGT GGAAAAGATT
GCCGAGACTT TTAGTAAGGA AGAAAGCGAA GCCTTCATTA AGGGTGAGGG CACTTTTCAA
CCAAAAGGAA TTTTAGCTTA TGAAAATGGA AACAGTTATA ATAAAATAGA GCAAGTTAAA
ACTGAAAAAT TAGATAGTGA TTCAATAATG ATGTTGTATT ACTCTCTGGA CGAATATTAT
TCCAAAAATG CGTCATTTTT GATGAACAGG AGTACGTTGA AGGATATCAG GCTGCTAAAA
TCTCAAACAG GTCAGTATCT CTGGCAGCCA AGTCTGTCGC TTGAAGCTCC AGATACTTTA
ATGGGAATAC CAGTATATCA ATCTGCCGAT ATGCCACCAG CGCCAAACAA TCAGCTACCA
GTAATTGCGA TGGCAGATTT CAAACAAGCT TATAAGATTG TAGATAACAG AGGAATGAGA
ATATTAAGAG ACCCTTATAC GAATAAACCT TATGTGAGGT TTTTTGTCAC TAAGCGTGTC
GGCGGAGAGG TTGTAAACAC CAGTGCTATT AAATTGTTGA AAATTGCGAG CAAGTACTAA
 
Protein sequence
MSLTDIAHRI NELASSWEQF KLINDRKLKE IESKGRADSA TIEQLCKVNN AIDSCKERLD 
LIETAAQRPE VNTDFSTSDK YFSDYIRKGM ESGLSHKTLS GDDSDIGGYL VTPHIVKRIN
KRVTDSSPMR QICSSQRIST ETLDYIIEDF DRAGAGWSSE TVDDEDGGNK SKYDFAKDTD
TPKIQKISVT TYELYAQPQI SQKLLDDAFV DVESWLVEKI AETFSKEESE AFIKGEGTFQ
PKGILAYENG NSYNKIEQVK TEKLDSDSIM MLYYSLDEYY SKNASFLMNR STLKDIRLLK
SQTGQYLWQP SLSLEAPDTL MGIPVYQSAD MPPAPNNQLP VIAMADFKQA YKIVDNRGMR
ILRDPYTNKP YVRFFVTKRV GGEVVNTSAI KLLKIASKY