Gene WD0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0301 
SymbolcoxA 
ID2738695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp280476 
End bp282026 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content38% 
IMG OID637172514 
Productcytochrome c oxidase, subunit I 
Protein accessionNP_966102 
Protein GI42520187 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.249737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACG TACCAAAGGG CATAAAGCGT TGGTTGTTTT CCACCAATCA TAAAGATATA 
GGGACACTGT ACATTATTTT TTCCATATTA GCTGGAATTA TTGGTGGATT ATTATCGGTG
ATTATTCGCA CTCAGCTAAT GCACATTAAT ATACTTAACA ATAACTATCA ATTATATAAC
GTAATGGTTA CAGGGCATGC GTTGATAATG GTGTTTTTTA TGATAATGCC AGCCCTCATG
GGAGGATTTG GTAACTGGTT TGTACCTCTC ATGATTGGCG CACCAGATAT GGCATTTCCT
CGTATGAATA ATTTAAGTTT TTGGTTATTA GTGTCATCTT TTATTTTGCT CATTCTCTCA
GTGTTTATTG GTGAAGGTCC AGGTACAGGT TGGACTTTAT ATCCACCTCT ATCACAGGTA
ATGTCCCATC CAAGTGCAGG AGTTGACATT GCTATACTTG CACTTCATGT TGCTGGTATG
TCGTCAATTG TTGGGGCGAT CAACTTTATA GTTACTATAT TTAACATGCG CACAAAAGGA
ATGTCATTAA CTAAGATGCC ACTGTTTGTT TGGTCTGTCT TGCTAACAGC ATTTATGTTG
ATTGTTGCCT TACCAGTGCT TGCCGGTGCT ATAACTATGC TTCTTACTGA TCGCAATATT
GGTACTTCCT TTTTTGATCC TGCCGGTGGC GGCGATCCTG TGTTATTTCA ACATCTATTT
TGGTTTTTTG GTCATCCAGA AGTTTACGTA ATTATTTTTC CTGCATTTGG CATCATAAGT
CAGGTTGTAT CAACTTTTTC TCACAGACCT GTATTTGGTT ACATAGGGAT GGTTTATGCA
ATGATAGGTA TAGCAGTATT TGGCTTTATG GTTTGGGCTC ACCATATGTT CACTGTTGGG
CTTAGTGCTG ACGCTGCTGC ATTTTTTAGC ACTACCACAA TTTTTATCGG TGTTATAACT
GGTGTAAAAG TCTTTAGCTG GATTGCAACT ATGTGGGGTG GAGCAATTGA GTTTAAGACC
CCTATGCTAT TTGCACTAGG TTTTATTTTC ATGTTTGTTG GCGGTGGCAT AACGGGAATA
ATTCTTTCTC ATGGTGGAAT AGATAAGCTC CTGCACGACA CCTATTATGT TGTTGCTCAC
TTCCATTATG TCATGTCACT TGCTGCATTA TTTGGAGCTT TTGCTGGCTT TTATTATTGG
ATTGGTAAAA TGTCGGGTAA ACAATATAAT GAGCGCTTAG GTCAAATCCA CTTTTGGCTT
ACTTTTATTA GCACCAATAT CACTTTTTTA CCTCAACATT TCTTAGGATT AGCTGGTATG
CCAAGGCGTA TACCTGATTA TCCTGATGCG TTTATCCCTT GGAATTATAT ATCCTCAATT
GGTTCGTATA TGTCCTTTGT TTCAGTTATG TTTTTTGTGT TTATAGTTAT ACATCTTTTT
AAATGGGGCA AGAAAGCTGG AGATAATCCT TGGGAAGGTG ACACCTTGGA ATGGACGGTA
TCTTCACCAC CGCCTTTTCA TACTTTCGAA AAGCCACCAG TGGTAAAATA G
 
Protein sequence
MSDVPKGIKR WLFSTNHKDI GTLYIIFSIL AGIIGGLLSV IIRTQLMHIN ILNNNYQLYN 
VMVTGHALIM VFFMIMPALM GGFGNWFVPL MIGAPDMAFP RMNNLSFWLL VSSFILLILS
VFIGEGPGTG WTLYPPLSQV MSHPSAGVDI AILALHVAGM SSIVGAINFI VTIFNMRTKG
MSLTKMPLFV WSVLLTAFML IVALPVLAGA ITMLLTDRNI GTSFFDPAGG GDPVLFQHLF
WFFGHPEVYV IIFPAFGIIS QVVSTFSHRP VFGYIGMVYA MIGIAVFGFM VWAHHMFTVG
LSADAAAFFS TTTIFIGVIT GVKVFSWIAT MWGGAIEFKT PMLFALGFIF MFVGGGITGI
ILSHGGIDKL LHDTYYVVAH FHYVMSLAAL FGAFAGFYYW IGKMSGKQYN ERLGQIHFWL
TFISTNITFL PQHFLGLAGM PRRIPDYPDA FIPWNYISSI GSYMSFVSVM FFVFIVIHLF
KWGKKAGDNP WEGDTLEWTV SSPPPFHTFE KPPVVK