Gene WD0159 details

Gene Information

Locus tag: WD0159
Symbol:
ID: 2737866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 147586
End bp: 148611
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 39%
IMG OID: 637172387
Product: NADH dehydrogenase subunit H
Protein accession: NP_965977
Protein GI: 42520062
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
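
The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(147586, 148611)
print(length, protein_length(length))  # → 1026 341
```

Both results match the Gene Length and Protein Length fields reported for WD0159.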


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAC TAGTTAATAT TTTATTTATT TTAGTACCGC TACTACTTTC AGTTGCATAT 
TTGACATACT TTGAGCGTAA GGTCCTTGCT GCAATTCAAC TAAGGCACGG CCCGAGTGTA
GTTGGACCTT TTGGGCTATT GCAGCCATTT GCAGATGCTA TTAAGCTACT GATTAAAGAG
CCGATAATAC CATTTAGAGC GAGCACCATA CTGTTCATTA TGGCTCCAAT GCTTACCTTT
ATCTTGGCAT TAATTGCCTG GGCAGTTATA CCGTTTGGTG CTGAAGTAAT TGTAGAAAAT
GGCCAGCAAG TAATTATTCC TAAGGTTATA GCAAATATTA ATGTTGGAGT GCTTTACGTG
CTAGCTATAT CGTCGCTGGG AGTATACGGC GTGATTATTG CAGGCTGGTC AAGCAACTCC
AATTATGCAT TCCTTGGCGC TATACGGTCG GCTGCTCAGA TGATTTCATA TGAAGTTTCA
ATAGGCTTAA TAGTTGCTGC AGTCGTTATT ACCACTGGAA CATTAAATCT TGGAGAGATG
GTGGTAGCGA AACACAATAT GCCATTTTGG GTTGATTTGC TACTAATGCC TATAGGAATA
ATATTTTTTA TTTCTTTGCT TGCAGAAACT AATCGTCACC CATTTGATTT ACCAGAAGCT
GAAGCAGAGC TTGTCTCTGG ATATAACGTT GAATATTCAT CCATGCCTTT TGCCCTCTTT
TTTCTTGGAG AATATGCAAA TATGATTCTA GCAAGTGCTA TGATGACGAT ATTCTTTCTA
GGAGGATGGT ATCCGCCGCT GGAGTTCAGT TTACTTTACA AAATTCCAGG TTTAATTTGG
TTCGTTTTGA AAATAGTTAT ACTTTTGTTT ATATTTATTT GGATTAGAGC AACAATACCT
CGTTATCGAT ATGATCAGCT AATGCGCCTT GGTTGGAAAG TATTTCTACC AATATCGGTG
CTTTGGGTGA TACTCATTTC AGGGGTGTTG CTCTTTACTG GGAACTTGCC TGGATCCAAT
GTTTAA
 
Protein sequence
MNTLVNILFI LVPLLLSVAY LTYFERKVLA AIQLRHGPSV VGPFGLLQPF ADAIKLLIKE 
PIIPFRASTI LFIMAPMLTF ILALIAWAVI PFGAEVIVEN GQQVIIPKVI ANINVGVLYV
LAISSLGVYG VIIAGWSSNS NYAFLGAIRS AAQMISYEVS IGLIVAAVVI TTGTLNLGEM
VVAKHNMPFW VDLLLMPIGI IFFISLLAET NRHPFDLPEA EAELVSGYNV EYSSMPFALF
FLGEYANMIL ASAMMTIFFL GGWYPPLEFS LLYKIPGLIW FVLKIVILLF IFIWIRATIP
RYRYDQLMRL GWKVFLPISV LWVILISGVL LFTGNLPGSN V
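
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal Python sketch. It uses only the standard library and rests on several assumptions: the sequence is pasted as text with the 10-base spacing shown above, the sense-codon assignments of translation table 11 are the same as in the standard genetic code, and alternative start codons are not handled (this gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above, with the display spacing kept.
print(translate("ATGAACACAC TAGTTAAT"))  # → MNTLVN
```

The output matches the first six residues of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_content` allows the results to be compared against the listed protein sequence and the GC content field.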