Gene WD1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD1239 
SymbolpyrD 
ID2737683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp1186489 
End bp1187556 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content35% 
IMG OID637173391 
Productdihydroorotate dehydrogenase 2 
Protein accessionNP_966951 
Protein GI42521036 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAAAA ATAAACTAAA AATAAGCAAA GTGTTTTACA AACGTAATTT ACTATTTTTA 
CTACCGCCCG AAGTTGCTCA CTCTTTGGCA ATTATGGCAT TAAAGAAAAT GCCTTATAAG
AATCCTATAG AGCTACCAGA ATCTTTGAGT GTGAATTTTT TTGGTAATAA GCTCAGAAGC
CCCGTAGGTC TGGCTGCAGG TTTTGACAAG AATGCAGAAG TTATAAGGCC TATGCTCTCA
TTTGGTTTTG GGTTTATTGA AACTGGTACT GTAACTCGTA ATCCACAATA TGGAAACAAA
AAGCCAAGAA TTTTTCGGTT AATTAAAGAT CAAGGGGTAA TTAACAGATT GGGATTTAAC
AATAAAGGAA TAGACTATTT TCTTAAACAA ATAGGTGAAA CCAAGCTTGA TGACTGCATT
TTTGGCATCA ACATAGGAAA AAACAGTACA TCAAAGGACC AAATCAGCGA TTATGTTGAC
TTAATAAAGA TAGTATATGG AAAGAGCAAT TATATAGTGC TGAACATCTC ATCCCCAAAC
ACGCCTAATT TACGCAATCT GCACAATAAG CAAGAATTAT CGGAATTGTT GAAATCCGTA
ACTCTAACCC GAAAATCAAT TGATAATTCT AAATCCATAC CAATAATATT AAAAATCTCA
CCAGATGTAG ATCAGCAAAC GAAAGAAAAT ATCGCTGAGC TTGCGTTGGA ATATAAGATT
GACGGATTAA CAGTAAGCAA CACTACGGTA AGTAGAGATA ATCTGCATTC TCACCATAAT
GAGAGTGGTG GGTTGAGTGG CAAACCGCTG TTTAAACTTT CAACCGAGTT ATTGGGCGAT
ATGTACAAAT TTACTAAGGG CAAAATATTA TTGATAGGGT GCGGAGGAAT CTCAAGTGGT
GCTGATGCAT ATAAAAAAAT AAAGGCAGGA GCTTCTTTGG TGCAGTTGTA CACTGCTCTC
ATATACCACG GACCTCAAGT TGTAAACAAA ATTAATCTAG AACTTGCAGA ACTAATAAGG
AGAGATGGAT TTAGTAACAT TAATGAGGTG GTGGGTTGTA TACATTAA
 
Protein sequence
MVKNKLKISK VFYKRNLLFL LPPEVAHSLA IMALKKMPYK NPIELPESLS VNFFGNKLRS 
PVGLAAGFDK NAEVIRPMLS FGFGFIETGT VTRNPQYGNK KPRIFRLIKD QGVINRLGFN
NKGIDYFLKQ IGETKLDDCI FGINIGKNST SKDQISDYVD LIKIVYGKSN YIVLNISSPN
TPNLRNLHNK QELSELLKSV TLTRKSIDNS KSIPIILKIS PDVDQQTKEN IAELALEYKI
DGLTVSNTTV SRDNLHSHHN ESGGLSGKPL FKLSTELLGD MYKFTKGKIL LIGCGGISSG
ADAYKKIKAG ASLVQLYTAL IYHGPQVVNK INLELAELIR RDGFSNINEV VGCIH