Gene WD1024 details


Gene Information

Locus tag: WD1024
Symbol: purM
ID: 2738042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 985655
End bp: 986692
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 35%
IMG OID: 637173180
Product: phosphoribosylaminoimidazole synthetase
Protein accession: NP_966749
Protein GI: 42520834
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
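
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(985655, 986692)
print(length_bp, protein_length(length_bp))
```

With the coordinates from this record, the sketch gives 1038 bp and 345 aa, which agrees with the Gene Length and Protein Length fields.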


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACTT ATACCAGATC AGGAATAGAT ATTGAACTAT ATAATAAGTT AATAAAAGAA 
GTCAAGCCTA TTGCTCAAGA AACTACTAGA GAAGAAGTAA TCAGCGAAAT AGGTTCATTT
TCTGCGTTAT TTGATTTTGC TGCACTAAGT AAGAAGTATG ACCATCCAGT ACTCGTTTCC
TCAACTGATG GAGTAGGTAC GAAACTGTTG ATAGCTCAAG AAGTGAATAA ACATGATACT
ATAGGTATAG ATTTAGTTGC AATGTGTGTA AATGACTTAC TTGCACAAGG AGCAACGCCT
TTGTTTTTCC TTGATTACTT TGCAACAGGC GTTTTGACCA AAGATGTTTT ATTATCTGTG
GTTAAGGGCA TTGCAGAGGG GTGCAAGCAA GCTAAAATAG CATTGGTTGG TGGGGAAACT
GCAGAAATGC CTGGAATGTA TGGTAATAAT CACTATGACC TTGCAGGGTT TGTGGTTGGT
GTAGTTGATC GAAAGCAAAT TCTTCCAAAC TGTAGTATGA TGAAAGCAGG TGATTATATA
GTTGGCTTAG AGTCAAGTGG AATTCACTCA AATGGGTTTT CTTTAGTGCG CCATGTTTTC
AAAAGCTTAG GTATAAATTA TAACGATACA TCTCTATGGA ATAATAAATC TTGGAGTGAA
ATACTACTTG AACCAACAAA AATATATGTT GATTCTTTGC TGCCTATCAT GTCACAAGTA
AAAGGTATTG CGCACATCAC GGGTGGTGGT TTGGTAGACA ATATTCCGCG AATTCTTCCA
AAAAACTTAT TTGCAAACAT AGACATTAAT TCCTGGAAAT GGCCAGATAT ATTTTTATGG
CTAACAAAGG AGGGTAAAAT AGAGAAGAAA GAAATGCTAA AAACATTTAA TTGTGGTATT
GGTATGGTAT TGATCGTAAG TTCTGAGAAT ATGCAAAACG TGAAAAATCA TTTCCAAAAA
CGTGGAGAAA AAATTGAAAT TATTGGAAAA CTTGATGAGG CATGTAACTC TCCACTTGAT
AGAGTAGTAT TTAGTTAA
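
The GC content field can be recomputed from a sequence printed in blocks of ten bases, as above. This is a minimal sketch, assuming the sequence is pasted as a plain string and that only A, C, G and T occur; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    # Remove the whitespace that separates the 10-base blocks and the lines.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, used only as sample input.
first_line = "ATGAATACTT ATACCAGATC AGGAATAGAT ATTGAACTAT ATAATAAGTT AATAAAAGAA"
print(f"{gc_fraction(first_line):.0%}")
```

Running the function on the complete gene sequence rather than a single line gives the value to compare against the GC content field.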
 
Protein sequence
MNTYTRSGID IELYNKLIKE VKPIAQETTR EEVISEIGSF SALFDFAALS KKYDHPVLVS 
STDGVGTKLL IAQEVNKHDT IGIDLVAMCV NDLLAQGATP LFFLDYFATG VLTKDVLLSV
VKGIAEGCKQ AKIALVGGET AEMPGMYGNN HYDLAGFVVG VVDRKQILPN CSMMKAGDYI
VGLESSGIHS NGFSLVRHVF KSLGINYNDT SLWNNKSWSE ILLEPTKIYV DSLLPIMSQV
KGIAHITGGG LVDNIPRILP KNLFANIDIN SWKWPDIFLW LTKEGKIEKK EMLKTFNCGI
GMVLIVSSEN MQNVKNHFQK RGEKIEIIGK LDEACNSPLD RVVFS
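
The protein sequence can be derived from the gene sequence with the translation table named in the record. The sketch below assumes that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts) and that the sequence contains only A, C, G and T; `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    bases = "".join(sequence.split()).upper()  # drop block and line separators
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAATACTT ATACCAGATC AGGAATAGAT"))  # MNTYTRSGID
```

The first three blocks of the gene sequence translate to MNTYTRSGID, matching the first block of the protein sequence above.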