Gene WD0867 details


Gene Information

Locus tag: WD0867
Symbol: purH
ID: 2738441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 838119
End bp: 839630
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 35%
IMG OID: 637173040
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: NP_966617
Protein GI: 42520702
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
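
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(838119, 839630)
print(length_bp)                  # 1512
print(protein_length(length_bp))  # 503
```

Both results agree with the Gene Length and Protein Length fields of this record.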


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA AAAGAGCTTT AATATCAGTA TACGATAAAA CGAATATAAT TGATCTTGCA 
TCGTTTTTAA CGCAGCAACA AATAGAAATT CTTTCAACGG GCAATACTTA TAAACTGCTA
TCTAGTGCAG GAATAAAAAC ACAAGAGGTC TCAGATTACA CACAATTTCC AGAGATACTG
GGTGGTAGAG TAAAAACTTT ACACCCTAAA ATTCATGGAG GCATACTTTG CAATAGAGAA
AAACACAAAA CGGAAATACA AAATCTAGGT ATTGAGCCAA TAGAACTGCT TATAACTAAC
CTATACCCAT TTTGGGAGAC AGTAAGTAGC GGCTCAAATG AAGAGCAAAT TATAGAACAA
ATTGATATCG GCGGAGTGGC GTTAATTAGA GCTGCAGCAA AAAACTTTCG TTTTACTTCA
GTTATTTCTA GCATTCAAGA CTATGAAGCA CTGAAAGCTG AGATGATAGA AAATAACAAT
AAAACAACAT TGGAATATAG AAAACACTTA GCAACCAAAG CATTTGCTCT CACTGCACAC
TACGATTCTA ATATTCACAG TTGGTTTTTA TCCCAGAGTA AAAATAATGA GTTACCAGAG
TTTTTTGCTC TATACGGGCA TAAAGTACAA GAACTCAGGT ATGGTGAAAA TCCCCATCAA
AAAGCTGCAT TTTATAGTAA TCAATTTACA GAATATCCGT TGGAAAAACT ACATGGAAAA
GAGTTGAGTT ATAATAATAT AGTAGATATA GAATCCGCAC TTAACATAAC TTCTGAATTC
GAAGAACCTG CAGCAGTGAT AATCAAGCAT AATAACCCAT GTGGCGCTGC TATTGGTAAT
AATGCTTTGG AGGCATATGA AAAAGCTCTA TCGTGCGATG AAGTAAGCAG TTTTGGTGGT
ATAGTTGCCT TAAACCGGGA GATAGATTTA AAGCTAGCAG AAAAATTAAA CGAGATATTT
TTGGAAGTAG TGATAGCACC ATCGGTAAAC AATGAGGCAC TAAAAATTTT ACAAAGAAAG
AAAAATTTAA GAGTGATTAT TCATAAATCT TTTCAACAAA ATGTGAAATA CCAAACTAAA
AATGTTGTTG GTGGGTTTTT GGTGCAAGAA AATAATGACC ACACAATAAA AGCAGAACAA
GTAACAGAAT GCACTGCAAC AGACAAAGAA AAAAAAGATC TTATTTTTGC CTGGAAAATA
TGTAAGCATG TGAAATCCAA CGCAATAGTT ATAGCAAAAG ATGGTTGTGC TATTGGCATC
GGTGCAGGGC AAACAAGCAG AATAGATAGT GTGAACATTG CAGTGAAAAA AGCAGGTGAA
AAATGTAAAG GTGCAGTGCT TGCTTCAGAT GCATTTTTTC CATTCCCAGA TAGCATAGTA
GAAAGTGCAA AACATGAGAT TACAGCTATA ATTCAGCCCG GCGGCTCGCT GAAAGATCAA
GATGTGATAA AAGCTGCAAA TGAAAATAAA ATTGCTATGT TTTTCACTGG CGTTCGCAGT
TTTTTCCATT AG
 
Protein sequence
MKIKRALISV YDKTNIIDLA SFLTQQQIEI LSTGNTYKLL SSAGIKTQEV SDYTQFPEIL 
GGRVKTLHPK IHGGILCNRE KHKTEIQNLG IEPIELLITN LYPFWETVSS GSNEEQIIEQ
IDIGGVALIR AAAKNFRFTS VISSIQDYEA LKAEMIENNN KTTLEYRKHL ATKAFALTAH
YDSNIHSWFL SQSKNNELPE FFALYGHKVQ ELRYGENPHQ KAAFYSNQFT EYPLEKLHGK
ELSYNNIVDI ESALNITSEF EEPAAVIIKH NNPCGAAIGN NALEAYEKAL SCDEVSSFGG
IVALNREIDL KLAEKLNEIF LEVVIAPSVN NEALKILQRK KNLRVIIHKS FQQNVKYQTK
NVVGGFLVQE NNDHTIKAEQ VTECTATDKE KKDLIFAWKI CKHVKSNAIV IAKDGCAIGI
GAGQTSRIDS VNIAVKKAGE KCKGAVLASD AFFPFPDSIV ESAKHEITAI IQPGGSLKDQ
DVIKAANENK IAMFFTGVRS FFH
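
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the DNA so the result can be compared with the listed protein. It is illustrative only: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino acid string (translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons), and alternative start codons are not handled, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAAAATAA AAAGAGCTTT AATATCAGTA TACGATAAAA CGAATATAAT TGATCTTGCA"
print(translate(first_line))  # MKIKRALISVYDKTNIIDLA
print(round(gc_percent(first_line), 1))
```

The translated residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein Length fields.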