Gene Information |
Locus tag | WD0867 |
Symbol | purH |
ID | 2738441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | + |
Start bp | 838119 |
End bp | 839630 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637173040 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | NP_966617 |
Protein GI | 42520702 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
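The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below is only an illustration: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
START_BP = 838119
END_BP = 839630


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is excluded."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1512, matching "Gene Length"
    print(protein_length(length))  # 503, matching "Protein Length"
```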
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAAGAGCTTT AATATCAGTA TACGATAAAA CGAATATAAT TGATCTTGCA TCGTTTTTAA CGCAGCAACA AATAGAAATT CTTTCAACGG GCAATACTTA TAAACTGCTA TCTAGTGCAG GAATAAAAAC ACAAGAGGTC TCAGATTACA CACAATTTCC AGAGATACTG GGTGGTAGAG TAAAAACTTT ACACCCTAAA ATTCATGGAG GCATACTTTG CAATAGAGAA AAACACAAAA CGGAAATACA AAATCTAGGT ATTGAGCCAA TAGAACTGCT TATAACTAAC CTATACCCAT TTTGGGAGAC AGTAAGTAGC GGCTCAAATG AAGAGCAAAT TATAGAACAA ATTGATATCG GCGGAGTGGC GTTAATTAGA GCTGCAGCAA AAAACTTTCG TTTTACTTCA GTTATTTCTA GCATTCAAGA CTATGAAGCA CTGAAAGCTG AGATGATAGA AAATAACAAT AAAACAACAT TGGAATATAG AAAACACTTA GCAACCAAAG CATTTGCTCT CACTGCACAC TACGATTCTA ATATTCACAG TTGGTTTTTA TCCCAGAGTA AAAATAATGA GTTACCAGAG TTTTTTGCTC TATACGGGCA TAAAGTACAA GAACTCAGGT ATGGTGAAAA TCCCCATCAA AAAGCTGCAT TTTATAGTAA TCAATTTACA GAATATCCGT TGGAAAAACT ACATGGAAAA GAGTTGAGTT ATAATAATAT AGTAGATATA GAATCCGCAC TTAACATAAC TTCTGAATTC GAAGAACCTG CAGCAGTGAT AATCAAGCAT AATAACCCAT GTGGCGCTGC TATTGGTAAT AATGCTTTGG AGGCATATGA AAAAGCTCTA TCGTGCGATG AAGTAAGCAG TTTTGGTGGT ATAGTTGCCT TAAACCGGGA GATAGATTTA AAGCTAGCAG AAAAATTAAA CGAGATATTT TTGGAAGTAG TGATAGCACC ATCGGTAAAC AATGAGGCAC TAAAAATTTT ACAAAGAAAG AAAAATTTAA GAGTGATTAT TCATAAATCT TTTCAACAAA ATGTGAAATA CCAAACTAAA AATGTTGTTG GTGGGTTTTT GGTGCAAGAA AATAATGACC ACACAATAAA AGCAGAACAA GTAACAGAAT GCACTGCAAC AGACAAAGAA AAAAAAGATC TTATTTTTGC CTGGAAAATA TGTAAGCATG TGAAATCCAA CGCAATAGTT ATAGCAAAAG ATGGTTGTGC TATTGGCATC GGTGCAGGGC AAACAAGCAG AATAGATAGT GTGAACATTG CAGTGAAAAA AGCAGGTGAA AAATGTAAAG GTGCAGTGCT TGCTTCAGAT GCATTTTTTC CATTCCCAGA TAGCATAGTA GAAAGTGCAA AACATGAGAT TACAGCTATA ATTCAGCCCG GCGGCTCGCT GAAAGATCAA GATGTGATAA AAGCTGCAAA TGAAAATAAA ATTGCTATGT TTTTCACTGG CGTTCGCAGT TTTTTCCATT AG
|
Protein sequence | MKIKRALISV YDKTNIIDLA SFLTQQQIEI LSTGNTYKLL SSAGIKTQEV SDYTQFPEIL GGRVKTLHPK IHGGILCNRE KHKTEIQNLG IEPIELLITN LYPFWETVSS GSNEEQIIEQ IDIGGVALIR AAAKNFRFTS VISSIQDYEA LKAEMIENNN KTTLEYRKHL ATKAFALTAH YDSNIHSWFL SQSKNNELPE FFALYGHKVQ ELRYGENPHQ KAAFYSNQFT EYPLEKLHGK ELSYNNIVDI ESALNITSEF EEPAAVIIKH NNPCGAAIGN NALEAYEKAL SCDEVSSFGG IVALNREIDL KLAEKLNEIF LEVVIAPSVN NEALKILQRK KNLRVIIHKS FQQNVKYQTK NVVGGFLVQE NNDHTIKAEQ VTECTATDKE KKDLIFAWKI CKHVKSNAIV IAKDGCAIGI GAGQTSRIDS VNIAVKKAGE KCKGAVLASD AFFPFPDSIV ESAKHEITAI IQPGGSLKDQ DVIKAANENK IAMFFTGVRS FFH
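The relationship between the gene and protein sequences above can be sketched with a small translator. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose codon-to-residue assignments translation table 11 shares. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First two blocks and the final in-frame codons of the gene sequence.
    print(translate("ATGAAAATAA AAAGAGCTTT"))  # MKIKRA
    print(translate("TTTTTCCATT AG"))          # FFH
```

Passing the complete gene sequence to `translate` should reproduce the protein sequence once its block spacing is removed with `clean`, and `gc_fraction` on the same input can be compared against the GC content field.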