Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD1142 |
Symbol | purK |
ID | 2738881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | + |
Start bp | 1092439 |
End bp | 1093503 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637173292 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | NP_966859 |
Protein GI | 42520944 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAC CAGATGCGCT GAGCAAAAAA GTAATAGGAA TAATAGGTGG TGGACAATTA GGTAAAATGA CTGCTATCGC TGCAACAAAA CTTGGACAAA AAATACATGT TTTTGCCAGT GCTAAAGACG ATCCAGCTTG TTCTATTGCT GATGATTTCA CAATAGCAGA TTTCTCTGAT AAGAAAGCGC TTGAATCTTT TGCACAGAGT GTGGATTTGG TCACTATTGA GTCTGAAAAT ATTCCATGTA GTGCAATTGA TATCGATGTA AATTTTTATC CGGGTAAAAA AGCGTTACAC ATTGCGCAAA ATAGGCTTAG AGAGAAAGAT TTCATTAGAA GCTTGAGTAT AAAAACTGCT GAATACAAAA GTATACAAAA TTATAATGAG CTACTGAAAA GCAGTAGAGC TTTTGGCTAT CCAACAAGGC TGAAAACAAC AGAAATGGGT TATGATGGAA AAGGGCAATA TGTGCTTGAG AATGATTCTG AAGTGAAGCA ATTTGCTTTC TTTGATTGGA ATACAGAGTA CATTCTTGAA GCAAGTGTTG ATTTACTGAA AGAGGTTTCA ATAGTCGTTG CAAGAGATAA AAACGGTAAA GTAGCTTTTT TTCCTATAGC AGAAAATTAC CACGTTGATG GAATACTTGA TACTTCAACA GTGCCAGCTA AAATAGATAG TAAATTAACT CAAGAGGTAC AACGAGCTGC AAAGAAAATA GCAAATGCGC TTGATGTAAT AGGAATTCTG GCTATTGAAT TTTTTGTTAC TAAGGATAAC GAATTGTTAG TTAATGAACT AGCTCCAAGA CCTCACAATT CTTGCCACTG GAGCTTGGAT GCATGTAACG TTAGTCAATT TGAACAGCTA GTTAGGATAA TATGCGGGCT ACCTATGCAG GAAGTAGTAT TACGCTTTCC TTGTATGACG AAAAATATAA TAGGTAATGA TATATATGAT TCTCATAAGT ATTTGAGCAA CGAAAAAGCT AGTTTAACCA TATATGGGAA AAAAGAGGTT AGGGATAAGC GTAAAATGGG ACATGTCAAT ATAGATTTAA GTTAA
|
Protein sequence | MNEPDALSKK VIGIIGGGQL GKMTAIAATK LGQKIHVFAS AKDDPACSIA DDFTIADFSD KKALESFAQS VDLVTIESEN IPCSAIDIDV NFYPGKKALH IAQNRLREKD FIRSLSIKTA EYKSIQNYNE LLKSSRAFGY PTRLKTTEMG YDGKGQYVLE NDSEVKQFAF FDWNTEYILE ASVDLLKEVS IVVARDKNGK VAFFPIAENY HVDGILDTST VPAKIDSKLT QEVQRAAKKI ANALDVIGIL AIEFFVTKDN ELLVNELAPR PHNSCHWSLD ACNVSQFEQL VRIICGLPMQ EVVLRFPCMT KNIIGNDIYD SHKYLSNEKA SLTIYGKKEV RDKRKMGHVN IDLS
|
| |