Gene WD1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD1142 
SymbolpurK 
ID2738881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp1092439 
End bp1093503 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content35% 
IMG OID637173292 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionNP_966859 
Protein GI42520944 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAC CAGATGCGCT GAGCAAAAAA GTAATAGGAA TAATAGGTGG TGGACAATTA 
GGTAAAATGA CTGCTATCGC TGCAACAAAA CTTGGACAAA AAATACATGT TTTTGCCAGT
GCTAAAGACG ATCCAGCTTG TTCTATTGCT GATGATTTCA CAATAGCAGA TTTCTCTGAT
AAGAAAGCGC TTGAATCTTT TGCACAGAGT GTGGATTTGG TCACTATTGA GTCTGAAAAT
ATTCCATGTA GTGCAATTGA TATCGATGTA AATTTTTATC CGGGTAAAAA AGCGTTACAC
ATTGCGCAAA ATAGGCTTAG AGAGAAAGAT TTCATTAGAA GCTTGAGTAT AAAAACTGCT
GAATACAAAA GTATACAAAA TTATAATGAG CTACTGAAAA GCAGTAGAGC TTTTGGCTAT
CCAACAAGGC TGAAAACAAC AGAAATGGGT TATGATGGAA AAGGGCAATA TGTGCTTGAG
AATGATTCTG AAGTGAAGCA ATTTGCTTTC TTTGATTGGA ATACAGAGTA CATTCTTGAA
GCAAGTGTTG ATTTACTGAA AGAGGTTTCA ATAGTCGTTG CAAGAGATAA AAACGGTAAA
GTAGCTTTTT TTCCTATAGC AGAAAATTAC CACGTTGATG GAATACTTGA TACTTCAACA
GTGCCAGCTA AAATAGATAG TAAATTAACT CAAGAGGTAC AACGAGCTGC AAAGAAAATA
GCAAATGCGC TTGATGTAAT AGGAATTCTG GCTATTGAAT TTTTTGTTAC TAAGGATAAC
GAATTGTTAG TTAATGAACT AGCTCCAAGA CCTCACAATT CTTGCCACTG GAGCTTGGAT
GCATGTAACG TTAGTCAATT TGAACAGCTA GTTAGGATAA TATGCGGGCT ACCTATGCAG
GAAGTAGTAT TACGCTTTCC TTGTATGACG AAAAATATAA TAGGTAATGA TATATATGAT
TCTCATAAGT ATTTGAGCAA CGAAAAAGCT AGTTTAACCA TATATGGGAA AAAAGAGGTT
AGGGATAAGC GTAAAATGGG ACATGTCAAT ATAGATTTAA GTTAA
 
Protein sequence
MNEPDALSKK VIGIIGGGQL GKMTAIAATK LGQKIHVFAS AKDDPACSIA DDFTIADFSD 
KKALESFAQS VDLVTIESEN IPCSAIDIDV NFYPGKKALH IAQNRLREKD FIRSLSIKTA
EYKSIQNYNE LLKSSRAFGY PTRLKTTEMG YDGKGQYVLE NDSEVKQFAF FDWNTEYILE
ASVDLLKEVS IVVARDKNGK VAFFPIAENY HVDGILDTST VPAKIDSKLT QEVQRAAKKI
ANALDVIGIL AIEFFVTKDN ELLVNELAPR PHNSCHWSLD ACNVSQFEQL VRIICGLPMQ
EVVLRFPCMT KNIIGNDIYD SHKYLSNEKA SLTIYGKKEV RDKRKMGHVN IDLS