Gene BURPS668_0839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0839 
SymbolpurK 
ID4881810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp816438 
End bp817634 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID640126767 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001057890 
Protein GI126441524 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCAC TCCCCACCCC GAATTCCCCG ATCCTGCCGG GCGCCTGGCT CGGCATGGTC 
GGCGGCGGCC AGCTCGGCCG CATGTTCTGC TTTGCCGCGC AAGCGATGGG CTACCGCGTC
GCCGTGCTCG ATCCCGATCC GACGAGCCCC GCGGGCGCCG TCGCCGACAA GCATCTGCGC
GCCGCGTACG ACGACGAGGC CGCGCTCGCC GAGCTCGCGC AATTGTGCGA TGCCGTATCG
ACCGAATTCG AGAACGTGCC CGCCGCGAGC CTCGAGCTGC TCGCGCAATC GACGTTCGTC
GCGCCGGCCG GCCGGTGCGT CGCGATCGCG CAGGACCGGA TCGCCGAGAA ACGATTCATC
GCGGCGTCGG GCGTGCCCGT CGCGCCGCAC GTCGTGATCG AATCGCACGC GCAGCTCGCC
GCGCTCGCCG ATGCGGACCT CGCCGCGGTG CTGCCCGGCA TCCTGAAGAC CGCGCGTCTC
GGTTACGACG GCAAGGGGCA GGTGCGTGTC GCGACGGTGC GCGAGGCGCG CGACGCGTAC
GCGTCGCTCG GCGGCGTGCC TTGCGTGCTC GAGAAGCGCC TGCCGCTCGA ATACGAGGTG
TCGGCGCTGA TCGCGCGCGG CGCGAACGGC GCGTCGGCGG TGTTTCCGCT CGCGCAGAAC
ACGCACCACG GCGGCATCCT GTCGCTGAGC GTCGTGCCCG CGCCCGCCGC GAGCGATGCG
CTCGTGCGCG AAGCGCAGCA GGCGGCCGTG CGGATCGCCG ATTCGCTCGG CTACGTCGGC
GTGCTGTGCG TCGAGTTCTT CGTGCTCGAA GACGGCTCGC TCGTCGCGAA CGAAATGGCG
CCGCGCCCGC ACAACTCCGG CCATTACACG GTCGACGCGT GCGAGACGAG CCAGTTCGAG
CAGCAGGTGC GCGCGATGAC GCGGCTGCCG CTCGGCAGCA CGCGCCAGCA TTCGCCCGCC
GCGATGCTCA ACGTGCTCGG CGACGTGTGG TTCGCGAACG GCGTGTCGGG TGAGCCCGTC
ACGCCGCCGT GGGACGAGGT CGCCGCAATG CCGACCGCGC GGCTGCATCT GTACGGCAAG
GAAGAGGCGC GCGCCGGCCG CAAGATGGGC CATGTGAACT TCACCGCGGC GACGCGCGAC
GAAGCGGTCG CCGGCGCGAC CGCGTGCGCG CGGCTGTTGC GCATTGCGCT CGACTGA
 
Protein sequence
MTALPTPNSP ILPGAWLGMV GGGQLGRMFC FAAQAMGYRV AVLDPDPTSP AGAVADKHLR 
AAYDDEAALA ELAQLCDAVS TEFENVPAAS LELLAQSTFV APAGRCVAIA QDRIAEKRFI
AASGVPVAPH VVIESHAQLA ALADADLAAV LPGILKTARL GYDGKGQVRV ATVREARDAY
ASLGGVPCVL EKRLPLEYEV SALIARGANG ASAVFPLAQN THHGGILSLS VVPAPAASDA
LVREAQQAAV RIADSLGYVG VLCVEFFVLE DGSLVANEMA PRPHNSGHYT VDACETSQFE
QQVRAMTRLP LGSTRQHSPA AMLNVLGDVW FANGVSGEPV TPPWDEVAAM PTARLHLYGK
EEARAGRKMG HVNFTAATRD EAVAGATACA RLLRIALD