Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0668 |
Symbol | purK |
ID | 3848781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 766293 |
End bp | 767489 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637840341 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_441224 |
Protein GI | 83720003 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCAC TCCCCAACCC GAATTCCCCG ATCCTGCCGG GCGCCTGGCT CGGCATGGTC GGCGGCGGCC AGCTCGGCCG CATGTTCTGC TTCGCCGCGC AGGCGATGGG CTACCGCGTC GCCGTGCTCG ATCCCGATCC GACGAGCCCG GCGGGCGCCG TCGCGGACAA GCATCTGCGC GCCGCATACG ACGACGAGGC CGCGCTCGCC GAGCTCGCGC AATTGTGCGA CGCGGTATCG ACCGAGTTCG AGAACGTGCC CGCCGCGAGC CTCGATTTCC TCGCGCAATC GACGTTCGTC GCGCCCGCCG GCCGCTGCGT CGCGATCGCG CAGGACCGGA TCGCCGAGAA GCGTTTCATC GCGGCGTCGG GCGTGCCGGT CGCGCCGCAC GTCGTGATCG AGTCGGCCGC GCAGCTCGCG GCGCTCGCCG ATGCGGATCT CGCCGCGGTG CTGCCCGGCA TCCTGAAGAC CGCGCGCCTC GGCTACGACG GCAAGGGGCA GGTGCGCGTC GCGACCGCGC AGGAAGCGCG CGACGCGTAT GGGTCGCTCG GCGGCGTGCC GTGCGTGCTC GAGAAGCGCC TGCCGCTCAA ATACGAGGTG TCGGCGCTGA TCGCGCGCGG CGCGAACGGC GCGTCCGCCG TGTTTCCGCT CGCGCAGAAC ACGCACCACG GCGGCATCCT GTCGCTGAGC GTCGTGCCCG CGCCCGCCGC GAGCGATGCG CTCGTGCGCG ACGCGCAGCA GGCGGCCGCG CGGATCGCCG ATTCGCTCGA CTACGTCGGC GTGCTGTGCG TCGAGTTCTT CGTGCTCGAA GACGGCTCGC TCGTTGCGAA CGAAATGGCG CCGCGGCCGC ACAATTCCGG CCACTACACG GTCGATGCGT GCGAGACGAG CCAGTTCGAG CAGCAGGTGC GCGCGATGAC GCGGCTGCCG CTCGGCAGCA CGCGCCAGCA TTCGCCCGCC GCGATGCTTA ACGTGCTCGG CGACGTATGG TTCGCGAGCG GCGCGTCGGG CGAGCCCGTC ACGCCGCCGT GGGACCAGGT CGCCGCGATG CCGACCGCGC GGCTGCATCT GTACGGCAAG GAAGAAGCGC GGGTCGGCCG CAAGATGGGC CACGTGAACT TCACCGCGGC GACGCTCGAC GAAGCGGTCG CGGGCGCGAC CGCGTGCGCG CGGCTGTTGC GCATTCCGCT CGACTGA
|
Protein sequence | MTALPNPNSP ILPGAWLGMV GGGQLGRMFC FAAQAMGYRV AVLDPDPTSP AGAVADKHLR AAYDDEAALA ELAQLCDAVS TEFENVPAAS LDFLAQSTFV APAGRCVAIA QDRIAEKRFI AASGVPVAPH VVIESAAQLA ALADADLAAV LPGILKTARL GYDGKGQVRV ATAQEARDAY GSLGGVPCVL EKRLPLKYEV SALIARGANG ASAVFPLAQN THHGGILSLS VVPAPAASDA LVRDAQQAAA RIADSLDYVG VLCVEFFVLE DGSLVANEMA PRPHNSGHYT VDACETSQFE QQVRAMTRLP LGSTRQHSPA AMLNVLGDVW FASGASGEPV TPPWDQVAAM PTARLHLYGK EEARVGRKMG HVNFTAATLD EAVAGATACA RLLRIPLD
|
| |