Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK0264 |
Symbol | purK |
ID | 3022516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 309788 |
End bp | 310939 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637544464 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_081873 |
Protein GI | 52144955 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGAA TCATTTTACC TGGAAAAACA ATCGGCATTA TTGGAGGCGG CCAGCTAGGA AGAATGATGG CATTGGCAGC TAAGGAAATG GGATATAAAA TTGCTGTTTT AGATCCTACA AAAAACTCAC CATGTGCACA AGTTGCTGAT ATTGAAATTG TTGCATCATA TGACGATTTA AAAGCAATTC AGCATTTAGC CGAGATTAGT GATGTTGTCA CATATGAATT TGAGAATATT GATTATAGAT GTTTACAATG GCTTGAAAAG CATGCTTACT TACCGCAAGG CAGTCAGTTG TTAAGTAAAA CACAAAATCG TTTTACTGAA AAGAATGCAA TTGAAAAAGC GGGATTACCA GTAGCAACGT ATAGATTGGT TCAAACTCAA GAGCAGCTTA CTGAAGCAAT CGCCGAGTTA TCATATCCTT CTGTCTTAAA AACAACGACA GGTGGATATG ATGGGAAAGG GCAAGTTGTT TTAAGAAGTG AAGCTGACGT TGATGAAGCG CGAAAGCTTG CGAATGCAGC AGAGTGTATT TTAGAGAAAT GGGTGCCTTT TGAAAAAGAA GTATCAGTTA TCGTAATTCG TAGTGTAAGT GGTGAAACGA AAGTATTTCC GGTAGCGGAA AATATTCATG TAAATAACAT TTTGCATGAA TCCATCGTTC CAGCTCGCAT TACAGAAGAA CTTTCTCAAA AAGCAATTGC TTATGCAAAA GTGCTTGCGG ATGAACTAGA ACTTGTGGGA ACACTAGCGG TAGAGATGTT TGCTACAGCT GATGGTGAGA TTTATATTAA TGAACTAGCA CCAAGACCTC ACAATTCAGG ACACTATACA CAGGATGCAT GTGAAACGAG TCAATTCGGT CAACATATTC GAGCAATCTG TAATTTACCT CTTGGAGAAA CAAATTTGTT AAAACCAGTT GTCATGGTAA ACATTTTAGG CGAACATATA GAAGGGGTCC TAAGACAAGT GAATAGATTA ACCGGGTGCT ATTTACACTT GTATGGAAAA GAAGAAGCAA AAGCGCAGCG AAAAATGGGG CATGTTAATA TTTTAAATGA TAATATTGAA GTCGCTCTAG AAAAAGCGAA GAGTTTGCAT ATTTGGGACC ATCAAGAACA ACTGTTGGAG GGAAAAAGAT GA
|
Protein sequence | MTRIILPGKT IGIIGGGQLG RMMALAAKEM GYKIAVLDPT KNSPCAQVAD IEIVASYDDL KAIQHLAEIS DVVTYEFENI DYRCLQWLEK HAYLPQGSQL LSKTQNRFTE KNAIEKAGLP VATYRLVQTQ EQLTEAIAEL SYPSVLKTTT GGYDGKGQVV LRSEADVDEA RKLANAAECI LEKWVPFEKE VSVIVIRSVS GETKVFPVAE NIHVNNILHE SIVPARITEE LSQKAIAYAK VLADELELVG TLAVEMFATA DGEIYINELA PRPHNSGHYT QDACETSQFG QHIRAICNLP LGETNLLKPV VMVNILGEHI EGVLRQVNRL TGCYLHLYGK EEAKAQRKMG HVNILNDNIE VALEKAKSLH IWDHQEQLLE GKR
|
| |