Gene BCG9842_B4985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4985 
SymbolpurK 
ID7185287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp295772 
End bp296923 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content38% 
IMG OID643548095 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_002443807 
Protein GI218895396 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones168 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAA TCATTTTACC TGGAAAAACA ATCGGCATTA TTGGAGGCGG CCAGCTAGGA 
AGAATGATGG CATTGGCAGC CAAGGAGATG GGATATAAAA TTGCTGTTTT AGATCCTACA
AAGCATTCAC CATGTGCACA AGTTGCTGAT ATTGAAATCG TTGCACCGTA TGACGATTTA
AAGGCAATTC AGCATTTAGC AGAGATAAGT GATGTTGTCA CATATGAATT TGAGAATATT
GATTATAGAT GTTTACAATG GCTTGAAAAA CATGCTTACT TGCCACAAGG TAGTCAGTTG
TTAAATAAAA CGCAAAATCG TTTTACAGAA AAGAATGGAA TTGAGAAGGC TGGGTTACCG
GTAGCAACGT ATAGATTAGT TCAAAATCAA GATCAGCTTA CAGAAGCAAT TGCTGAGTTA
TCATTCCCTT CCGTCTTAAA AACGACGACA GGTGGATATG ATGGGAAAGG GCAAGTTGTT
TTAAGAAGTG AGGCTGATGT TGAGACAGCA AGAAATCTTG TGGATAAAGC AGAGTGTATT
TTAGAGAAAT GGGTGCCTTT TGAAAAAGAA GTATCTGTTA TTGTGATTCG TAGTGTAAGT
GGTGAAACGA AAGTGTTTCC AGTAGCGGAA AATATTCATG TAAATAACAT TTTGCATGAA
TCTATCGTTC CAGCTCGTAT TACAGAAGAG CTTTCTCAAA AAGCAATTGC TTATGCAAAG
GTACTTGCGG ATGAATTAAA ACTTGTGGGA ACACTAGCTG TAGAGATGTT TGCTACAGCT
AATGGTGAGA TTTACATTAA TGAATTAGCA CCAAGACCTC ACAATTCAGG ACACTACACA
CAGGATGCAT GTGAAACGAG CCAATTTGGT CAACATATTC GAGCAATCTG TAATTTACCT
CTAGGAGAAA CAAATTTGTT AAAACCAGTT GTCATGGTAA ACATTTTAGG CGAACATATA
GAAGGGGTCC TAAGACAAGT GAATAGACTA ACCGGGTGCT ATTTACACTT GTATGGAAAA
GAAGAAGCAA AAGCACAGCG AAAAATGGGG CATGTTAATA TTTTAAATGA TAATATTGAA
GTTGCTCTAG AAAAAGCGAA GAGTTTGCAT ATTTGGGACC ATCAAGAACA ACTGTTGGAG
GGAAAAAGAT GA
 
Protein sequence
MTRIILPGKT IGIIGGGQLG RMMALAAKEM GYKIAVLDPT KHSPCAQVAD IEIVAPYDDL 
KAIQHLAEIS DVVTYEFENI DYRCLQWLEK HAYLPQGSQL LNKTQNRFTE KNGIEKAGLP
VATYRLVQNQ DQLTEAIAEL SFPSVLKTTT GGYDGKGQVV LRSEADVETA RNLVDKAECI
LEKWVPFEKE VSVIVIRSVS GETKVFPVAE NIHVNNILHE SIVPARITEE LSQKAIAYAK
VLADELKLVG TLAVEMFATA NGEIYINELA PRPHNSGHYT QDACETSQFG QHIRAICNLP
LGETNLLKPV VMVNILGEHI EGVLRQVNRL TGCYLHLYGK EEAKAQRKMG HVNILNDNIE
VALEKAKSLH IWDHQEQLLE GKR