Gene Information |
Locus tag | B21_00477 |
Symbol | purK |
ID | 8114591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 520413 |
End bp | 521480 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644846759 |
Product | hypothetical protein |
Protein accession | YP_002998332 |
Protein GI | 251784028 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
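The coordinate and length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 520413
end_bp = 521480

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1068 bp
print(protein_length, "aa")   # 355 aa
```

Under these assumptions the results agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields listed in the record.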
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.182977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGT GAACCGTTAG GCATTGCTGT CTGGCCGGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT TTTCAACAAA GCGTGATTAC CGCTGAGATC GAACGCTGGC CGGAAACCGC ATTAACCCGC GAGCTGGCGC GTCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCCGACCGT CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CCGCACCGTG GCAGTTACTT GCCGATCGCA GCGAGTGGCC TGCGGTGTTT GAGCGTTTAG GTGAACTGGC GATTGTTAAG CGTCGCACTG GTGGCTATGA CGGTCGCGGT CAATGGCGTT TACGTGCCAA TGAAACCGAA CAGTTACCGG CAGAGTGTTA CGGCGAATGT ATTGTCGAGC AGGGCATTAA CTTCTCTGGT GAAGTGTCGC TGGTTGGCGC ACGCGGATTT GATGGCAGCA CCGTGTTTTA TCCGCTGACG CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTTCCGCA GGCCAATGCG CAGCAGCAAG CGCAAGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG GGCGTGATGG CGATGGAGTG TTTTGTCACC CCGCAAGGTC TGCTGATCAA CGAACTGGCA CCGCGTGTGC ATAACAGCGG TCACTGGACA CAAAACGGTG CCAGCATCAG CCAGTTTGAG CTGCATCTGC GGGCGATTAC CGATCTGCCG TTACCGCAAC CGGTAGTGAA TAGTCCGTCG GTGATGATCA ACCTGATTGG TAGCGATGTG AATTATGACT GGCTGAAACT GCCGCTGGTG CATCTGCACT GGTACGACAA AGAAGTCCGT CCGGGGCGTA AAGTGGGGCA TCTGAATTTG ACCGACAGCG ACACATCGCG TCTGACCGCG ACGCTGGAAG CCTTGATCCC GCTGCTGCCG CCGGAGTATG CCAGCGGCGT GATGTGGGCG CAGAGTAAGT TCAGTTAA
|
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL ADRSEWPAVF ERLGELAIVK RRTGGYDGRG QWRLRANETE QLPAECYGEC IVEQGINFSG EVSLVGARGF DGSTVFYPLT HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMAMECFVT PQGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITDLP LPQPVVNSPS VMINLIGSDV NYDWLKLPLV HLHWYDKEVR PGRKVGHLNL TDSDTSRLTA TLEALIPLLP PEYASGVMWA QSKFS
|
| |
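The relationship between the Gene sequence and Protein sequence fields can be sketched in plain Python. The example below is not the database's own method. It is an illustrative check that builds a codon table from the compact NCBI layout (bases ordered T, C, A, G) and translates the first three and last two 10-base blocks of the gene sequence shown above. Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, so one table serves here. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the compact NCBI layout: first base varies slowest, order T, C, A, G.
# Table 11 uses the same amino acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the Gene sequence field above.
head = "ATGAAACAGG TTTGCGTCCT CGGTAACGGG"
tail = "CAGAGTAAGT TCAGTTAA"

print(translate(head))  # MKQVCVLGNG
print(translate(tail))  # QSKFS
```

The two fragments translate to the first ten and last five residues of the Protein sequence field. This suggests the gene sequence is listed in coding orientation even though the Strand field is "-". Passing the full gene sequence to `gc_content` would allow a comparison with the GC content field.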