Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0623 |
Symbol | purK |
ID | 6966728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 647246 |
End bp | 648313 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384661 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_002269175 |
Protein GI | 209399025 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.121669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGC GAACCGTTAG GCATTGCTGT CTGGCCGGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT TTTCAACAAA GCGTGATTAC CGCTGAGATC GAACGCTGGC CGGAAACAGC ATTAACCCGC GAGCTGGCGC GCCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCTGACCGT CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CTGCACCGTG GCAGTTACTT GCCGAACGCA GCGAGTGGCC TGCGGTGTTT GAGCGTTTAG GTGAGCTGGC GATTGTTAAG CGTCGCACTG GTGGCTATGA CGGACGCGGT CAATGGCGTT TACGCGCCGA TGAAACCGAG CAGTTACCGG CTGAATGCTA CGGCGAATGT ATTGTCGAGC AGGGCATAAA CTTCTCTGGT GAAGTGTCGC TGGTTGGCGC GCGCTGCTTT GATGGCAGCA CCGTGTTTTA TCCGCTGACG CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTTCCGCA GGCCAATGCG CAGCAGCAAG CGCAAGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG GGCGTGATGG CGATGGAGTG TTTTGTCACC CCGCAAGGTC TGCTGATCAA CGAACTGGCA CCGCGTGTGC ATAACAGCGG TCACTGGACA CAAAACGGTG CCAGCATCAG CCAGTTTGAG CTACATCTGC GGGCGATTAC CGATCTGCCG TTACCGCAAC CAGTGGTGAA TAATCCGTCG GTGATGATCA ATCTGATTGG TAGCGATGTG AATTATGACT GGCTGAAACT GCCGCTGGTG CATCTGCACT GGTACGACAA AGAAGTCCGT CCGGGCCGTA AAGTGGGGCA TCTGAATTTG ACCGACAGCG ACACATCGCG TCTGTCCGCG ACGCTGGAAG CCTTGATCCC GCTGCTGCCG CCGGAATATG CCAGCGGCGT GATGTGGGCG CAGAGTAAGT TCTGTTAA
|
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL AERSEWPAVF ERLGELAIVK RRTGGYDGRG QWRLRADETE QLPAECYGEC IVEQGINFSG EVSLVGARCF DGSTVFYPLT HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMAMECFVT PQGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITDLP LPQPVVNNPS VMINLIGSDV NYDWLKLPLV HLHWYDKEVR PGRKVGHLNL TDSDTSRLSA TLEALIPLLP PEYASGVMWA QSKFC
|
| |