Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0567 |
Symbol | purK |
ID | 6145833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 575433 |
End bp | 576500 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641615459 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001742666 |
Protein GI | 170681139 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0007186 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGC GAACCGTTAG GCATTGCTGT CTGGCCAGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT TTTCAACAAA GTGTGATTAC CGCTGAGATC GAACGCTGGC CGGAAACCGC ATTAACCCGC GAGCTGGCGC GCCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCTGACCGT CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CCGCACCGTG GCAGTTACTT GCCGATCGCA GCGAGTGGCC TGCGGTGTTT GATCGTTTAG GTGAGCTGGC GATTGTTAAG CGTCGCACTG GTGGCTATGA CGGACGCGGT CAATGGCGTT TACGCGCCGA TGAAACCGAA CAGTTACCGA CTGAATGCTA CGGCGAATGT ATTGTCGAGC AGGGCATAAA CTTCTCTGGT GAAGTGTCGC TGGTTGGCGC GCGCGGCTTT GATGGTAGTA CCGTGTTTTA TCCGCTGACG CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTCCCGCA GGCCAACGCG CAGCAGCAGG CGCAGGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG GGCGTGATGG CGATGGAGTG TTTTGTCACC CCACAAGGTC TGCTGATCAA CGAACTGGCT CCGCGTGTGC ATAACAGTGG GCACTGGACG CAAAACGGTG CCAGCATCAG CCAGTTTGAG CTGCATCTGC GGGCGATTAC CGATCTGCCG TTACCACAAC CGGTGGTGAA TAGTCCGTCG GTGATGATCA ACCTGATTGG TAGCGATGTG AATTATGACT GGCTGAAACT GCCGCTGGTG CATCTGCACT GGTACGACAA AGAAGTCCGT CCGGGGCGTA AAGTGGGGCA TTTAAATTTG ACCGACAGCG ACACATCGCG TCTGACCGCG ACGCTGGAAG CCTTAATCCC GCTGCTGCCG CCAGAATATG CTAGCGGCGT GATTTGGGCG CAGAGCAAGT TCAGTTAA
|
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL ADRSEWPAVF DRLGELAIVK RRTGGYDGRG QWRLRADETE QLPTECYGEC IVEQGINFSG EVSLVGARGF DGSTVFYPLT HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMAMECFVT PQGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITDLP LPQPVVNSPS VMINLIGSDV NYDWLKLPLV HLHWYDKEVR PGRKVGHLNL TDSDTSRLTA TLEALIPLLP PEYASGVIWA QSKFS
|
| |