Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0641 |
Symbol | purK |
ID | 6490480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 643285 |
End bp | 644352 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642740899 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_002044566 |
Protein GI | 194448120 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.132584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG TTTGCGTTCT CGGCAACGGG CAACTGGGCC GAATGCTGCG CCAGGCGGGC GAACCGCTGG GTATCGCCGT CTGGCCGGTT GGTCTGGATG CAGAGCCTAC CGCCGTGCCG GTACAGCAGA GCGTCATTAC CGCAGAGATT GAGCGCTGGC CGGAAACCGC GCTTACCCGC GAGCTGGCGC GCCACCCGGC ATTCGTCAAT CGCGATGTAT TTCCGATCAT CGCCGACCGT CTGACACAAA AACAGCTTTT CGATAAACTG GGACTCGCGA CCGCGCCGTG GCAACTGCTG ACCAGCGCTG ACGAGTGGTC CGGCATCTTT GACCGTCTGG GCGAACTGGC GATTATTAAG CGTCGCGTTG GCGGCTACGA TGGTCGCGGG CAGTGGCGTC TACGCGCGGA CGAAACCGGG CAACTGCCGG ATGACTGCTA TGGCGAATGT ATTGTTGAGC GCGGTATCCA TTTTTCCGGC GAAGTGTCGT TAGTCGGCGC GCGCGCTCAT GACGGTAGTA CCGTGTTTTA CCCGCTAACG CACAATTTGC ATCAGGACGG CATCTTGCGG ACCAGCGTCG CGTTCCCACA GGCGAACGCC GAACAGCAGG AGCAGGCGGA ATCGATGCTG TCAGCAATTA TGCAGGCGTT GAACTACGTC GGCGTAATGG CGATGGAATG TTTTATCACG CCGGAAGGCC TGTTAATCAA TGAACTGGCG CCGCGCGTGC ATAACAGCGG ACACTGGACG CAAAATGGCG CCAGCATCAG TCAGTTTGAA TTGCATTTGC GCGCGATTAC CGGCCTGCCG TTGCCCGCGC CGGTGATTAA CGCCCCGTCG GTGATGATCA ATCTGATCGG CAGCGAGCTG AATTACGACT GGCTGAAGCT GCCGCTGGTA CATCTGCACT GGTATGATAA AGCGGTACGT CCGGGGCGAA AAGTCGGCCA TCTGAATCTG ACCGACAGCG ATACGTCACG TCTTAGCGCC ACCCTGGAAG CGCTCTCTCC GCTCCTGCCG GGCGAATACG CCAGCGGCAT TATCTGGGCG CAAAGTAAGC TTAAATAA
|
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPTAVP VQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL GLATAPWQLL TSADEWSGIF DRLGELAIIK RRVGGYDGRG QWRLRADETG QLPDDCYGEC IVERGIHFSG EVSLVGARAH DGSTVFYPLT HNLHQDGILR TSVAFPQANA EQQEQAESML SAIMQALNYV GVMAMECFIT PEGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITGLP LPAPVINAPS VMINLIGSEL NYDWLKLPLV HLHWYDKAVR PGRKVGHLNL TDSDTSRLSA TLEALSPLLP GEYASGIIWA QSKLK
|
| |