Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0586 |
Symbol | purK |
ID | 6486537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 600433 |
End bp | 601500 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642736003 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_002039777 |
Protein GI | 194442231 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.148002 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG TTTGTGTTCT CGGCAACGGA CAACTGGGCC GAATGCTGCG CCAGGCGGGC GAACCGCTGG GTATCGCCGT CTGGCCGGTT GGCCTGGATG CAGAGCCTAC CGCCGTGCCG GTACAGCAGA GCGTCATTAC CGCAGAGATT GAGCGCTGGC CGGAAACCGC GCTTACCCGC GAGCTGGCGC GCCACCCGGC CTTCGTCAAT CGCGATGTAT TTCCGATCAT CGCCGACCGT CTGACACAAA AACAGCTTTT CGATAAACTG GGACTCGCGA CCGCGCCGTG GCAGCTGCTG ACCAGCGCCG ACGAGTGGTC CGGCATCTTT GACCGTCTGG GAGAACTGGC GATTATTAAG CGTCGCGTTG GCGGCTACGA TGGTCGCGGG CAGTGGCGTC TACGCGCGGA CGAAACCGGG CAACTGCCGG ATGACTGCTA TGGCGAATGT ATTGTTGAGC GCGGTATCCA TTTTTCCGGC GAAGTGTCGT TAGTCGGCGC GCGCGCTCAT GACGGCAGCA CCGTGTTTTA CCCGCTAACG CACAATTTGC ATCAGGACGG CATCTTGCGG ACCAGCGTCG CGTTCCCACA GGCGAACGCC GAACAGCAGG AGCAGGCGGA ATCGATGCTG TCAGCAATTA TGCAGGCGCT GAACTACGTC GGCGTAATGG CGATGGAATG TTTTATCACG CCGGAAGGCC TGTTAATCAA TGAACTGGCG CCGCGCGTGC ATAACAGCGG ACACTGGACG CAAAATGGCG CCAGCATCAG TCAGTTTGAA TTGCATTTGC GCGCGATTAC CGGCCTGCCG TTGCCCGCGC CGGTGATTAA CGCCCCGTCG GTGATGATCA ACCTGATCGG CAGCGAGCTG AATTACGACT GGCTGAAGCT GCCGCTGGTA CATCTGCACT GGTATGATAA AGCGGTACGT CCGGGGCGAA AAGTCGGCCA TCTGAATCTG ACCGACAGCG ATACGTCACG TCTTAGCGCC ACCCTGGAAG CGCTCTCTCC GCTCCTGCCG GGCGAATACG CCAGCGGCAT TATCTGGGCG CAAAGTAAGC TTAAATAA
|
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPTAVP VQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL GLATAPWQLL TSADEWSGIF DRLGELAIIK RRVGGYDGRG QWRLRADETG QLPDDCYGEC IVERGIHFSG EVSLVGARAH DGSTVFYPLT HNLHQDGILR TSVAFPQANA EQQEQAESML SAIMQALNYV GVMAMECFIT PEGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITGLP LPAPVINAPS VMINLIGSEL NYDWLKLPLV HLHWYDKAVR PGRKVGHLNL TDSDTSRLSA TLEALSPLLP GEYASGIIWA QSKLK
|
| |