Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A0578 |
Symbol | purK |
ID | 6515633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 583105 |
End bp | 584172 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642745722 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_002113546 |
Protein GI | 194736649 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.396185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG TTTGCGTTCT CGGCAACGGA CAACTGGGCC GAATGCTGCG CCAGGCGGGC GAACCGCTGG GTATCGCCGT CTGGCCGGTT GGTCTGGATG CAGAGCCTAC TGCTGTTCCG GTACAGCAGA GCGTCATTAC CGCAGAGATT GAGCGCTGGC CGGAAACCGC GCTTACCCGC GAGCTGGCAC GCCACCCGGC ATTCGTCAAT CGCGATGTAT TTCCGATCAT CGCCGACCGT CTGACACAAA AACAGCTTTT CGATAAACTG GGACTCGCGA CCGCGCCGTG GCAGCTGCTG ACCAGCGCCG ACGAGTGGTC CGGCATCTTT GACCGTCTGG GCGAACTGGC GATTATTAAG CGTCGCGTTG GCGGCTACGA CGGTCGCGGG CAGTGGCGTC TACGCGCGGA CGAAACCGGG CAACTGCCGG ATGACTGCTA TGGCGAATGT ATTGTTGAGC GCGGCATCCA TTTTTCCGGC GAAGTGTCGT TAGTCGGCGC GCGCGCTCAT GACGGCAGTA CCGTGTTTTA CCCGCTGACG CACAATTTGC ATCAGGACGG CATCTTGCGG ACCAGCGTCG CGTTCCCACA GGCGAACGCC GAACAGCAGG AGCAGGCGGA ATTAATGCTG TCAGCAATTA TGCAGGCGCT GAACTACGTC GGCGTAATGG CGATGGAATG TTTTATCACG CCGGAAGGCC TGTTAATCAA TGAACTGGCG CCGCGCGTGC ATAACAGCGG ACACTGGACG CAAAATGGCG CCAGCATCAG TCAGTTTGAA TTGCATTTGC GCGCGATTAC CGGCCTGCCG TTGCCCGCGC CAGTGATTAA CGCCCCGTCG GTGATGATCA ATCTGATCGG CAGCGAGCTG AATTACGACT GGCTGAAGCT GCCGCTGGTA CATCTGCACT GGTATGATAA AGCGGTACGC CCGGGGCGAA AAGTCGGCCA TCTGAATCTG ACCGACAGCG ATACGTCACG TCTTAGCGCC ACCCTGGAAG CGCTCTCTCC GCTCCTGCCG GGCGAATACG CCAGCGGCAT TATCTGGGCG CAAAGTAAGC TTAAATAA
|
Protein sequence | MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPTAVP VQQSVITAEI ERWPETALTR ELARHPAFVN RDVFPIIADR LTQKQLFDKL GLATAPWQLL TSADEWSGIF DRLGELAIIK RRVGGYDGRG QWRLRADETG QLPDDCYGEC IVERGIHFSG EVSLVGARAH DGSTVFYPLT HNLHQDGILR TSVAFPQANA EQQEQAELML SAIMQALNYV GVMAMECFIT PEGLLINELA PRVHNSGHWT QNGASISQFE LHLRAITGLP LPAPVINAPS VMINLIGSEL NYDWLKLPLV HLHWYDKAVR PGRKVGHLNL TDSDTSRLSA TLEALSPLLP GEYASGIIWA QSKLK
|
| |