Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_3018 |
Symbol | purK |
ID | 5385511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 3396498 |
End bp | 3397562 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640866023 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001401978 |
Protein GI | 153948176 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000000190426 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG TTTGTGTACT GGGTAATGGC CAGTTAGGGC GAATGCTGCG GCAGGCAGGT GAACCGCTAG GAATTGCTGT TTATCCCGTC GGCTTAGATG CTGAACCTGA AGCGGTGCCT TATCAGCACA GTGTGATCAC CGCTGAAATT GAACGTTGGC CGGAAACCGC CTTAACCCGT GAATTAGCTA CCCATACTGC TTTTGTTAAT CGCGATATTT TTCCACGTCT GGCAGATCGT CTGCCCCAAA AGCAGTTACT CGATAGCTTG GGTTTGGCAA CCGCGCCGTG GCAATTGTTA TCCAGCGCCA GTGAATGGCC TGAGGTGTTC GCCACGCTGG GTGAGCTAGC CATCGTAAAA CGGCGGGTCG GCGGCTATGA CGGCCGGGGT CAATGGCGTT TACGCCCTGG TGAGCAGGGT ACCTTACCCC CCGATGCTTA CGGCGAGTGT ATTGTCGAAC AGGGGATTAA CTTCTCCGGC GAAGTCTCAT TGATCGGCGC GCGCAGCCAC CAAGGTGAAT CGGTATTTTA TCCACTGACC CATAATCTGC ATGAAGATGG CATTTTGCGC ATGAGCGTGG CATTACCACA GCCCAACAGC AAACTACAGC AGCAAGCCGA AAAAATGCTG TCAGCCATTA TGGATAAGCT GAATTATGTC GGTGTGATGG CGATGGAGTG TTTTATCGTC GGCGACCGTC TGTTGATCAA TGAACTGGCC CCGCGCGTTC ATAACAGTGG TCACTGGACA CAAAACGGCG CATCGATTAG CCAGTTCGAA TTGCATCTGC GGGCCATTTT GGATCTGCCA CTGCCGCAGC CGGTGGTGAA CACCCCGTCA GCGATGGTTA ATCTGATTGG CACGCCAGTA AATATTCAGT GGCTGTCTCT GCCATTAGTG CATCTGCATT GGTACGACAA AGAAGTCCGT GAAGGCCGCA AAGTTGGTCA TCTGAATTTA AACGATCCAG AGGGTACGGC ATTAAGCGCA TCCCTGGCCG CACTGGCTCC TTTGCTACCC GCGGAGTATC AGAACGCACT GCGTTGGGCG CAAGATAAGT TATAA
|
Protein sequence | MKPVCVLGNG QLGRMLRQAG EPLGIAVYPV GLDAEPEAVP YQHSVITAEI ERWPETALTR ELATHTAFVN RDIFPRLADR LPQKQLLDSL GLATAPWQLL SSASEWPEVF ATLGELAIVK RRVGGYDGRG QWRLRPGEQG TLPPDAYGEC IVEQGINFSG EVSLIGARSH QGESVFYPLT HNLHEDGILR MSVALPQPNS KLQQQAEKML SAIMDKLNYV GVMAMECFIV GDRLLINELA PRVHNSGHWT QNGASISQFE LHLRAILDLP LPQPVVNTPS AMVNLIGTPV NIQWLSLPLV HLHWYDKEVR EGRKVGHLNL NDPEGTALSA SLAALAPLLP AEYQNALRWA QDKL
|
| |