Gene Information |
Locus tag | YpAngola_A1274 |
Symbol | purK |
ID | 5799740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1333762 |
End bp | 1334826 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641339240 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001605809 |
Protein GI | 162419683 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
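
The length fields above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1333762
END_BP = 1334826


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1065, matches "Gene Length"
print(protein_length(length_bp))  # 354, matches "Protein Length"
```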
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000842444 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.961508 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCAG TTTGTGTACT GGGTAATGGC CAGTTAGGGC GAATGCTGCG GCAGGCAGGT GAACCGCTGG GAATTGCGGT TTATCCCGTC GGCTTAGATG CTGAACCTGA AGCGGTGCCT TATCAGCACA GTGTGATCAC CGCTGAAATT GAACGTTGGC CGGAAACCGC CTTAACCCGT GAATTAGCTA CCCATACTGC TTTTGTTAAT CGCGATATTT TTCCACGTCT GGCAGATCGT CTGCCCCAAA AGCAGTTACT CGATAGCTTG GGTTTGGCAA CTGCGCCGTG GCAATTGTTA TCCAGCGCCA GTGAATGGCC TGAGGTGTTC GCCACGTTGG GTGAGCTAGC CATCGTAAAA CGGCGGGTCG GCGGCTATGA CGGCCGGGGT CAATGGCGTT TACGCCCTGG TGAGCAGGGT ACCTTACCCC CCGATGCTTA CGGCGAGTGT ATTGTCGAAC AGGGGATTAA CTTCTCCGGC GAAGTCTCAT TGATCGGCGC GCGCAGCCAC CAAGGTGAAT CGGTATTTTA TCCACTGACC CATAATCTGC ATGAAGATGG CATTTTGCGC ATGAGCGTGG CATTACCACA GCCCAACAGC AAACTACAGC AGCAAGCCGA AAAAATGCTG TCAGCCATTA TGGATAAGCT GAATTATGTC GGTGTGATGG CGATGGAGTG TTTTATCGTC GGCGACCGTC TGTTGATCAA TGAACTGGCC CCGCGCGTTC ATAACAGTGG TCACTGGACA CAAAACGGCG CATCAATTAG CCAGTTCGAA TTGCATCTGC GGGCCATTTT GGATCTGCCA CTGCCGCAGC CGGTGGTGAA TACCCCGTCA GCGATGGTTA ATCTGATTGG CACGCCAGTA AATATTCAGT GGCTGTCTCT GCCATTAGTA CATCTGCATT GGTACGACAA AGAAGTCCGT GAAGGCCGCA AAGTTGGTCA TCTGAATTTA AACGATCCAG AGGGTACGGC ATTAAGCGCA TCCCTGGCCG CACTGGCTCC TTTGCTACCC GCGGAGTATC AGAACGCACT GCGTTGGGCG CAAGATAAGT TATAA
|
Protein sequence | MKPVCVLGNG QLGRMLRQAG EPLGIAVYPV GLDAEPEAVP YQHSVITAEI ERWPETALTR ELATHTAFVN RDIFPRLADR LPQKQLLDSL GLATAPWQLL SSASEWPEVF ATLGELAIVK RRVGGYDGRG QWRLRPGEQG TLPPDAYGEC IVEQGINFSG EVSLIGARSH QGESVFYPLT HNLHEDGILR MSVALPQPNS KLQQQAEKML SAIMDKLNYV GVMAMECFIV GDRLLINELA PRVHNSGHWT QNGASISQFE LHLRAILDLP LPQPVVNTPS AMVNLIGTPV NIQWLSLPLV HLHWYDKEVR EGRKVGHLNL NDPEGTALSA SLAALAPLLP AEYQNALRWA QDKL
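
The gene sequence as shown begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below strips the 10-character grouping and translates the first block of the gene sequence. It is a minimal illustration with hypothetical helper names, not the database's own pipeline. It uses the amino acid assignments of translation table 11, which are the same as those of the standard code, and does not handle the alternative start codons that table 11 permits.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
head = "ATGAAACCAG TTTGTGTACT GGGTAATGGC"
print(translate(head))  # MKPVCVLGNG, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the same check to the whole record.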