Gene YpsIP31758_3018 details


Gene Information

Locus tag: YpsIP31758_3018
Symbol: purK
ID: 5385511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3396498
End bp: 3397562
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 53%
IMG OID: 640866023
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_001401978
Protein GI: 153948176
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
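
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS that ends in one uncounted stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3396498, 3397562)
print(length)                     # 1065
print(protein_length_aa(length))  # 354
```

Both values agree with the Gene Length and Protein Length fields of this record.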


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000000190426
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCAG TTTGTGTACT GGGTAATGGC CAGTTAGGGC GAATGCTGCG GCAGGCAGGT 
GAACCGCTAG GAATTGCTGT TTATCCCGTC GGCTTAGATG CTGAACCTGA AGCGGTGCCT
TATCAGCACA GTGTGATCAC CGCTGAAATT GAACGTTGGC CGGAAACCGC CTTAACCCGT
GAATTAGCTA CCCATACTGC TTTTGTTAAT CGCGATATTT TTCCACGTCT GGCAGATCGT
CTGCCCCAAA AGCAGTTACT CGATAGCTTG GGTTTGGCAA CCGCGCCGTG GCAATTGTTA
TCCAGCGCCA GTGAATGGCC TGAGGTGTTC GCCACGCTGG GTGAGCTAGC CATCGTAAAA
CGGCGGGTCG GCGGCTATGA CGGCCGGGGT CAATGGCGTT TACGCCCTGG TGAGCAGGGT
ACCTTACCCC CCGATGCTTA CGGCGAGTGT ATTGTCGAAC AGGGGATTAA CTTCTCCGGC
GAAGTCTCAT TGATCGGCGC GCGCAGCCAC CAAGGTGAAT CGGTATTTTA TCCACTGACC
CATAATCTGC ATGAAGATGG CATTTTGCGC ATGAGCGTGG CATTACCACA GCCCAACAGC
AAACTACAGC AGCAAGCCGA AAAAATGCTG TCAGCCATTA TGGATAAGCT GAATTATGTC
GGTGTGATGG CGATGGAGTG TTTTATCGTC GGCGACCGTC TGTTGATCAA TGAACTGGCC
CCGCGCGTTC ATAACAGTGG TCACTGGACA CAAAACGGCG CATCGATTAG CCAGTTCGAA
TTGCATCTGC GGGCCATTTT GGATCTGCCA CTGCCGCAGC CGGTGGTGAA CACCCCGTCA
GCGATGGTTA ATCTGATTGG CACGCCAGTA AATATTCAGT GGCTGTCTCT GCCATTAGTG
CATCTGCATT GGTACGACAA AGAAGTCCGT GAAGGCCGCA AAGTTGGTCA TCTGAATTTA
AACGATCCAG AGGGTACGGC ATTAAGCGCA TCCCTGGCCG CACTGGCTCC TTTGCTACCC
GCGGAGTATC AGAACGCACT GCGTTGGGCG CAAGATAAGT TATAA
 
Protein sequence
MKPVCVLGNG QLGRMLRQAG EPLGIAVYPV GLDAEPEAVP YQHSVITAEI ERWPETALTR 
ELATHTAFVN RDIFPRLADR LPQKQLLDSL GLATAPWQLL SSASEWPEVF ATLGELAIVK
RRVGGYDGRG QWRLRPGEQG TLPPDAYGEC IVEQGINFSG EVSLIGARSH QGESVFYPLT
HNLHEDGILR MSVALPQPNS KLQQQAEKML SAIMDKLNYV GVMAMECFIV GDRLLINELA
PRVHNSGHWT QNGASISQFE LHLRAILDLP LPQPVVNTPS AMVNLIGTPV NIQWLSLPLV
HLHWYDKEVR EGRKVGHLNL NDPEGTALSA SLAALAPLLP AEYQNALRWA QDKL
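
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes the listed sequence is the coding strand read from its first base. Translation table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons, which this sketch does not model.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used for 10-character blocking."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 45 bases of the gene sequence listed above.
first_block = "ATGAAACCAG TTTGTGTACT GGGTAATGGC"
last_block = "GCGGAGTATC AGAACGCACT GCGTTGGGCG CAAGATAAGT TATAA"
print(translate(first_block))  # MKPVCVLGNG
print(translate(last_block))   # AEYQNALRWAQDKL
```

The final block is translated on its own here on the assumption that it begins in frame, which holds if each preceding full line carries 60 bases. Passing the complete gene sequence to `gc_fraction` gives a value to compare with the GC content field.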