Gene YpAngola_A1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1274 
SymbolpurK 
ID5799740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1333762 
End bp1334826 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content52% 
IMG OID641339240 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001605809 
Protein GI162419683 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000842444 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.961508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAG TTTGTGTACT GGGTAATGGC CAGTTAGGGC GAATGCTGCG GCAGGCAGGT 
GAACCGCTGG GAATTGCGGT TTATCCCGTC GGCTTAGATG CTGAACCTGA AGCGGTGCCT
TATCAGCACA GTGTGATCAC CGCTGAAATT GAACGTTGGC CGGAAACCGC CTTAACCCGT
GAATTAGCTA CCCATACTGC TTTTGTTAAT CGCGATATTT TTCCACGTCT GGCAGATCGT
CTGCCCCAAA AGCAGTTACT CGATAGCTTG GGTTTGGCAA CTGCGCCGTG GCAATTGTTA
TCCAGCGCCA GTGAATGGCC TGAGGTGTTC GCCACGTTGG GTGAGCTAGC CATCGTAAAA
CGGCGGGTCG GCGGCTATGA CGGCCGGGGT CAATGGCGTT TACGCCCTGG TGAGCAGGGT
ACCTTACCCC CCGATGCTTA CGGCGAGTGT ATTGTCGAAC AGGGGATTAA CTTCTCCGGC
GAAGTCTCAT TGATCGGCGC GCGCAGCCAC CAAGGTGAAT CGGTATTTTA TCCACTGACC
CATAATCTGC ATGAAGATGG CATTTTGCGC ATGAGCGTGG CATTACCACA GCCCAACAGC
AAACTACAGC AGCAAGCCGA AAAAATGCTG TCAGCCATTA TGGATAAGCT GAATTATGTC
GGTGTGATGG CGATGGAGTG TTTTATCGTC GGCGACCGTC TGTTGATCAA TGAACTGGCC
CCGCGCGTTC ATAACAGTGG TCACTGGACA CAAAACGGCG CATCAATTAG CCAGTTCGAA
TTGCATCTGC GGGCCATTTT GGATCTGCCA CTGCCGCAGC CGGTGGTGAA TACCCCGTCA
GCGATGGTTA ATCTGATTGG CACGCCAGTA AATATTCAGT GGCTGTCTCT GCCATTAGTA
CATCTGCATT GGTACGACAA AGAAGTCCGT GAAGGCCGCA AAGTTGGTCA TCTGAATTTA
AACGATCCAG AGGGTACGGC ATTAAGCGCA TCCCTGGCCG CACTGGCTCC TTTGCTACCC
GCGGAGTATC AGAACGCACT GCGTTGGGCG CAAGATAAGT TATAA
 
Protein sequence
MKPVCVLGNG QLGRMLRQAG EPLGIAVYPV GLDAEPEAVP YQHSVITAEI ERWPETALTR 
ELATHTAFVN RDIFPRLADR LPQKQLLDSL GLATAPWQLL SSASEWPEVF ATLGELAIVK
RRVGGYDGRG QWRLRPGEQG TLPPDAYGEC IVEQGINFSG EVSLIGARSH QGESVFYPLT
HNLHEDGILR MSVALPQPNS KLQQQAEKML SAIMDKLNYV GVMAMECFIV GDRLLINELA
PRVHNSGHWT QNGASISQFE LHLRAILDLP LPQPVVNTPS AMVNLIGTPV NIQWLSLPLV
HLHWYDKEVR EGRKVGHLNL NDPEGTALSA SLAALAPLLP AEYQNALRWA QDKL