Gene SbBS512_E0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0452 
SymbolpurK 
ID6271010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp440070 
End bp441137 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content57% 
IMG OID641724677 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001879225 
Protein GI187730443 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0002427 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGC 
GAACCGTTAG GCATTGCTGT CTGGCCGGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT
TTTCAACAAA GCGTGATTAC CGCTGAGATC GAACGCTGGC CGGAAACCGT ATTAACCCGC
GAGCTGGCGC GTCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCTGACCGT
CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CCGCGCCGTG GCAGTTACTT
GCCGATCGCA GCGAGTGGCC TGCGGTGTTT GATCGTTTAG GTGAACTGGC GATTGTTAAG
CGTCGCACTG GTGGCTATGA CGGTCGCGGT CAATGGCGTT TACGCGCCAA TGAAACCGAA
CAGTTACCGG CAGAGTGTTA CGGCGAATGT ATTGTCGAGC AGGGCATTAA CTTCTCTGGT
GAAGTGTCGC TGGTTGGCGC GCGCGGCTTT GATGGCAGCA CCGTGTTTTA TCCGCTGACG
CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTTCCGCA GGCCAACGCA
CAGCAGCAGG CGCAAGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG
GGCGTGATGA CGATGGAGTG TTTTGTCACC CCGCAAGGTC TGCTGATCAA CGAACTGGCT
CCGCGTGTGC ATAACAGCGG TCACTGGACA CAAAACGGTG CCAGCATCAG CCAGTTTGAG
CTGCATCTGC GGGCGATTAC CGCTCTGCCG TTACCGCAAC CAGTAGTGAA TAATCCGTCG
GTGATGATCA ATCTGATTGG TAGCGATGCG AATTATGACT GGCTGAAATT GCCGCTGGTG
CATCTGCACT GGTACGACAA AGAAGTCCAT CCGGGGCGTA AAGTGGGGCA TCTGAATTTG
ACCGACAGCG ACACATCGCG TCTGACTGCG ACGCTGGAAG CCTTAATCCC GCTGCTGCCG
CCAGAGTATG CCAGCGGCGT GATTTGGGCG CAGAGTAAGT TCAGTTAA
 
Protein sequence
MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETVLTR 
ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL ADRSEWPAVF DRLGELAIVK
RRTGGYDGRG QWRLRANETE QLPAECYGEC IVEQGINFSG EVSLVGARGF DGSTVFYPLT
HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMTMECFVT PQGLLINELA
PRVHNSGHWT QNGASISQFE LHLRAITALP LPQPVVNNPS VMINLIGSDA NYDWLKLPLV
HLHWYDKEVH PGRKVGHLNL TDSDTSRLTA TLEALIPLLP PEYASGVIWA QSKFS