Gene B21_00477 details

Gene Information

Locus tag: B21_00477
Symbol: purK
ID: 8114591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 520413
End bp: 521480
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 57%
IMG OID: 644846759
Product: hypothetical protein
Protein accession: YP_002998332
Protein GI: 251784028
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
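
The start, end, and length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS includes its stop codon; both are assumptions about this record's conventions, and the function names are hypothetical helpers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(520413, 521480)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.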


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.182977
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGT 
GAACCGTTAG GCATTGCTGT CTGGCCGGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT
TTTCAACAAA GCGTGATTAC CGCTGAGATC GAACGCTGGC CGGAAACCGC ATTAACCCGC
GAGCTGGCGC GTCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCCGACCGT
CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CCGCACCGTG GCAGTTACTT
GCCGATCGCA GCGAGTGGCC TGCGGTGTTT GAGCGTTTAG GTGAACTGGC GATTGTTAAG
CGTCGCACTG GTGGCTATGA CGGTCGCGGT CAATGGCGTT TACGTGCCAA TGAAACCGAA
CAGTTACCGG CAGAGTGTTA CGGCGAATGT ATTGTCGAGC AGGGCATTAA CTTCTCTGGT
GAAGTGTCGC TGGTTGGCGC ACGCGGATTT GATGGCAGCA CCGTGTTTTA TCCGCTGACG
CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTTCCGCA GGCCAATGCG
CAGCAGCAAG CGCAAGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG
GGCGTGATGG CGATGGAGTG TTTTGTCACC CCGCAAGGTC TGCTGATCAA CGAACTGGCA
CCGCGTGTGC ATAACAGCGG TCACTGGACA CAAAACGGTG CCAGCATCAG CCAGTTTGAG
CTGCATCTGC GGGCGATTAC CGATCTGCCG TTACCGCAAC CGGTAGTGAA TAGTCCGTCG
GTGATGATCA ACCTGATTGG TAGCGATGTG AATTATGACT GGCTGAAACT GCCGCTGGTG
CATCTGCACT GGTACGACAA AGAAGTCCGT CCGGGGCGTA AAGTGGGGCA TCTGAATTTG
ACCGACAGCG ACACATCGCG TCTGACCGCG ACGCTGGAAG CCTTGATCCC GCTGCTGCCG
CCGGAGTATG CCAGCGGCGT GATGTGGGCG CAGAGTAAGT TCAGTTAA
 
Protein sequence
MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETALTR 
ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL ADRSEWPAVF ERLGELAIVK
RRTGGYDGRG QWRLRANETE QLPAECYGEC IVEQGINFSG EVSLVGARGF DGSTVFYPLT
HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMAMECFVT PQGLLINELA
PRVHNSGHWT QNGASISQFE LHLRAITDLP LPQPVVNSPS VMINLIGSDV NYDWLKLPLV
HLHWYDKEVR PGRKVGHLNL TDSDTSRLTA TLEALIPLLP PEYASGVMWA QSKFS
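
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record: it builds the standard codon-to-amino-acid mapping (which translation table 11 shares for elongation codons), strips the 10-base grouping spaces, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here only because the gene begins with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence listed in this record.
print(translate("ATGAAACAGG TTTGCGTCCT CGGTAACGGG"))  # MKQVCVLGNG
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content fields.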