Gene ECD_00472 details


Gene Information

Locus tag: ECD_00472
Symbol: purK
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 520614
End bp: 521681
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: phosphoribosylaminoimidazole carboxylase
Protein accession: ACT42371
Protein GI: 253976701
COG category:
COG ID:
TIGRFAM ID:
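
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 520614
end_bp = 521681

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1068
print(protein_length)  # 355
```

Both results agree with the Gene Length and Protein Length fields of the record.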


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.140967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAGG TTTGCGTCCT CGGTAACGGG CAGTTAGGCC GTATGCTGCG TCAGGCAGGT 
GAACCGTTAG GCATTGCTGT CTGGCCGGTC GGGCTGGACG CTGAACCGGC GGCGGTGCCT
TTTCAACAAA GCGTGATTAC CGCTGAGATC GAACGCTGGC CGGAAACCGC ATTAACCCGC
GAGCTGGCGC GTCATCCGGC CTTTGTGAAC CGCGATGTGT TCCCGATTAT TGCCGACCGT
CTGACTCAGA AGCAGCTTTT CGATAAGCTC CACCTGCCGA CCGCACCGTG GCAGTTACTT
GCCGATCGCA GCGAGTGGCC TGCGGTGTTT GAGCGTTTAG GTGAACTGGC GATTGTTAAG
CGTCGCACTG GTGGCTATGA CGGTCGCGGT CAATGGCGTT TACGTGCCAA TGAAACCGAA
CAGTTACCGG CAGAGTGTTA CGGCGAATGT ATTGTCGAGC AGGGCATTAA CTTCTCTGGT
GAAGTGTCGC TGGTTGGCGC ACGCGGATTT GATGGCAGCA CCGTGTTTTA TCCGCTGACG
CATAACCTGC ATCAGGACGG TATTTTGCGC ACCAGCGTCG CTTTTCCGCA GGCCAATGCG
CAGCAGCAAG CGCAAGCCGA AGAGATGCTG TCGGCGATTA TGCAGGAGCT GGGCTATGTG
GGCGTGATGG CGATGGAGTG TTTTGTCACC CCGCAAGGTC TGCTGATCAA CGAACTGGCA
CCGCGTGTGC ATAACAGCGG TCACTGGACA CAAAACGGTG CCAGCATCAG CCAGTTTGAG
CTGCATCTGC GGGCGATTAC CGATCTGCCG TTACCGCAAC CGGTAGTGAA TAGTCCGTCG
GTGATGATCA ACCTGATTGG TAGCGATGTG AATTATGACT GGCTGAAACT GCCGCTGGTG
CATCTGCACT GGTACGACAA AGAAGTCCGT CCGGGGCGTA AAGTGGGGCA TCTGAATTTG
ACCGACAGCG ACACATCGCG TCTGACCGCG ACGCTGGAAG CCTTGATCCC GCTGCTGCCG
CCGGAGTATG CCAGCGGCGT GATGTGGGCG CAGAGTAAGT TCAGTTAA
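
The GC content reported under Gene Information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base block layout used above. The helper names `strip_blocks` and `gc_percent` are hypothetical, and only the first line of the sequence is included here for brevity, so the printed percentage describes that line alone, not the whole gene.

```python
def strip_blocks(seq):
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a DNA string."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (60 bases).
first_line = (
    "ATGAAACAGG TTTGCGTCCT CGGTAACGGG "
    "CAGTTAGGCC GTATGCTGCG TCAGGCAGGT"
)

print(len(strip_blocks(first_line)))
print(round(gc_percent(first_line), 1))
```

To check the full gene, paste all 18 sequence lines into one string and pass it to `gc_percent`.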
 
Protein sequence
MKQVCVLGNG QLGRMLRQAG EPLGIAVWPV GLDAEPAAVP FQQSVITAEI ERWPETALTR 
ELARHPAFVN RDVFPIIADR LTQKQLFDKL HLPTAPWQLL ADRSEWPAVF ERLGELAIVK
RRTGGYDGRG QWRLRANETE QLPAECYGEC IVEQGINFSG EVSLVGARGF DGSTVFYPLT
HNLHQDGILR TSVAFPQANA QQQAQAEEML SAIMQELGYV GVMAMECFVT PQGLLINELA
PRVHNSGHWT QNGASISQFE LHLRAITDLP LPQPVVNSPS VMINLIGSDV NYDWLKLPLV
HLHWYDKEVR PGRKVGHLNL TDSDTSRLTA TLEALIPLLP PEYASGVMWA QSKFS
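
The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the initiation codons it permits, not in the amino acid assignments). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases and the last 9 bases of the gene are used as input.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAAACAGG TTTGCGTCCT CGGTAACGGG"))  # MKQVCVLGNG

# Last nine bases of the gene sequence, including the stop codon TAA.
print(translate("TTCAGTTAA"))  # FS
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.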