Gene ECH74115_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1688 
Symbolprs 
ID6968392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1628306 
End bp1629253 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content53% 
IMG OID643385647 
Productribose-phosphate pyrophosphokinase 
Protein accessionYP_002270141 
Protein GI209398206 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0462] Phosphoribosylpyrophosphate synthetase 
TIGRFAM ID[TIGR01251] ribose-phosphate pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000327484 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.706483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGATA TGAAGCTTTT TGCTGGTAAC GCCACCCCGG AACTAGCACA ACGTATTGCC 
AACCGCCTGT ACACTTCACT CGGCGACGCC GCTGTAGGTC GCTTTAGCGA TGGCGAAGTC
AGCGTACAAA TTAATGAAAA TGTACGCGGT GGTGATATTT TCATCATCCA GTCCACTTGT
GCCCCTACTA ACGACAACCT GATGGAATTA GTCGTTATGG TTGATGCCCT GCGTCGTGCT
TCCGCAGGTC GTATCACCGC TGTTATCCCC TACTTTGGCT ATGCGCGCCA GGACCGTCGC
GTCCGTTCCG CTCGTGTACC AATCACTGCG AAAGTGGTTG CAGACTTCCT CTCCAGCGTC
GGTGTTGACC GTGTGCTGAC AGTGGATCTG CACGCTGAAC AGATTCAGGG TTTCTTCGAC
GTTCCGGTTG ATAACGTATT TGGTAGCCCG ATCCTGCTGG AAGACATGCT GCAGCTGAAT
CTGGATAACC CAATTGTGGT TTCTCCGGAC ATCGGCGGCG TTGTGCGTGC CCGCGCTATC
GCTAAGCTGC TGAACGATAC CGATATGGCA ATCATCGACA AACGTCGTCC GCGTGCGAAC
GTTTCCCAGG TGATGCATAT CATCGGTGAC GTTGCAGGTC GTGACTGCGT ACTGGTCGAT
GATATGATCG ACACTGGCGG TACGCTGTGT AAAGCTGCTG AAGCTCTGAA AGAACGTGGT
GCTAAACGTG TATTTGCGTA CGCGACTCAC CCGATCTTCT CTGGCAACGC GGCGAACAAC
CTGCGTAACT CTGTAATTGA TGAAGTCGTT GTCTGCGATA CCATTCCGCT GAGCGATGAA
ATCAAATCAC TGCCGAACGT GCGTACTCTG ACCCTGTCAG GTATGCTGGC CGAAGCGATT
CGTCGTATCA GCAACGAAGA ATCGATCTCT GCCATGTTCG AACACTAA
 
Protein sequence
MPDMKLFAGN ATPELAQRIA NRLYTSLGDA AVGRFSDGEV SVQINENVRG GDIFIIQSTC 
APTNDNLMEL VVMVDALRRA SAGRITAVIP YFGYARQDRR VRSARVPITA KVVADFLSSV
GVDRVLTVDL HAEQIQGFFD VPVDNVFGSP ILLEDMLQLN LDNPIVVSPD IGGVVRARAI
AKLLNDTDMA IIDKRRPRAN VSQVMHIIGD VAGRDCVLVD DMIDTGGTLC KAAEALKERG
AKRVFAYATH PIFSGNAANN LRNSVIDEVV VCDTIPLSDE IKSLPNVRTL TLSGMLAEAI
RRISNEESIS AMFEH