Gene ECH74115_3452 details

Gene Information

Locus tag: ECH74115_3452
Symbol: purF
ID: 6968436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3196253
End bp: 3197770
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 53%
IMG OID: 643387258
Product: amidophosphoribosyltransferase
Protein accession: YP_002271721
Protein GI: 209397230
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase
TIGRFAM ID: [TIGR01134] amidophosphoribosyltransferase
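
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the start and end positions are inclusive (which is consistent with the stated 1518 bp) and that the CDS is unspliced with a single terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3196253
end_bp = 3197770

# Assumption: both endpoints are part of the gene (inclusive coordinates).
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is the stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1518
print(protein_length)  # 505
```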


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGGTA TTGTCGGTAT CGCCGGTGTT ATGCCGGTTA ACCAGTCGAT TTATGATGCC 
TTAACGGTGC TTCAGCATCG CGGTCAGGAT GCCGCCGGCA TCATCACCAT AGATGCCAAT
AACTGCTTCC GTTTGCGTAA AGCGAACGGG CTGGTGAGCG ATGTATTTGA AGCTCGCCAT
ATGCAGCGTT TGCAGGGCAA TATGGGCATT GGTCATGTGC GTTACCCTAC GGCTGGCAGC
TCCAGCGCCT CTGAAGCGCA GCCGTTTTAC GTTAACTCTC CGTATGGCAT TACGCTTGCC
CACAACGGCA ATCTGACCAA CGCTCACGAG TTGCGTAAAA AACTGTTTGA AGAAAAACGC
CGCCACATCA ACACCACTTC CGACTCGGAA ATTCTGCTTA ATATCTTCGC CAGCGAGCTG
GACAACTTCC GCCACTACCC GCTTGAAGCC GACAACATTT TCGCCGCCAT CGCTGCTACA
AACCGCTTAA TCCGCGGCGC GTATGCCTGT GTGGCGATGA TTATCGGCCA CGGTATGGTT
GCTTTCCGCG ATCCAAACGG TATTCGTCCG CTGGTTCTGG GAAAACGTGA TATTGACGAG
AACCGCACAG AATATATGGT CGCTTCCGAA AGCGTTGCGC TCGATACACT GGGCTTTGAT
TTCCTGCGTG ACGTCGCGCC TGGCGAAGCG ATTTACATCA CTGAAGAAGG GCAGTTGTTT
ACCCGTCAAT GTGCTGACAA TCCGGTCAGC AATCCGTGCC TGTTTGAGTA TGTATACTTT
GCTCGCCCGG ACTCGTTCAT CGACAAAATT TCCGTTTACA GCGCGCGTGT GAATATGGGC
ACGAAGCTGG GCGAGAAAAT TGCCCGCGAA TGGGAAGATC TGGATATCGA CGTGGTGATC
CCGATCCCGG AAACCTCGTG TGATATCGCG CTGGAAATTG CTCGTATTCT GGGCAAACCG
TACCGCCAGG GCTTCGTTAA AAACCGCTAT GTTGGCCGCA CCTTCATCAT GCCGGGTCAG
CAGCTGCGTC GTAAGTCCGT GCGCCGTAAA CTGAACGCCA ACCGCGCCGA GTTCCGCGAT
AAAAACGTCC TGCTGGTCGA CGACTCTATC GTCCGTGGCA CCACTTCTGA GCAGATTATC
GAGATGGCAC GCGAAGCCGG AGCGAAGAAA GTGTACCTCG CTTCTGCGGC ACCGGAAATT
CGCTTCCCGA ACGTTTATGG TATTGATATG CCGAGCGCCA CGGAACTGAT CGCTCACGGT
CGCGAAGTAG ATGAAATTCG CCAGATCATC GGTGCTGACG GGTTGATTTT CCAGGATCTG
AACGATCTGA TCGAAGCCGT TCGCGCTGAA AACCCGGATA TCCAGCAGTT TGAATGCTCG
GTGTTCAACG GCGTCTACGT CACCAAAGAT GTTGATCAGG GCTACCTCGA TTTCCTCGAT
ACTTTACGTA ATGACGACGC CAAAGCAGTG CAACGTCAGA ACGAAGTGGA AAATCTCGAA
ATGCATAACG AAGGATGA
 
Protein sequence
MCGIVGIAGV MPVNQSIYDA LTVLQHRGQD AAGIITIDAN NCFRLRKANG LVSDVFEARH 
MQRLQGNMGI GHVRYPTAGS SSASEAQPFY VNSPYGITLA HNGNLTNAHE LRKKLFEEKR
RHINTTSDSE ILLNIFASEL DNFRHYPLEA DNIFAAIAAT NRLIRGAYAC VAMIIGHGMV
AFRDPNGIRP LVLGKRDIDE NRTEYMVASE SVALDTLGFD FLRDVAPGEA IYITEEGQLF
TRQCADNPVS NPCLFEYVYF ARPDSFIDKI SVYSARVNMG TKLGEKIARE WEDLDIDVVI
PIPETSCDIA LEIARILGKP YRQGFVKNRY VGRTFIMPGQ QLRRKSVRRK LNANRAEFRD
KNVLLVDDSI VRGTTSEQII EMAREAGAKK VYLASAAPEI RFPNVYGIDM PSATELIAHG
REVDEIRQII GADGLIFQDL NDLIEAVRAE NPDIQQFECS VFNGVYVTKD VDQGYLDFLD
TLRNDDAKAV QRQNEVENLE MHNEG
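
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the listing could be cleaned, translated, and checked for GC content using only the Python standard library. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard one, which translation table 11 shares for amino-acid assignments (the two differ in permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order. Translation table 11
# uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))


def clean_sequence(text):
    """Drop the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence as printed in the record.
# The last line is in frame because 1500 bases (a multiple of 3) precede it.
first_line = "ATGTGCGGTA TTGTCGGTAT CGCCGGTGTT ATGCCGGTTA ACCAGTCGAT TTATGATGCC"
last_line = "ATGCATAACG AAGGATGA"

print(translate(clean_sequence(first_line)))  # MCGIVGIAGVMPVNQSIYDA
print(translate(clean_sequence(last_line)))   # MHNEG
```

Both outputs match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field.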