Gene ECH74115_1717 details


Gene Information

Locus tag: ECH74115_1717
Symbol: purU
ID: 6972428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1654323
End bp: 1655165
Gene Length: 843 bp
Protein Length: 280 aa
Translation table: 11
GC content: 48%
IMG OID: 643385671
Product: formyltetrahydrofolate deformylase
Protein accession: YP_002270163
Protein GI: 209396262
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0788] Formyltetrahydrofolate hydrolase
TIGRFAM ID: [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent
            [TIGR00655] formyltetrahydrofolate deformylase
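
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1654323, 1655165)
print(length, protein_length(length))  # 843 280
```

Both results agree with the Gene Length and Protein Length fields.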


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0183504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.737156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTCAC TCCAACGTAA AGTTCTGCGT ACTATTTGTC CGGACCAAAA AGGTCTGATC 
GCACGTATTA CCAATATTTG CTACAAGCAC GAGTTAAATA TCGTACAGAA CAATGAATTT
GTTGATCACC GTACCGGGCG CTTTTTTATG CGCACGGAAC TGGAAGGGAT TTTTAATGAT
TCCACCCTGC TGGCGGATCT CGATAGCGCA TTGCCAGAAG GCTCCGTGCG TGAGCTGAAT
CCTGCCGGTC GTCGCCGGAT AGTGATTCTG GTCACTAAAG AAGCGCATTG CCTTGGCGAT
TTGTTGATGA AAGCCAATTA TGGCGGCCTG GATGTCGAAA TCGCGGCAGT GATTGGTAAC
CACGATACTT TACGTTCTCT GGTTGAGCGT TTTGATATAC CGTTTGAGCT GGTAAGCCAT
GAAGGGTTAA GCCGCAACGA GCACGATCAA AAGATGGCGG ATGCCATTGA TGCTTATCAA
CCTGACTACG TGGTGCTGGC GAAGTATATG CGGGTATTAA CACCGGAATT TGTGTCACGC
TTCCCGAATA AGATCATCAA TATTCACCAT TCCTTCCTGC CAGCGTTTAT CGGCGCACGT
CCTTATCACC AGGCCTATGA ACGTGGCGTG AAGATTATTG GCGCAACCGC TCACTATGTG
AATGACAATC TGGACGAAGG CCCAATCATC ATGCAGGACG TTATTCATGT CGATCATACC
TACACAGCTG AAGATATGAT GCGCGCAGGT CGTGACGTCG AGAAAAACGT CTTAAGTCGC
GCGCTCTACA AAGTACTGGC GCAGCGCGTC TTTGTTTACG GTAATCGGAC GATTATTCTT
TAA
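
The Gene Length and GC content fields can be checked against the sequence block itself. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base grouped layout used here; `clean_sequence` and `gc_fraction` are hypothetical helper names, not functions of the source database:

```python
def clean_sequence(block: str) -> str:
    """Drop the spaces between 10-base groups and the line breaks; uppercase."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, used here as a small worked input.
example = clean_sequence("ATGCATTCAC TCCAACGTAA\nAGTTCTGCGT")
print(len(example), round(gc_fraction(example), 2))  # 30 0.43
```

Passing the full gene sequence block to the same two functions gives values that can be compared with the reported length and GC content.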
 
Protein sequence
MHSLQRKVLR TICPDQKGLI ARITNICYKH ELNIVQNNEF VDHRTGRFFM RTELEGIFND 
STLLADLDSA LPEGSVRELN PAGRRRIVIL VTKEAHCLGD LLMKANYGGL DVEIAAVIGN
HDTLRSLVER FDIPFELVSH EGLSRNEHDQ KMADAIDAYQ PDYVVLAKYM RVLTPEFVSR
FPNKIINIHH SFLPAFIGAR PYHQAYERGV KIIGATAHYV NDNLDEGPII MQDVIHVDHT
YTAEDMMRAG RDVEKNVLSR ALYKVLAQRV FVYGNRTIIL
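
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is one way to do that without external libraries; it assumes an unambiguous A/C/G/T sequence that starts at the first codon, and `translate` is an illustrative name. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons; since this gene starts with ATG, no special handling of the first codon is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with bases in TCAG order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a DNA block codon by codon, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()  # remove grouping spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCATTCAC TCCAACGTAA AGTTCTGCGT"))  # MHSLQRKVLR
```

The output matches the first ten residues of the protein sequence listed in this record.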