Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1717 |
Symbol | purU |
ID | 6972428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1654323 |
End bp | 1655165 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643385671 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_002270163 |
Protein GI | 209396262 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0183504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.737156 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTCAC TCCAACGTAA AGTTCTGCGT ACTATTTGTC CGGACCAAAA AGGTCTGATC GCACGTATTA CCAATATTTG CTACAAGCAC GAGTTAAATA TCGTACAGAA CAATGAATTT GTTGATCACC GTACCGGGCG CTTTTTTATG CGCACGGAAC TGGAAGGGAT TTTTAATGAT TCCACCCTGC TGGCGGATCT CGATAGCGCA TTGCCAGAAG GCTCCGTGCG TGAGCTGAAT CCTGCCGGTC GTCGCCGGAT AGTGATTCTG GTCACTAAAG AAGCGCATTG CCTTGGCGAT TTGTTGATGA AAGCCAATTA TGGCGGCCTG GATGTCGAAA TCGCGGCAGT GATTGGTAAC CACGATACTT TACGTTCTCT GGTTGAGCGT TTTGATATAC CGTTTGAGCT GGTAAGCCAT GAAGGGTTAA GCCGCAACGA GCACGATCAA AAGATGGCGG ATGCCATTGA TGCTTATCAA CCTGACTACG TGGTGCTGGC GAAGTATATG CGGGTATTAA CACCGGAATT TGTGTCACGC TTCCCGAATA AGATCATCAA TATTCACCAT TCCTTCCTGC CAGCGTTTAT CGGCGCACGT CCTTATCACC AGGCCTATGA ACGTGGCGTG AAGATTATTG GCGCAACCGC TCACTATGTG AATGACAATC TGGACGAAGG CCCAATCATC ATGCAGGACG TTATTCATGT CGATCATACC TACACAGCTG AAGATATGAT GCGCGCAGGT CGTGACGTCG AGAAAAACGT CTTAAGTCGC GCGCTCTACA AAGTACTGGC GCAGCGCGTC TTTGTTTACG GTAATCGGAC GATTATTCTT TAA
|
Protein sequence | MHSLQRKVLR TICPDQKGLI ARITNICYKH ELNIVQNNEF VDHRTGRFFM RTELEGIFND STLLADLDSA LPEGSVRELN PAGRRRIVIL VTKEAHCLGD LLMKANYGGL DVEIAAVIGN HDTLRSLVER FDIPFELVSH EGLSRNEHDQ KMADAIDAYQ PDYVVLAKYM RVLTPEFVSR FPNKIINIHH SFLPAFIGAR PYHQAYERGV KIIGATAHYV NDNLDEGPII MQDVIHVDHT YTAEDMMRAG RDVEKNVLSR ALYKVLAQRV FVYGNRTIIL
|
| |