Gene Information |
Locus tag | EcSMS35_1909 |
Symbol | purU |
ID | 6147441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1929657 |
End bp | 1930499 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616785 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001743963 |
Protein GI | 170683587 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
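The coordinate and length fields above are related by simple arithmetic, sketched below as a consistency check. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1929657, 1930499)
print(length, protein_length_aa(length))  # prints: 843 280
```

Both values agree with the Gene Length and Protein Length fields in the record.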
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0165731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.142161 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTCAC TCCAACGTAA AGTTCTGCGT ACTATTTGTC CGGACCAAAA AGGTCTGATC GCACGTATCA CCAATATTTG CTACAAGCAC GAGTTAAATA TCGTACAGAA CAATGAATTT GTTGATCACC GTACCGGGCG CTTTTTTATG CGCACGGAAC TGGAGGGGAT TTTTAATGAT TCCACCCTGC TGGCGGATCT CGATAGCGCA TTGCCAGAAG GCTCCGTACG TGAGCTGAAT CCTGCCGGTC GTCGCCGGAT AGTGATTCTG GTCACTAAAG AGGCGCATTG CCTTGGCGAT TTGTTGATGA AAGCCAATTA CGGCGGCCTG GATGTCGAAA TCGCGGCAGT TATTGGTAAC CACGATACTT TACGTTCTCT GGTTGAGCGT TTTGATATTC CGTTTGAGCT GGTAAGTCAT GAAGGATTAA GTCGCAACGA GCACGATCAA AAGATGGCGG ATGCCATTGA TGCTTATCAG CCTGACTACG TGGTGCTGGC GAAGTATATG CGGGTATTAA CGCCAGAATT TGTGGCACGC TTCCCGAATA AGATCATCAA TATTCACCAT TCATTCCTGC CAGCGTTTAT CGGCGCACGT CCTTATCACC AGGCCTATGA ACGTGGCGTG AAGATTATTG GCGCAACCGC TCACTATGTG AATGACAATC TGGACGAAGG TCCAATCATC ATGCAGGACG TTATTCATGT CGATCATACC TACACAGCTG AAGATATGAT GCGCGCAGGT CGTGACGTCG AGAAAAACGT CTTAAGTCGC GCACTCTACA AAGTACTGGC GCAGCGCGTC TTTGTTTACG GTAATCGGAC GATTATTCTT TAA
|
Protein sequence | MHSLQRKVLR TICPDQKGLI ARITNICYKH ELNIVQNNEF VDHRTGRFFM RTELEGIFND STLLADLDSA LPEGSVRELN PAGRRRIVIL VTKEAHCLGD LLMKANYGGL DVEIAAVIGN HDTLRSLVER FDIPFELVSH EGLSRNEHDQ KMADAIDAYQ PDYVVLAKYM RVLTPEFVAR FPNKIINIHH SFLPAFIGAR PYHQAYERGV KIIGATAHYV NDNLDEGPII MQDVIHVDHT YTAEDMMRAG RDVEKNVLSR ALYKVLAQRV FVYGNRTIIL
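The sequence fields can be cross-checked against the GC content and protein fields with a short script. The following is a minimal sketch, not the database's own method: it strips the 10-base spacing, computes the G+C fraction, and translates codons up to the first stop. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard assignments are used here. All function names are illustrative.

```python
from itertools import product

# Codon -> amino acid map in TCAG order; table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence field above.
print(translate("ATGCATTCAC TCCAACGTAA A"))  # prints: MHSLQRK
```

Passing the full gene sequence field to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence fields.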