Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1783 |
Symbol | purT |
ID | 6064921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1983855 |
End bp | 1985033 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601198 |
Product | phosphoribosylglycinamide formyltransferase 2 |
Protein accession | YP_001724760 |
Protein GI | 170019806 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) |
TIGRFAM ID | [TIGR01142] phosphoribosylglycinamide formyltransferase 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.163487 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTAT TAGGCACTGC GCTGCGTCCG GCAGCAACTC GCGTGATGTT ATTAGGCTCC GGTGAACTGG GTAAAGAAGT GGCAATCGAG TGTCAGCGTC TCGGCGTAGA GGTGATTGCC GTCGATCGCT ATTCCGACGC ACCAGCCATG CATGTCGCGC ATCGCTCCCA TGTCATTAAT ATGCTTGATG GTGATGCATT ACGCCGTGTG GTTGAACTGG AAAAACCACA TTATATCGTG CCGGAGATCG AAGCTATTGC CACCGATATG CTGATCCAAC TTGAAGAGGA AGGACTGAAT GTTGTCCCCT GCGCTCGCGC AACGAAATTA ACGATGAATC GCGAGGGTAT CCGTCGCCTG GCGGCAGAAG AGCTGCAGCT GCCCACTTCC ACTTATCGTT TTGCCGATAG CGAAAGCCTT TTCCGCGAGG CGGTTGCCGC CATTGGCTAT CCCTGCATTG TAAAACCGGT GATGAGCTCT TCCGGCAAGG GGCAGACGTT TATTCGCTCT GCAGAGCAAC TTGCTCAGGC ATGGGAATAC GCTCAGCAAG GCGGTCGCGC CGGAGCGGGC CGCGTAATTG TTGAAGGCGT CGTTAAGTTT GACTTCGAAA TTACCCTGCT AACCGTCAAC GCGGTGGATG GCGTCCATTT CTGTGCACCA GTAGGTCATC GCCAGGAAGA TGGCGACTAC CGTGAATCCT GGCAACCACA GCAAATGAGC CCGCTTGCCC TTGAACGTGC GCAGGAGATT GCCCGTAAAG TGGTGCTGGC ACTGGGCGGT TATGGGTTGT TTGGTGTCGA GCTATTTGTC TGTGGTGATG AGGTGATTTT CAGTGAGGTC TCCCCTCGTC CACATGATAC CGGGATGGTG ACGTTAATTT CTCAAGATCT CTCAGAGTTT GCCCTGCATG TACGTGCCTT CCTCGGACTT CCGGTTGGCG GGATCCGTCA GTATGGTCCT GCAGCTTCTG CCGTTATTCT GCCACAACTG ACCAGTCAGA ATGTCACGTT TGATAATGTG CAGAATGCCG TAGGCGCAGA TTTGCAGATT CGTTTATTTG GTAAGCCGGA AATTGATGGC AGCCGTCGTC TGGGTGTTGC GCTGGCGACC GCTGAGAGTG TTGTTGACGC CATTGAACGC GCGAAGTACG CCGCCGGACA GGTAAAAGTA CAGGGTTAA
|
Protein sequence | MTLLGTALRP AATRVMLLGS GELGKEVAIE CQRLGVEVIA VDRYSDAPAM HVAHRSHVIN MLDGDALRRV VELEKPHYIV PEIEAIATDM LIQLEEEGLN VVPCARATKL TMNREGIRRL AAEELQLPTS TYRFADSESL FREAVAAIGY PCIVKPVMSS SGKGQTFIRS AEQLAQAWEY AQQGGRAGAG RVIVEGVVKF DFEITLLTVN AVDGVHFCAP VGHRQEDGDY RESWQPQQMS PLALERAQEI ARKVVLALGG YGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF ALHVRAFLGL PVGGIRQYGP AASAVILPQL TSQNVTFDNV QNAVGADLQI RLFGKPEIDG SRRLGVALAT AESVVDAIER AKYAAGQVKV QG
|
| |