Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1338 |
Symbol | purT |
ID | 6146771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1328046 |
End bp | 1329224 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616216 |
Product | phosphoribosylglycinamide formyltransferase 2 |
Protein accession | YP_001743396 |
Protein GI | 170684011 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) |
TIGRFAM ID | [TIGR01142] phosphoribosylglycinamide formyltransferase 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTAT TAGGCACTGC GCTGCGTCCG GCAGCAACTC GCGTGATGTT ATTAGGCTCC GGTGAACTGG GTAAAGAAGT GGCAATCGAG TGTCAGCGTC TCGGCGTAGA GGTGATTGCC GTCGATCGCT ATGCCGACGC GCCAGCTATG CATGTCGCGC ATCGCTCCCA TGTCATTAAT ATGCTTGATG GTGATGCATT ACGCCGTGTG GTTGAACTGG AAAAACCACA TTATATCGTG CCGGAGATCG AAGCTATTGC CACCGATATG CTGATCCAAC TTGAAGAGGA AGGACTGAAT GTTGTCCCCT GCGCGCGCGC AACGAAATTA ACGATGAATC GCGAGGGTAT CCGTCGCCTG GCGGCAGAAG AGCTGCAGCT GCCCACTTCC ACTTATCGTT TTGCCGATAG CGAAAACCTT TTCCGCGAGG CGGTTGCTGA CATCGGCTAT CCCTGCATTG TTAAACCGGT GATGAGTTCC TCCGGCAAGG GGCAGACGTT TATTCGTTCT GCAGAGCAAC TTGCTCAGGC ATGGGAGTAC GCTCAGCAAG GCGGTCGCGC CGGAGCGGGC CGTGTAATTG TTGAAGGCGT CGTTAAGTTT GATTTCGAAA TTACCCTGCT AACCGTCAGC GCGGTGGATG GCGTCCATTT CTGTGCACCG GTAGGTCATC GCCAGGAAGA TGGCGACTAT CGTGAATCCT GGCAACCACA ACAAATGAGC CCGCTTGCCC TTGAACGTGC GCAGGAGATT GCCCGTAAAG TGGTGCTGGC ACTGGGCGGT TATGGGCTGT TTGGTGTCGA GCTGTTTGTC TGTGGTGATG AGGTGATTTT CAGTGAAGTC TCCCCTCGCC CGCATGACAC CGGGATGGTG ACGCTAATTT CTCAGGATCT CTCGGAGTTT GCCCTGCATG TACGTGCCTT CCTCGGACTT CCGGTTGGCG GGATCCGCCA GTATGGTCCT GCTGCTTCTG CCGTTATCCT GCCGCAACTG ACCAGTCAGA ATGTCACGTT TGATAATGTG CAGAATGCCG TAGGCGCAGA TTTACAGATT CGTTTATTTG GTAAGCCGGA AATTGATGGC AGCCGTCGTC TGGGGGTAGC ACTGGCTACT GCAGAGAGTG TTGTTGACGC CATTGAACGC GCGAAGCACG CCGCCGGACA GGTAAAAGTA CAGGGTTAA
|
Protein sequence | MTLLGTALRP AATRVMLLGS GELGKEVAIE CQRLGVEVIA VDRYADAPAM HVAHRSHVIN MLDGDALRRV VELEKPHYIV PEIEAIATDM LIQLEEEGLN VVPCARATKL TMNREGIRRL AAEELQLPTS TYRFADSENL FREAVADIGY PCIVKPVMSS SGKGQTFIRS AEQLAQAWEY AQQGGRAGAG RVIVEGVVKF DFEITLLTVS AVDGVHFCAP VGHRQEDGDY RESWQPQQMS PLALERAQEI ARKVVLALGG YGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF ALHVRAFLGL PVGGIRQYGP AASAVILPQL TSQNVTFDNV QNAVGADLQI RLFGKPEIDG SRRLGVALAT AESVVDAIER AKHAAGQVKV QG
|
| |