Gene Information |
Locus tag | SeHA_C2097 |
Symbol | purT |
ID | 6489481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 2024685 |
End bp | 2025863 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642742293 |
Product | phosphoribosylglycinamide formyltransferase 2 |
Protein accession | YP_002045936 |
Protein GI | 194448671 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) |
TIGRFAM ID | [TIGR01142] phosphoribosylglycinamide formyltransferase 2 |
|
|
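The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2024685, 2025863

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1179, matches "Gene Length"
print(protein_length(length_bp))  # 392, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the table.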
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.310007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTAT TAGGCACTGC GCTGCGTCCG GCAGCAACGC GGGTGATGTT ATTAGGGGCA GGTGAATTGG GAAAAGAGGT GGCGATTGAA TGCCAACGCC TGGGGATCGA GGTTATCGCC GTCGATCGCT ATCCTGATGC TCCCGCCATG CATGTGGCTC ACCGTTCACA CGTCATTAAT ATGCTGGACG GCGAGGCGCT ACGTCATGTG ATTACAGAGG AAAAACCGCA TTATATCGTG CCGGAAATAG AAGCGATCGC CACCGATACG CTGCGCGAGC TGGAGGGCGA AGGGCTGAAT GTCGTGCCTT GCGCCCGTGC AACGCAGCTC ACGATGAACC GCGAAGGGAT CCGTCGCCTG GCCGCAGAAG AATTAGGACT GCCGACATCG ACGTATCGCT TTGCCGACAG TGAGGCCAGT TTTCATGATG CGGTAGCCGC AGTGGGTTTT CCTTGCATCG TCAAACCGGT CATGAGCTCT TCCGGCAAAG GCCAGAGCTT TATCCGCTCG GCCGAACAAC TCGCGCAGGC ATGGGAGTAT GCTCAACAGG GCGGACGCGC TGGCGCGGGT CGCGTGATTG TGGAAGGCGT GGTTAAATTT GATTTTGAAA TTACGCTGCT CACCGTTAGC GCCGTCGATG GCGTGCATTT CTGCGCGCCG GTCGGTCATC GTCAGCAAGA TGGTGACTAT CGCGAATCCT GGCAGCCACA GCAGATGAGC GAACTGGCGC TGAAGCGGGC GCAAGAGATT GCTCGTCATG TGGTACTGGC GTTAGGCGGT CATGGCCTGT TCGGCGTTGA ACTCTTCGTC TGTGGCGATG AAGTCATTTT CAGCGAAGTC TCCCCTCGCC CGCACGATAC CGGAATGGTC ACGTTGATTT CTCAGGATCT CTCTGAGTTT GCGCTGCATG TGCGCGCCTT TCTGGGAATG CCCGTAGGCG CTATTCGCCA GTATGGTCCC GCTGCCTCGG CCGTGATTCT GCCGCAGCTT ACCAGTCAAA ATGTGACGTT TGATAATGTA CACGCGGCGG TAGGAGCCGG AGTGCAGGTA CGGCTGTTTG GTAAGCCTGA GATCGACGGC ACCCGTCGTC TTGGTGTAGC GTTAGCGACA GGTGAAAACG TTGAAGAAGC GGTGATAAGA GCGAAAAAGG CCGTCAGCCG CGTGACGGTA AAAGGCTAA
|
Protein sequence | MTLLGTALRP AATRVMLLGA GELGKEVAIE CQRLGIEVIA VDRYPDAPAM HVAHRSHVIN MLDGEALRHV ITEEKPHYIV PEIEAIATDT LRELEGEGLN VVPCARATQL TMNREGIRRL AAEELGLPTS TYRFADSEAS FHDAVAAVGF PCIVKPVMSS SGKGQSFIRS AEQLAQAWEY AQQGGRAGAG RVIVEGVVKF DFEITLLTVS AVDGVHFCAP VGHRQQDGDY RESWQPQQMS ELALKRAQEI ARHVVLALGG HGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF ALHVRAFLGM PVGAIRQYGP AASAVILPQL TSQNVTFDNV HAAVGAGVQV RLFGKPEIDG TRRLGVALAT GENVEEAVIR AKKAVSRVTV KG
|
| |
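The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that strips the 10-base spacing, computes GC content, and translates codons until the first stop. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons), and that the first codon is ATG, as it is here. `CODON_TABLE`, `clean`, `gc_content`, and `translate` are hypothetical names, not part of any database tool.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record above.
prefix = "ATGACGCTAT TAGGCACTGC GCTGCGTCCG"
print(translate(prefix))  # MTLLGTALRP, the first ten residues of the protein
```

Passing the full Gene sequence field to `translate` and `gc_content` in the same way allows the result to be compared against the Protein sequence and GC content fields.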