Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0054 |
Symbol | purH |
ID | 4436851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 42145 |
End bp | 43692 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639675819 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_819622 |
Protein GI | 116627003 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.191385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAAAC GCGCACTAAT TAGTGTCTCA GATAAAGCGG GCATTGTTGA ATTTGCCCAA GAACTCAAAA AACTTGGTTG GGATATCATC TCAACAGGTG GTACCAAAGT TACCCTTGAC AATGCTGGTG TTGACACCAT TGCCATTGAC GATGTAACTG GTTTCCCAGA AATGATGGAC GGTCGTGTGA AGACTCTTCA TCCAAATATC CACGGTGGTC TCCTCGCTCG TCGTGACCTT CATAGCCACC TTCAAGCGGC TAAGGACAAT AATATCGAAC TTATCGATCT TGTTGTGGTA AACCTTTACC CATTCAAGGA GACTATTCTC AAACCAGACG TGACCTATGC TGACGCAGTT GAAAACATCG ATATCGGTGG GCCATCAATG CTTCGTTCAG CGGCTAAAAA CCACGCTAGC GTAACAGTTG TTGTAGATCC TGCTGACTAT GCTGTTGTTC TTGACGAATT GTCAGCAAAC GGTGAAACAA CTTACGAAAC TCGCCAACGT TTGGCAGCGA AAGTATACCG TCACACAGCT TCATACGACG CTTTGATTGC AGAATACTTC ACAGCTCAAG TGGGTGAAAC AAAACCTGAA AAACTCACTT TGACTTATGA CCTTAAGCAA CCAATGCGTT ACGGTGAAAA CCCTCAACAA GACGCAGACT TCTACCAAAA AGGTTTGCCA ACGGCTTACT CCATTGCTTC AGCTAAACAG CTTAACGGTA AAGAATTGTC ATTCAACAAT ATCCGTGACG CTGATGCCGC TATCCGTATC ATCCGTGATT TCAAAGACCG TCCAACAGTC GTGGCTCTCA AACATATGAA CCCATGTGGT ATCGGTCAAG CTGATGACAT TGAAACAGCT TGGGACTACG CTTATGAAGC TGACCCAGTG TCAATCTTTG GTGGTATTGT AGTCCTCAAC CGTGAAGTTG ACGCTGCGAC GGCTAAGAAA ATGCACGGTG TCTTCCTTGA AATCATCATT GCACCAAGCT ATACAGATGA AGCACTTGAA ATCTTGACTA CCAAGAAGAA AAACTTGCGT ATCCTTGAGT TGCCATTTGA CGCTCAAGAT GCCAGCGAAG CAGAAGCAGA ATACACTGGT GTTGTCGGTG GACTTCTCGT TCAAAACCAA GACGTTGTTA AAGAAAGTCC AGCTGACTGG CAAGTGGTTA CTAAACGCCA ACCAACTGAT ACAGAAGTGA CAGCTCTTGA GTTTGCTTGG AAAGCCGTCA AGTACGTCAA ATCAAATGGT ATCATCGTGA CTAACGACCA CATGACACTT GGTGTTGGCC CTGGCCAAAC TAACCGTGTG GCTTCCGTCC GTATCGCTAT TGACCAAGCC AAAGGGCGTC TTGACGGTGC TGTTCTTGCT TCAGATGCCT TCTTCCCATT TGCAGATAAC GTGGAAGAAA TCGCCAAAGC AGGTATCAAG GCTATTATCC AACCAGGTGG CTCAGTACGT GACCAAGAGT CTATCGAAGC AGCTGATAAA TATGGATTAA CGATGATCTT TACAGGCGTT CGTCACTTCC GTCATTAA
|
Protein sequence | MTKRALISVS DKAGIVEFAQ ELKKLGWDII STGGTKVTLD NAGVDTIAID DVTGFPEMMD GRVKTLHPNI HGGLLARRDL HSHLQAAKDN NIELIDLVVV NLYPFKETIL KPDVTYADAV ENIDIGGPSM LRSAAKNHAS VTVVVDPADY AVVLDELSAN GETTYETRQR LAAKVYRHTA SYDALIAEYF TAQVGETKPE KLTLTYDLKQ PMRYGENPQQ DADFYQKGLP TAYSIASAKQ LNGKELSFNN IRDADAAIRI IRDFKDRPTV VALKHMNPCG IGQADDIETA WDYAYEADPV SIFGGIVVLN REVDAATAKK MHGVFLEIII APSYTDEALE ILTTKKKNLR ILELPFDAQD ASEAEAEYTG VVGGLLVQNQ DVVKESPADW QVVTKRQPTD TEVTALEFAW KAVKYVKSNG IIVTNDHMTL GVGPGQTNRV ASVRIAIDQA KGRLDGAVLA SDAFFPFADN VEEIAKAGIK AIIQPGGSVR DQESIEAADK YGLTMIFTGV RHFRH
|
| |