Gene STER_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_0054 
SymbolpurH 
ID4436851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp42145 
End bp43692 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content45% 
IMG OID639675819 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_819622 
Protein GI116627003 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.191385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAC GCGCACTAAT TAGTGTCTCA GATAAAGCGG GCATTGTTGA ATTTGCCCAA 
GAACTCAAAA AACTTGGTTG GGATATCATC TCAACAGGTG GTACCAAAGT TACCCTTGAC
AATGCTGGTG TTGACACCAT TGCCATTGAC GATGTAACTG GTTTCCCAGA AATGATGGAC
GGTCGTGTGA AGACTCTTCA TCCAAATATC CACGGTGGTC TCCTCGCTCG TCGTGACCTT
CATAGCCACC TTCAAGCGGC TAAGGACAAT AATATCGAAC TTATCGATCT TGTTGTGGTA
AACCTTTACC CATTCAAGGA GACTATTCTC AAACCAGACG TGACCTATGC TGACGCAGTT
GAAAACATCG ATATCGGTGG GCCATCAATG CTTCGTTCAG CGGCTAAAAA CCACGCTAGC
GTAACAGTTG TTGTAGATCC TGCTGACTAT GCTGTTGTTC TTGACGAATT GTCAGCAAAC
GGTGAAACAA CTTACGAAAC TCGCCAACGT TTGGCAGCGA AAGTATACCG TCACACAGCT
TCATACGACG CTTTGATTGC AGAATACTTC ACAGCTCAAG TGGGTGAAAC AAAACCTGAA
AAACTCACTT TGACTTATGA CCTTAAGCAA CCAATGCGTT ACGGTGAAAA CCCTCAACAA
GACGCAGACT TCTACCAAAA AGGTTTGCCA ACGGCTTACT CCATTGCTTC AGCTAAACAG
CTTAACGGTA AAGAATTGTC ATTCAACAAT ATCCGTGACG CTGATGCCGC TATCCGTATC
ATCCGTGATT TCAAAGACCG TCCAACAGTC GTGGCTCTCA AACATATGAA CCCATGTGGT
ATCGGTCAAG CTGATGACAT TGAAACAGCT TGGGACTACG CTTATGAAGC TGACCCAGTG
TCAATCTTTG GTGGTATTGT AGTCCTCAAC CGTGAAGTTG ACGCTGCGAC GGCTAAGAAA
ATGCACGGTG TCTTCCTTGA AATCATCATT GCACCAAGCT ATACAGATGA AGCACTTGAA
ATCTTGACTA CCAAGAAGAA AAACTTGCGT ATCCTTGAGT TGCCATTTGA CGCTCAAGAT
GCCAGCGAAG CAGAAGCAGA ATACACTGGT GTTGTCGGTG GACTTCTCGT TCAAAACCAA
GACGTTGTTA AAGAAAGTCC AGCTGACTGG CAAGTGGTTA CTAAACGCCA ACCAACTGAT
ACAGAAGTGA CAGCTCTTGA GTTTGCTTGG AAAGCCGTCA AGTACGTCAA ATCAAATGGT
ATCATCGTGA CTAACGACCA CATGACACTT GGTGTTGGCC CTGGCCAAAC TAACCGTGTG
GCTTCCGTCC GTATCGCTAT TGACCAAGCC AAAGGGCGTC TTGACGGTGC TGTTCTTGCT
TCAGATGCCT TCTTCCCATT TGCAGATAAC GTGGAAGAAA TCGCCAAAGC AGGTATCAAG
GCTATTATCC AACCAGGTGG CTCAGTACGT GACCAAGAGT CTATCGAAGC AGCTGATAAA
TATGGATTAA CGATGATCTT TACAGGCGTT CGTCACTTCC GTCATTAA
 
Protein sequence
MTKRALISVS DKAGIVEFAQ ELKKLGWDII STGGTKVTLD NAGVDTIAID DVTGFPEMMD 
GRVKTLHPNI HGGLLARRDL HSHLQAAKDN NIELIDLVVV NLYPFKETIL KPDVTYADAV
ENIDIGGPSM LRSAAKNHAS VTVVVDPADY AVVLDELSAN GETTYETRQR LAAKVYRHTA
SYDALIAEYF TAQVGETKPE KLTLTYDLKQ PMRYGENPQQ DADFYQKGLP TAYSIASAKQ
LNGKELSFNN IRDADAAIRI IRDFKDRPTV VALKHMNPCG IGQADDIETA WDYAYEADPV
SIFGGIVVLN REVDAATAKK MHGVFLEIII APSYTDEALE ILTTKKKNLR ILELPFDAQD
ASEAEAEYTG VVGGLLVQNQ DVVKESPADW QVVTKRQPTD TEVTALEFAW KAVKYVKSNG
IIVTNDHMTL GVGPGQTNRV ASVRIAIDQA KGRLDGAVLA SDAFFPFADN VEEIAKAGIK
AIIQPGGSVR DQESIEAADK YGLTMIFTGV RHFRH