Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1625 |
Symbol | purH |
ID | 4241152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1849542 |
End bp | 1851140 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105211 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_719830 |
Protein GI | 113461761 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAA ATCATCCTAT TCGTCAAGCA TTACTCAGCG TTTCAGATAA ATCAGGAATT GTTGAATTTG CACAAGGTTT AGTTAAACGA GGTGTAAAAC TATTATCAAC AGGGGGGACG GCAAAATTAC TCGCTGAAAA CGGGATTCCT GTTACGGAAG TATCTGATTA TACAGGCTTT CCTGAAATGA TGGAGGGACG TGTAAAAACC TTGCATCCTA AAATTCATGG AGGCATTTTA GGTCGCCGTG GTATAGATGA TGAAGTTATG ATGCAACATC AAATTGATGC TATTGATATG GTTGTAGTGA ACTTATATCC TTTTGCGGCA ACTGTAGCAA AACCTGATTG CACACTTGAA GATGCAGTAG AAAACATTGA TATTGGCGGT CCGACAATGG TACGTTCTGC CGCAAAAAAT CATCAACATG TGGCTATTGT AGTCAATAAT AGCGATTTTA ATGCAATTCT TGCTGAAATG GATCAAAATC GAAATAGTTT AACATTAGAG ACAAGATTTG ATTTAGCCAT TAAAGCGTTT GAACATACCG CACAATATGA CAGCATGATC GCCAATTATT TCGGACAAAT GGTAAAACCT TATTTCAGAG CTGAAGAAGA AGCTGAAGCG AAGTGCGGTC AATTTCCACG AACTTTAAAT CTTAATTTTA TACGTAAACA ATCTATGCGT TATGGTGAAA ACGGTCATCA AAAAGCAGCA TTCTATGTAG AACAAGACGT AAAAGAAGCA AGTGTCTCAA CCGCTAAACA GTTACAAGGT AAAGCACTTT CTTATAATAA TATTGCCGAC ACTGATGCCG CACTTGAATG TGTGAAATCG TTTTCCGAGC CGGCTTGTGT TATTGTTAAG CATGCTAACC CTTGCGGTGT AGCACTGGGC AAGGATATTC TCGAAGCCTA TAATCGAGCT TACCAAACGG ATCCAACCTC AGCTTTCGGT GGAATTATTG CATTTAATCG TGAGTTAGAT GAAGACACGG CAAAAGCCAT TATTGAGCGG CAATTCGTTG AAGTGATCAT TGCACCGACC GTCAGTTCCG CCGCCCAAGA AATTGTAAAA AGTAAGAAAA ATGTTCGCTT ATTGACGTGT GGCAATTGGG AAAGTGCAAT ACAACGCTTG GATTTTAAAC GTGTCAATGG CGGTTTGTTG GTACAAGAGG CTGATTTATC TATGGTGGAT TTAGCAGATC TTGAAGTAGT CAGTAAACGT CAACCGACCA AACAAGAGTT GGAAGATCTT TTATTCTGTT GGAAAGTGGC AAAATTTGTG AAATCCAACG CTATTGTATA CGCTAAAAAT AATCAAACTG TAGGGATTGG TGCCGGACAA ATGAGCCGTG TTTATTCAGC GAAAATTGCA GGGATCAAAG CAAAAGATGA AGGTTTGGAA GTAAAAGGCT GTGTAATGGC ATCGGATGCT TTCTTTCCGT TCCGTGATGG CATTGATGCA GCCGCAAAAG TTGGTATTGA ATGCGTAATC CATCCGGGCG GTTCAATGCG TGATCAAGAA GTTATCGATG CCGCCAATGA GCATAATATG GTAATGGTAC TCACTAAAAT GCGTCATTTT AGACATTAA
|
Protein sequence | MQLNHPIRQA LLSVSDKSGI VEFAQGLVKR GVKLLSTGGT AKLLAENGIP VTEVSDYTGF PEMMEGRVKT LHPKIHGGIL GRRGIDDEVM MQHQIDAIDM VVVNLYPFAA TVAKPDCTLE DAVENIDIGG PTMVRSAAKN HQHVAIVVNN SDFNAILAEM DQNRNSLTLE TRFDLAIKAF EHTAQYDSMI ANYFGQMVKP YFRAEEEAEA KCGQFPRTLN LNFIRKQSMR YGENGHQKAA FYVEQDVKEA SVSTAKQLQG KALSYNNIAD TDAALECVKS FSEPACVIVK HANPCGVALG KDILEAYNRA YQTDPTSAFG GIIAFNRELD EDTAKAIIER QFVEVIIAPT VSSAAQEIVK SKKNVRLLTC GNWESAIQRL DFKRVNGGLL VQEADLSMVD LADLEVVSKR QPTKQELEDL LFCWKVAKFV KSNAIVYAKN NQTVGIGAGQ MSRVYSAKIA GIKAKDEGLE VKGCVMASDA FFPFRDGIDA AAKVGIECVI HPGGSMRDQE VIDAANEHNM VMVLTKMRHF RH
|
| |