Gene HS_1625 details


Gene Information

Locus tag: HS_1625
Symbol: purH
ID: 4241152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1849542
End bp: 1851140
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 40%
IMG OID: 638105211
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_719830
Protein GI: 113461761
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
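
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1849542
END_BP = 1851140
GENE_LENGTH_BP = 1599
PROTEIN_LENGTH_AA = 532


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported gene length (1599 bp) and protein length (532 aa) agree with the start and end coordinates.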


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTAA ATCATCCTAT TCGTCAAGCA TTACTCAGCG TTTCAGATAA ATCAGGAATT 
GTTGAATTTG CACAAGGTTT AGTTAAACGA GGTGTAAAAC TATTATCAAC AGGGGGGACG
GCAAAATTAC TCGCTGAAAA CGGGATTCCT GTTACGGAAG TATCTGATTA TACAGGCTTT
CCTGAAATGA TGGAGGGACG TGTAAAAACC TTGCATCCTA AAATTCATGG AGGCATTTTA
GGTCGCCGTG GTATAGATGA TGAAGTTATG ATGCAACATC AAATTGATGC TATTGATATG
GTTGTAGTGA ACTTATATCC TTTTGCGGCA ACTGTAGCAA AACCTGATTG CACACTTGAA
GATGCAGTAG AAAACATTGA TATTGGCGGT CCGACAATGG TACGTTCTGC CGCAAAAAAT
CATCAACATG TGGCTATTGT AGTCAATAAT AGCGATTTTA ATGCAATTCT TGCTGAAATG
GATCAAAATC GAAATAGTTT AACATTAGAG ACAAGATTTG ATTTAGCCAT TAAAGCGTTT
GAACATACCG CACAATATGA CAGCATGATC GCCAATTATT TCGGACAAAT GGTAAAACCT
TATTTCAGAG CTGAAGAAGA AGCTGAAGCG AAGTGCGGTC AATTTCCACG AACTTTAAAT
CTTAATTTTA TACGTAAACA ATCTATGCGT TATGGTGAAA ACGGTCATCA AAAAGCAGCA
TTCTATGTAG AACAAGACGT AAAAGAAGCA AGTGTCTCAA CCGCTAAACA GTTACAAGGT
AAAGCACTTT CTTATAATAA TATTGCCGAC ACTGATGCCG CACTTGAATG TGTGAAATCG
TTTTCCGAGC CGGCTTGTGT TATTGTTAAG CATGCTAACC CTTGCGGTGT AGCACTGGGC
AAGGATATTC TCGAAGCCTA TAATCGAGCT TACCAAACGG ATCCAACCTC AGCTTTCGGT
GGAATTATTG CATTTAATCG TGAGTTAGAT GAAGACACGG CAAAAGCCAT TATTGAGCGG
CAATTCGTTG AAGTGATCAT TGCACCGACC GTCAGTTCCG CCGCCCAAGA AATTGTAAAA
AGTAAGAAAA ATGTTCGCTT ATTGACGTGT GGCAATTGGG AAAGTGCAAT ACAACGCTTG
GATTTTAAAC GTGTCAATGG CGGTTTGTTG GTACAAGAGG CTGATTTATC TATGGTGGAT
TTAGCAGATC TTGAAGTAGT CAGTAAACGT CAACCGACCA AACAAGAGTT GGAAGATCTT
TTATTCTGTT GGAAAGTGGC AAAATTTGTG AAATCCAACG CTATTGTATA CGCTAAAAAT
AATCAAACTG TAGGGATTGG TGCCGGACAA ATGAGCCGTG TTTATTCAGC GAAAATTGCA
GGGATCAAAG CAAAAGATGA AGGTTTGGAA GTAAAAGGCT GTGTAATGGC ATCGGATGCT
TTCTTTCCGT TCCGTGATGG CATTGATGCA GCCGCAAAAG TTGGTATTGA ATGCGTAATC
CATCCGGGCG GTTCAATGCG TGATCAAGAA GTTATCGATG CCGCCAATGA GCATAATATG
GTAATGGTAC TCACTAAAAT GCGTCATTTT AGACATTAA
 
Protein sequence
MQLNHPIRQA LLSVSDKSGI VEFAQGLVKR GVKLLSTGGT AKLLAENGIP VTEVSDYTGF 
PEMMEGRVKT LHPKIHGGIL GRRGIDDEVM MQHQIDAIDM VVVNLYPFAA TVAKPDCTLE
DAVENIDIGG PTMVRSAAKN HQHVAIVVNN SDFNAILAEM DQNRNSLTLE TRFDLAIKAF
EHTAQYDSMI ANYFGQMVKP YFRAEEEAEA KCGQFPRTLN LNFIRKQSMR YGENGHQKAA
FYVEQDVKEA SVSTAKQLQG KALSYNNIAD TDAALECVKS FSEPACVIVK HANPCGVALG
KDILEAYNRA YQTDPTSAFG GIIAFNRELD EDTAKAIIER QFVEVIIAPT VSSAAQEIVK
SKKNVRLLTC GNWESAIQRL DFKRVNGGLL VQEADLSMVD LADLEVVSKR QPTKQELEDL
LFCWKVAKFV KSNAIVYAKN NQTVGIGAGQ MSRVYSAKIA GIKAKDEGLE VKGCVMASDA
FFPFRDGIDA AAKVGIECVI HPGGSMRDQE VIDAANEHNM VMVLTKMRHF RH
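
The relationship between the two sequences can be checked by translating the gene sequence. The following is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid mapping by hand, relying on the fact that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Characters other than A, C, G, T raise KeyError in this sketch.
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGCAATTAA ATCATCCTAT TCGTCAAGCA"))  # MQLNHPIRQA
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` should allow the Protein Length and GC content fields to be checked in the same way.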