Gene HS_1551 details


Gene Information

Locus tag: HS_1551
Symbol: purM
ID: 4241072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1747749
End bp: 1748786
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 41%
IMG OID: 638105131
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_719756
Protein GI: 113461687
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTAAAC AATCATTAAG TTATAAAGAC GCAGGTGTGG ATATTAATGC GGGGAACACA 
TTAGTAGAAC GTATTAAATC TGATGTAAAA CGTACAACTA GACCCGAGGT TATTGGTGGG
TTAGGGGGCT TCGGTGCACT ATGTGCATTG CCAAGTAAAT ATAAGGATCC TATTCTTGTA
TCCGGAACTG ATGGTGTTGG GACTAAGTTA CGCCTTGCGA TTGACCTAAA AAAACATGAC
ACAATTGGTG TTGATTTGGT CGCAATGTGT GTCAATGATC TTGTGGTGCA AGGTGCAGAA
CCGTTATTTT TTCTCGACTA TTATGCAACA GGTAAATTGG ATGTAGACGT TGCAGCAGAT
GTCATCAAAG GTATTGCTGA TGGTTGTGTG CAAGCCGGTT GTGCTTTAGT AGGGGGTGAA
ACCGCAGAAA TGCCGGGAAT GTATCATACC GGTGATTATG ATTTGGCAGG TTTTTGTGTG
GGTGTAGTTG AGAAATCGGA AATTATTGAC GGTTCCAACG TTAAAGCAGG CGATGCATTA
CTTGCCTTAG CTTCAAGCGG TCCTCATTCA AATGGATATT CATTAATTCG CAAAGTCATT
GAAGTTTCAG GTATTGATCC GACAACAACA CAATTAGCCG AGCATTCATT CGCTGAACAA
GTTCTTGCAC CGACAAAAAT TTATGTAAAA CCGGTGTTGC AATTAATTAA ACATACTGAC
GTTCATGCTA TTTGCCATTT AACAGGCGGC GGTTTTTGGG AAAATATTCC GCGTGTTTTA
CCGTCTTCCG TTAAAGCGGT AATTAATGAA AAGAGTTGGG AATGGCATCC TATTTTCAAA
TGGTTACAAG AACAAGGAAA TATTGATCGC TATGAAATGT ATAGAACCTT TAACTGTGGC
GTAGGCATGA TTATCGCTCT CCCACAGGAA GATGTGGAAA CTGCATTGGC ATTATTACAA
CAAGTAGGCG AAAAAGCATG GGTAATCGGT AAAATCGAAC ATGCGAATGC TGATGAAGAA
AAAGTTGTGA TTTGTTGA
 
Protein sequence
MSKQSLSYKD AGVDINAGNT LVERIKSDVK RTTRPEVIGG LGGFGALCAL PSKYKDPILV 
SGTDGVGTKL RLAIDLKKHD TIGVDLVAMC VNDLVVQGAE PLFFLDYYAT GKLDVDVAAD
VIKGIADGCV QAGCALVGGE TAEMPGMYHT GDYDLAGFCV GVVEKSEIID GSNVKAGDAL
LALASSGPHS NGYSLIRKVI EVSGIDPTTT QLAEHSFAEQ VLAPTKIYVK PVLQLIKHTD
VHAICHLTGG GFWENIPRVL PSSVKAVINE KSWEWHPIFK WLQEQGNIDR YEMYRTFNCG
VGMIIALPQE DVETALALLQ QVGEKAWVIG KIEHANADEE KVVIC
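
The record's coordinates, lengths, and sequences can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it recomputes the gene and protein lengths from the start and end coordinates and translates the first 60 bases of the gene sequence. The names `translate`, `START_CODONS`, and `FIRST_60_BASES` are illustrative. It assumes that translation table 11 uses the standard codon assignments and that a GTG (or TTG) start codon is read as methionine; only three common bacterial start codons are listed here.

```python
from itertools import product

# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the start codons accepted in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG/TTG is read as methionine
        protein.append(aa)
    return "".join(protein)


# Values copied from the record above.
START_BP, END_BP = 1747749, 1748786
gene_length = END_BP - START_BP + 1      # coordinates are inclusive
protein_length = gene_length // 3 - 1    # the final codon is the stop codon

# First line of the gene sequence (60 bases).
FIRST_60_BASES = (
    "GTGAGTAAAC AATCATTAAG TTATAAAGAC GCAGGTGTGG ATATTAATGC GGGGAACACA"
)

print(gene_length, protein_length)
print(translate(FIRST_60_BASES))
```

The computed lengths should agree with the Gene Length and Protein Length fields, and the translated fragment should match the first 20 residues of the protein sequence.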