Gene HS_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0046 
SymbolhemX 
ID4239554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp49546 
End bp50745 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content35% 
IMG OID638103577 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_718252 
Protein GI113460195 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2959] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.120112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGAG AGAATGAACA AGTTGGCGAA AAAAGAACAG CAGCTCAAGT TGAAACTGTA 
GTGGTGAAAA AAGGAGGGAC AGTCATTGCT CTATTAGCTT TGCTTATTGC ATTGGGTATT
GGGGGAGCTG GCTATTATTT TGGTCAACAA AAAGTAGAAG AAATTCAGCA AAAGTTGACC
GCACTTAGCC AACCGTCGGA AGCTATGCCG TCGGAAAACA ATGATACTTT GTTGGCAACA
ATTGAAGAAT ATAAGCAAAC TTTTATACAG AAGATTGAGC GACTGGAAAA TGAAATGACA
AAACAAAACC AGCTTATTCA AAGTTTACAA GCACAAGTAA ATAAATTGGA TGCGGTTGGT
AAAGTTGAAC AATCGACGGA TTGGTTGTTG TTTGAAACAG ATTATTTGTT GAATAATGCT
TTACGTAAAA TCGTATTGGA TAATGATGTG GAGACAGCTA TTGCGTTATT GAAAGTGGCG
GATGAAACAC TCGTTAAGGT TAATGATCCT AAAGTCATTA ATATTCGTCA GGCAATTAAT
GCAGATTTAA AACAGTTATT ATCAGTAAAT AATGTAGATC AAAATGCCAT CATGCAACAT
TTATCTCAAT TAGCAAACGG TATTGATGAA TTAGTTGTAT TAAATGTGAA TTTCGATGAG
CAAGAAAATA CTCAATTAAG CGATTCCTTA CAGGATTGGA AAGAAAACGT AGAAAAAAGT
GCGGTATCTT TTTTAAATCA TTTTATTCGT GTAAAACCTC GCCATGTGAA CTCAAAGGAA
TTACTTGCAC CAAATCAAGA TATTTATTTA CGTGAAAATA TTCGTTTACG TTTGCAAATT
GCGATTATGG CGGTCCCTCG TCAGCAAAAT GATTTATATA AACAATCACT TGAAATTGTA
GGTTCTTGGA TAAGAAGCTA TTTTGATACA AGCACTGAAG TGGCACAAAA CTTCTTGAAA
GAGATTGATG AACTTGCTGA GAAATCTATC TATGTTGATG TTCCTAACCA ATTAAAAAGT
TTGCTTTTGT TGGATAAGTT ATTGAATAAG GAACAGTCAT CTGTACAAAA AATTGAAATG
ACAGTAGATA AAGACTTGGT TAGCTCAACA GATCAAGTAT CTGAGGAAGG GAAAACAGAT
CAAGCTGTTG AAAAATCTGA TGAAAAGCCA ATTGAACAGC CTGTTGAGCA AGCACAGTAA
 
Protein sequence
MERENEQVGE KRTAAQVETV VVKKGGTVIA LLALLIALGI GGAGYYFGQQ KVEEIQQKLT 
ALSQPSEAMP SENNDTLLAT IEEYKQTFIQ KIERLENEMT KQNQLIQSLQ AQVNKLDAVG
KVEQSTDWLL FETDYLLNNA LRKIVLDNDV ETAIALLKVA DETLVKVNDP KVINIRQAIN
ADLKQLLSVN NVDQNAIMQH LSQLANGIDE LVVLNVNFDE QENTQLSDSL QDWKENVEKS
AVSFLNHFIR VKPRHVNSKE LLAPNQDIYL RENIRLRLQI AIMAVPRQQN DLYKQSLEIV
GSWIRSYFDT STEVAQNFLK EIDELAEKSI YVDVPNQLKS LLLLDKLLNK EQSSVQKIEM
TVDKDLVSST DQVSEEGKTD QAVEKSDEKP IEQPVEQAQ