Gene HS_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1047 
SymbolhemH 
ID4240545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1153936 
End bp1155066 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content38% 
IMG OID638104608 
Productferrochelatase 
Protein accessionYP_719259 
Protein GI113461190 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.258045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGGC ATTGTGCTGG GGATTTTTCT CGTATTTTTT GGCTTATCCA TTTCAGTAGA 
ACCTCTCAGA GCTATATTGG CACTCAATCT ATTGCTGGGA TTGGCAGTAG TAACCAACGG
GATTCAACTC CTAACTGTTC AACTGATGAA AGAGAAATAG AGGGCTACCG AATGAAAAAA
ATCGGTATTA TTCTTGCCAA TTTAGGTACA CCCGATGAAC CTACTCCTAA AGCATTATCT
CGCTATTTAT GGCAATTTTT GACTGATCCA CGTGTAGTGG ATCTACCTAA ATGGCGTTGG
TATCCGTTAC TTAAAAGCAT TATTTTGCCT CGCCGATCAG CTCGGGTCGC CAAAATATAT
CAAACCGTTT GGACAGATAA AGGTTCTCCT TTACTTGTCA TTTCCAATCA ACAAAAGCAA
GCGTTACAAT CCCACTTCGA TGAACACCGG ATTAATGCAA CAGTAGAAAT TGCAATGACG
TATGGTAACC CGTCAATGGA AAGTGCGGTT GAAAAACTAT TGAAAAAGCA CGTGAATGAA
ATCATCTTGT TACCCCTTTT CCCACAATAT AGCAGTACAA CTACCGGTGC TGTCTTTGAT
GCTTTTGCAC AGGCATTAAA AAAACAACGC AACATTGTGC CTTTTCAGTT CATTCATTCA
TACCATTTAC ATGAAGATTA TATCGAGGCA CTGGTAAATA GTATTAACGC TCAACACAAA
CCGGATGAAT ACTTAATTTT TTCTTTTCAT GGCATACCGT TACGCTATGA AAATGAAGGA
GATTATTATC GTAAACATTG TCATGAAACA GTTTTAGCTG TAGTAGAACG TTTAGGCTTG
CGTGAAAATC AGTGGCAAAT GACGTTCCAA TCAAGATTCG GAAAAGAAGA ATGGTTGCAA
CCTTATACGG ATAAAGTGTT GGAAAATATT TATCAACGAA ATATACAAAA AGTTGCCGTG
GTTTGTCCCG GATTTTCCGC AGATTGTTTA GAAACAATCG AAGAAATTAA TGAAGAAAAT
CGAAGAATTT TTCTCTCTCA TGGGGGAGCG TCTTTTCAAT ATATTCCCGC ACTTAATGCA
GAAATGCAAC ATATTGAAAT GATGTATAAA TTAATCTCAA GTAGATTATA A
 
Protein sequence
MGRHCAGDFS RIFWLIHFSR TSQSYIGTQS IAGIGSSNQR DSTPNCSTDE REIEGYRMKK 
IGIILANLGT PDEPTPKALS RYLWQFLTDP RVVDLPKWRW YPLLKSIILP RRSARVAKIY
QTVWTDKGSP LLVISNQQKQ ALQSHFDEHR INATVEIAMT YGNPSMESAV EKLLKKHVNE
IILLPLFPQY SSTTTGAVFD AFAQALKKQR NIVPFQFIHS YHLHEDYIEA LVNSINAQHK
PDEYLIFSFH GIPLRYENEG DYYRKHCHET VLAVVERLGL RENQWQMTFQ SRFGKEEWLQ
PYTDKVLENI YQRNIQKVAV VCPGFSADCL ETIEEINEEN RRIFLSHGGA SFQYIPALNA
EMQHIEMMYK LISSRL