Gene HS_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0047 
SymbolhemY 
ID4239555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp50760 
End bp52037 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content37% 
IMG OID638103578 
Productporphyrin biosynthesis protein 
Protein accessionYP_718253 
Protein GI113460196 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGAG TTCTATTTTT AATGCTGGTT TTATTGGCTG GATTAATTGG CGGTCCCTAT 
TTATCAGGTA AACAGGGATA TGTGTTAATT CAAACGGCAA GCTATAACAT CGAAATGTCA
ATCACGATGT TGGTTATTTT CTTTGTGATT TTAATGGCCA TTGTTTATCT GATTGAATGG
GTGGTTACAC GTTTCTTCCG TTTGAGTAAT AATACCTATA GTTGGTTCTC TCGTCGTAAA
CGTGTTAAGG CACAACGTCA AACTTTGGAA GGTTTAGTGA AGATGAATGA GGGCGATTAT
TCCAAAGCTG AGAAATTGAT TGGAAAAAAT GCTAGACATT CAGATAAACC AATATTGAAT
TTAATTAAAG CGGCGGAAGC GGCACAACAG CGAGGCGATG ATTTTGTTGC CAATCGCTAT
TTAATTGAAG CAACAGAATT AGCAGGTACA GACAGTTTAA TTGTCGAAAT TGCACGTACT
AGAATTTTAT TGCAACAAAA TAAACTTCCG GCAGCTCGTA GTTCGGTGGA TAGTTTACTG
GAGATGACCT CTCGTAATAA AGAAGTATTA AAGCTGGCAG TGAAAATTTA CCTGAAATCT
GCCGCCTTCC ACGCATTGGA TAAAATTCTA GATCAAATTG AAAAAGTTGG ATTATATTCT
TCCGATGAAT TCACCGCCCT TCAGCGTAAA GTCGAAGATG GTTTATTAGA CGAGAAAATG
AACGAAGAGG GTGTTGATGG GTTATTACGT TGGTGGGATG AACAGCCTCG TAAACGCCGT
AATGATTTAT ATGTTAAGGT TGGTTTAATT CGTCGTTTAC TGGACAGTGA TGATCATGAA
AGTGCTTATG AGTTAGCTAT TGATGCTCTG AAAAAAGTTG AAAATAGTGT TGAAGCTAAT
GTTGCTCTTT GTACTCAAAT TACTCGTTTA CAACCGGAAG ATAACAGTAA GTTATTAAAA
CTTTTGGAAA AACGTGCAAA ACAGTCAAAC AGTAAAGATT GTTGCTGTGT TGAACGTGCG
TTAGGTTACC TTTATGTGCG TAATGATGAT TTCGCTAAAG CGGCAGAAGC ATTCAAGAAA
GTGATAGAGA ATAAAGCGAG CTTACAAGCA AATGATATTA CTATGGCGGC TTATGTTTTC
GAACAGGTTG GAGAGCTTGA ATTAGCACAA AAAGTTCGTG AGGAAGGATT GAGAAATGCG
ATGTCAATTA AAGAGTCGGA AAATAAAACC AAAAAAACAG CAGAAAATCC GACCGCCCTT
TTAGAACAAA AATCCTAA
 
Protein sequence
MFRVLFLMLV LLAGLIGGPY LSGKQGYVLI QTASYNIEMS ITMLVIFFVI LMAIVYLIEW 
VVTRFFRLSN NTYSWFSRRK RVKAQRQTLE GLVKMNEGDY SKAEKLIGKN ARHSDKPILN
LIKAAEAAQQ RGDDFVANRY LIEATELAGT DSLIVEIART RILLQQNKLP AARSSVDSLL
EMTSRNKEVL KLAVKIYLKS AAFHALDKIL DQIEKVGLYS SDEFTALQRK VEDGLLDEKM
NEEGVDGLLR WWDEQPRKRR NDLYVKVGLI RRLLDSDDHE SAYELAIDAL KKVENSVEAN
VALCTQITRL QPEDNSKLLK LLEKRAKQSN SKDCCCVERA LGYLYVRNDD FAKAAEAFKK
VIENKASLQA NDITMAAYVF EQVGELELAQ KVREEGLRNA MSIKESENKT KKTAENPTAL
LEQKS