Gene SeD_A4324 details


Gene Information

Locus tag: SeD_A4324
Symbol: hemY
ID: 6875794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4170530
End bp: 4171729
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 58%
IMG OID: 642787252
Product: putative protoheme IX biogenesis protein
Protein accession: YP_002217868
Protein GI: 198243522
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG3071] Uncharacterized enzyme of heme biosynthesis
TIGRFAM ID: [TIGR00540] hemY protein
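
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 4170530
END_BP = 4171729
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))    # 1200
print(protein_length(GENE_LENGTH_BP))   # 399
```

Both results agree with the Gene Length and Protein Length fields of the record.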


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00653056
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTTG CGGGTATCGT GGTCGGCCCG 
ATGATAGCCG GTCACCAGGG CTATGTGTTA ATCCAGACCG ATAACTACAA CATTGAAACC
AGCGTCACCG GCCTGGCGAT CATTCTCATC GTCGCTATGG TGGTGCTATT CGCCATTGAG
TGGTTACTAC GCCGTCTGTT TCGTACCGGC GCGCACACCC GCGGTTGGTT TGCCGGTCGT
AAACGCCGTC GCGCCCGCAA GCAGACCGAA CAGGCGCTGC TGAAGCTGGC GGAGGGCGAC
TATCAGCAGG TTGAAAAGCT GATGTCAAAA AATGCCGACC ATGCTGAACA ACCGGTGGTA
AATTATCTGC TGGCGGCAGA AGCGGCGCAG CAGCGCGGCG ATGAAGCACG CGCTAATCAG
CACCTTGAAC GTGCGGCAGA GCTGGCGGGG AACGACACCA TCCCGGTCGA GATCACGCGC
GTGCGCTTAC AGCTGGCGCG CAATGAAAAT CACGCAGCGC GTCACGGCGT GGACAAACTG
CTGGAGGTCA CGCCGCGTCA CCCGGAAGTT CTGCGTCTGG CGGAGCAGGC CTATATTCGC
ACCAGCGCAT GGAGTTCTTT ACTGGATATC ATCCCATCCA TGGCGAAAGC GCACGTCGGC
GATGAAGCGC ATCGCGCCAT ACTCGAACAA CAGGCGTGGA TAGGACTCAT GGATCAGGCG
CGCGCCGAGC AGGGCAGCGA AGGATTACGC ACCTGGTGGA AAAACCAAAG CCGTAAAACC
CGCCATCAGG TGGCGCTGCA GGTCGCGATG GCCGAGCATC TGATCGAATG TGACGATCAT
GATATGGCGC AGCAGATTAT CATCGACGGT CTGAAACGCC AGTATGACGA TCGGCTGGTG
CTGCCCATTC CTCGCCTCAG AACCAATAAT CCGGAACAAC TGGAGAAAGT GCTGCGTCAG
CAAATCAAGA CGGTAGGCGA TCGCCCGCTG TTATGGAGCA CGCTCGGCCA GTCGCTAATG
AAACACGGTG AATGGCAGGA GGCAACCCTG GCTTTCCGCG CCGCGCTAAA ACAACGCCCG
GACGCGTATG ATTATGCCTG GCTTGCCGAT GCGCTTGATC GACTGCATCA ACCGGAAGAG
GCCGCCGCCA TGCGGCGCGA TGGCTTGATG CTGACATTAC AGAACAATCC CCCGCAGTAA
 
Protein sequence
MMLKVLLLFV LLLAGIVVGP MIAGHQGYVL IQTDNYNIET SVTGLAIILI VAMVVLFAIE 
WLLRRLFRTG AHTRGWFAGR KRRRARKQTE QALLKLAEGD YQQVEKLMSK NADHAEQPVV
NYLLAAEAAQ QRGDEARANQ HLERAAELAG NDTIPVEITR VRLQLARNEN HAARHGVDKL
LEVTPRHPEV LRLAEQAYIR TSAWSSLLDI IPSMAKAHVG DEAHRAILEQ QAWIGLMDQA
RAEQGSEGLR TWWKNQSRKT RHQVALQVAM AEHLIECDDH DMAQQIIIDG LKRQYDDRLV
LPIPRLRTNN PEQLEKVLRQ QIKTVGDRPL LWSTLGQSLM KHGEWQEATL AFRAALKQRP
DAYDYAWLAD ALDRLHQPEE AAAMRRDGLM LTLQNNPPQ
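
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline. It assumes that sense codons in translation table 11 code for the same amino acids as in the standard genetic code (the two tables differ in which codons may act as start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record.
first_line = "ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTTG CGGGTATCGT GGTCGGCCCG"
print(translate(first_line))  # MMLKVLLLFVLLLAGIVVGP
```

The translated residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.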