Gene SNSL254_A4217 details


Gene Information

Locus tag: SNSL254_A4217
Symbol: hemY
ID: 6486949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4108232
End bp: 4109431
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 58%
IMG OID: 642739470
Product: putative protoheme IX biogenesis protein
Protein accession: YP_002043169
Protein GI: 194443869
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG3071] Uncharacterized enzyme of heme biosynthesis
TIGRFAM ID: [TIGR00540] hemY protein
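
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the record above.
start_bp = 4108232
end_bp = 4109431

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1200 399
```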


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0192261
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTCG CGGGTATCGT GGTCGGCCCG 
ATGATAGCCG GTCACCAGGG CTATGTGTTA ATCCAGACCG ATAACTACAA CATTGAAACC
AGCGTCACCG GACTGGCGAT CATTCTCATC GTCGCTATGG TGGTGCTATT CGCCATTGAG
TGGTTGCTAC GCCGTCTGTT TCGTACCGGC GCGCACACCC GCGGTTGGTT TGCCGGCCGT
AAACGCCGTC GCGCCCGCAA GCAGACCGAA CAGGCGCTGC TGAAGCTGGC GGAGGGCGAC
TATCAGCAGG TTGAAAAGCT GATGTCAAAA AATGCCGACC ATGCTGAACA ACCGGTGGTA
AATTATCTGC TGGCGGCAGA AGCGGCGCAG CAGCGCGGCG ATGAAGCACG CGCTAATCAG
CACCTTGAAC GTGCGGCAGA GCTGGCGGGG AACGACACCA TCCCGGTCGA GATCACGCGC
GTGCGCTTAC AGCTGGCGCG CAATGAAAAT CACGCAGCGC GTCACGGCGT GGACAAACTG
CTGGAGGTCA CGCCGCGTCA CCCGGAAGTT CTGCGTCTGG CGGAGCAGGC CTATATTCGC
ACCAGCGCAT GGAGTTCTTT ACTGGATATC ATCCCATCCA TGGCGAAAGC GCACGTCGGC
GATGAAGCGC ATCGCGCCAT GCTCGAACAA CAGGCGTGGA TAGGACTCAT GGATCAGGCG
CGCGCCGAGC AGGGCAGCGA AGGATTACGC ACCTGGTGGA AAAACCAAAG CCGTAAAACC
CGCCATCAGG TGGCGCTGCA GGTCGCGATG GCCGAGCATC TGATCGAATG TGACGATCAT
GATATGGCGC AGCAGATTAT CATCGATGGT CTGAAACGCC AGTATGACGA TCGGCTGGTG
CTGCCCATTC CTCGCCTCAG AACCAATAAT CCGGAACAAC TGGAGAAAGT GCTGCGCCAG
CAAATCAAGA CGGTAGGCGA TCGCCCGCTG TTATGGAGCA CGCTCGGCCA GTCGCTAATG
AAACACGGTG AATGGCAGGA GGCGACTCTG GCTTTCCGCG CCGCGCTGAA ACAACGCCCG
GACGCGTATG ATTATGCCTG GCTTGCCGAT GCGCTTGATC GACTGCATCA ACCGGAAGAG
GCTGCCGCCA TGCGGCGCGA CGGCCTGATG CTGACATTAC AGAACAATCC CTCGCAGTAA
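
The GC content field reported under Gene Information can be recomputed from a gene sequence laid out as above. A minimal sketch, assuming the sequence is pasted in with its spaces and line breaks intact; `gc_percent` is a hypothetical helper name, and only the first line of the sequence is used here for brevity:

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        return 0.0  # avoid dividing by zero on empty input
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# First line of the gene sequence, copied from the record.
first_line = "ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTCG CGGGTATCGT GGTCGGCCCG"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1200 bp sequence to the same function should give a value comparable to the GC content field.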
 
Protein sequence
MMLKVLLLFV LLLAGIVVGP MIAGHQGYVL IQTDNYNIET SVTGLAIILI VAMVVLFAIE 
WLLRRLFRTG AHTRGWFAGR KRRRARKQTE QALLKLAEGD YQQVEKLMSK NADHAEQPVV
NYLLAAEAAQ QRGDEARANQ HLERAAELAG NDTIPVEITR VRLQLARNEN HAARHGVDKL
LEVTPRHPEV LRLAEQAYIR TSAWSSLLDI IPSMAKAHVG DEAHRAMLEQ QAWIGLMDQA
RAEQGSEGLR TWWKNQSRKT RHQVALQVAM AEHLIECDDH DMAQQIIIDG LKRQYDDRLV
LPIPRLRTNN PEQLEKVLRQ QIKTVGDRPL LWSTLGQSLM KHGEWQEATL AFRAALKQRP
DAYDYAWLAD ALDRLHQPEE AAAMRRDGLM LTLQNNPSQ
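
The protein sequence above corresponds to the gene sequence translated with translation table 11. A minimal sketch, assuming the codon-to-amino-acid assignments of table 11 match the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG); `translate` is a hypothetical helper, not part of the record:

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGATGCTAA AAGTATTATT GCTCTTTGTG"))  # MMLKVLLLFV
print(translate("CCCTCGCAGTAA"))  # PSQ
```

Both outputs agree with the start and end of the listed protein sequence.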