Gene SeSA_A4148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4148 
SymbolhemY 
ID6519936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4021239 
End bp4022438 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content58% 
IMG OID642749115 
Productputative protoheme IX biogenesis protein 
Protein accessionYP_002116867 
Protein GI194737284 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0333333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTCG CGGGTATCGT GGTCGGCCCG 
ATGATAGCCG GTCACCAGGG CTATGTGTTA ATCCAGACCG ATAACTACAA CATTGAAACC
AGCGTCACCG GCCTGGCGAT CATTCTCATC GTCGCTATGG TGGTGCTATT CGCCATTGAG
TGGTTGCTAC GCCGTCTGTT TCGTACCGGC GCGCACACCC GCGGTTGGTT TGCCGGCCGT
AAACGCCGTC GCGCCCGCAA GCAGACCGAA CAGGCGCTGC TGAAGCTGGC GGAGGGCGAC
TATCAGCAGG TTGAAAAGCT GATGTCAAAA AATGCCGACC ATGCTGAACA ACCGGTGGTA
AATTATCTGC TGGCGGCAGA AGCGGCGCAG CAGCGCGGCG ATGAAGCACG CGCTAATCAG
CACCTTGAAC GTGCGGCAGA GCTGGCGGGG AACGACACCA TCCCGGTCGA GATTACACGC
GTGCGCTTAC AGCTGGCGCG CAATGAAAAT CACGCGGCGC GTCACGGCGT GGACAAACTG
CTGGAGGTCA CGCCGCGTCA CCCGGAAGTT CTGCGTCTGG CGGAGCAGGC CTATATTCGC
ACCAGCGCAT GGAGTTCTTT ACTGGATATC ATCCCATCCA TGGCGAAAGC GCACGTTGGC
GATGAAGCGC ATCGCGCCAT GCTCGAACAA CAGGCGTGGA TAGGACTCAT GGATCAGGCG
CGCGCCGAGC AGGGCAGCGA AGGATTACGC ACCTGGTGGA AAAACCAAAG CCGTAAAACC
CGCCATCAGG TGGCGCTGCA GGTCGCGATG GCCGAGCATC TGATCGAATG TGACGATCAT
GATATGGCGC AGCAGATTAT CATCGACGGT CTGAAACGCC AGTATGACGA TCGACTGGTG
CTGCCCATTC CTCGCCTCAG AACCAATAAT CCGGAACAAC TGGAGAAAGT GCTGCGCCAG
CAAATTAAGA CAGTAGGCGA TCGCCCGCTG TTATGGAGCA CACTCGGTCA GTCGCTAATG
AAACACGGTG AATGGCAGGA GGCGACTCTG GCTTTCCGCG CCGCGCTGAA ACAACGCCCG
GACGCGTATG ATTATGCCTG GCTTGCCGAT GCGCTTGATC GACTGCATCA ACCGGAAGAG
GCCGCCGCCA TGCGGCGCGA CGGCCTGATG CTGACATTAC AGAACAATCC CCCGCAGTAA
 
Protein sequence
MMLKVLLLFV LLLAGIVVGP MIAGHQGYVL IQTDNYNIET SVTGLAIILI VAMVVLFAIE 
WLLRRLFRTG AHTRGWFAGR KRRRARKQTE QALLKLAEGD YQQVEKLMSK NADHAEQPVV
NYLLAAEAAQ QRGDEARANQ HLERAAELAG NDTIPVEITR VRLQLARNEN HAARHGVDKL
LEVTPRHPEV LRLAEQAYIR TSAWSSLLDI IPSMAKAHVG DEAHRAMLEQ QAWIGLMDQA
RAEQGSEGLR TWWKNQSRKT RHQVALQVAM AEHLIECDDH DMAQQIIIDG LKRQYDDRLV
LPIPRLRTNN PEQLEKVLRQ QIKTVGDRPL LWSTLGQSLM KHGEWQEATL AFRAALKQRP
DAYDYAWLAD ALDRLHQPEE AAAMRRDGLM LTLQNNPPQ