Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4217 |
Symbol | hemY |
ID | 6486949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4108232 |
End bp | 4109431 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642739470 |
Product | putative protoheme IX biogenesis protein |
Protein accession | YP_002043169 |
Protein GI | 194443869 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | [TIGR00540] hemY protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0192261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTCG CGGGTATCGT GGTCGGCCCG ATGATAGCCG GTCACCAGGG CTATGTGTTA ATCCAGACCG ATAACTACAA CATTGAAACC AGCGTCACCG GACTGGCGAT CATTCTCATC GTCGCTATGG TGGTGCTATT CGCCATTGAG TGGTTGCTAC GCCGTCTGTT TCGTACCGGC GCGCACACCC GCGGTTGGTT TGCCGGCCGT AAACGCCGTC GCGCCCGCAA GCAGACCGAA CAGGCGCTGC TGAAGCTGGC GGAGGGCGAC TATCAGCAGG TTGAAAAGCT GATGTCAAAA AATGCCGACC ATGCTGAACA ACCGGTGGTA AATTATCTGC TGGCGGCAGA AGCGGCGCAG CAGCGCGGCG ATGAAGCACG CGCTAATCAG CACCTTGAAC GTGCGGCAGA GCTGGCGGGG AACGACACCA TCCCGGTCGA GATCACGCGC GTGCGCTTAC AGCTGGCGCG CAATGAAAAT CACGCAGCGC GTCACGGCGT GGACAAACTG CTGGAGGTCA CGCCGCGTCA CCCGGAAGTT CTGCGTCTGG CGGAGCAGGC CTATATTCGC ACCAGCGCAT GGAGTTCTTT ACTGGATATC ATCCCATCCA TGGCGAAAGC GCACGTCGGC GATGAAGCGC ATCGCGCCAT GCTCGAACAA CAGGCGTGGA TAGGACTCAT GGATCAGGCG CGCGCCGAGC AGGGCAGCGA AGGATTACGC ACCTGGTGGA AAAACCAAAG CCGTAAAACC CGCCATCAGG TGGCGCTGCA GGTCGCGATG GCCGAGCATC TGATCGAATG TGACGATCAT GATATGGCGC AGCAGATTAT CATCGATGGT CTGAAACGCC AGTATGACGA TCGGCTGGTG CTGCCCATTC CTCGCCTCAG AACCAATAAT CCGGAACAAC TGGAGAAAGT GCTGCGCCAG CAAATCAAGA CGGTAGGCGA TCGCCCGCTG TTATGGAGCA CGCTCGGCCA GTCGCTAATG AAACACGGTG AATGGCAGGA GGCGACTCTG GCTTTCCGCG CCGCGCTGAA ACAACGCCCG GACGCGTATG ATTATGCCTG GCTTGCCGAT GCGCTTGATC GACTGCATCA ACCGGAAGAG GCTGCCGCCA TGCGGCGCGA CGGCCTGATG CTGACATTAC AGAACAATCC CTCGCAGTAA
|
Protein sequence | MMLKVLLLFV LLLAGIVVGP MIAGHQGYVL IQTDNYNIET SVTGLAIILI VAMVVLFAIE WLLRRLFRTG AHTRGWFAGR KRRRARKQTE QALLKLAEGD YQQVEKLMSK NADHAEQPVV NYLLAAEAAQ QRGDEARANQ HLERAAELAG NDTIPVEITR VRLQLARNEN HAARHGVDKL LEVTPRHPEV LRLAEQAYIR TSAWSSLLDI IPSMAKAHVG DEAHRAMLEQ QAWIGLMDQA RAEQGSEGLR TWWKNQSRKT RHQVALQVAM AEHLIECDDH DMAQQIIIDG LKRQYDDRLV LPIPRLRTNN PEQLEKVLRQ QIKTVGDRPL LWSTLGQSLM KHGEWQEATL AFRAALKQRP DAYDYAWLAD ALDRLHQPEE AAAMRRDGLM LTLQNNPSQ
|
| |