Gene Information |
Locus tag | SeD_A4324 |
Symbol | hemY |
ID | 6875794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4170530 |
End bp | 4171729 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642787252 |
Product | putative protoheme IX biogenesis protein |
Protein accession | YP_002217868 |
Protein GI | 198243522 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | [TIGR00540] hemY protein |
|
|
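The coordinate and length fields above agree with one another, and a few lines of arithmetic can confirm it. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the gene length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(4170530, 4171729)
print(length_bp, protein_length(length_bp))  # 1200 399
```

Under these assumptions the result matches the listed gene length of 1200 bp and protein length of 399 aa.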
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00653056 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTAA AAGTATTATT GCTCTTTGTG TTGTTGCTTG CGGGTATCGT GGTCGGCCCG ATGATAGCCG GTCACCAGGG CTATGTGTTA ATCCAGACCG ATAACTACAA CATTGAAACC AGCGTCACCG GCCTGGCGAT CATTCTCATC GTCGCTATGG TGGTGCTATT CGCCATTGAG TGGTTACTAC GCCGTCTGTT TCGTACCGGC GCGCACACCC GCGGTTGGTT TGCCGGTCGT AAACGCCGTC GCGCCCGCAA GCAGACCGAA CAGGCGCTGC TGAAGCTGGC GGAGGGCGAC TATCAGCAGG TTGAAAAGCT GATGTCAAAA AATGCCGACC ATGCTGAACA ACCGGTGGTA AATTATCTGC TGGCGGCAGA AGCGGCGCAG CAGCGCGGCG ATGAAGCACG CGCTAATCAG CACCTTGAAC GTGCGGCAGA GCTGGCGGGG AACGACACCA TCCCGGTCGA GATCACGCGC GTGCGCTTAC AGCTGGCGCG CAATGAAAAT CACGCAGCGC GTCACGGCGT GGACAAACTG CTGGAGGTCA CGCCGCGTCA CCCGGAAGTT CTGCGTCTGG CGGAGCAGGC CTATATTCGC ACCAGCGCAT GGAGTTCTTT ACTGGATATC ATCCCATCCA TGGCGAAAGC GCACGTCGGC GATGAAGCGC ATCGCGCCAT ACTCGAACAA CAGGCGTGGA TAGGACTCAT GGATCAGGCG CGCGCCGAGC AGGGCAGCGA AGGATTACGC ACCTGGTGGA AAAACCAAAG CCGTAAAACC CGCCATCAGG TGGCGCTGCA GGTCGCGATG GCCGAGCATC TGATCGAATG TGACGATCAT GATATGGCGC AGCAGATTAT CATCGACGGT CTGAAACGCC AGTATGACGA TCGGCTGGTG CTGCCCATTC CTCGCCTCAG AACCAATAAT CCGGAACAAC TGGAGAAAGT GCTGCGTCAG CAAATCAAGA CGGTAGGCGA TCGCCCGCTG TTATGGAGCA CGCTCGGCCA GTCGCTAATG AAACACGGTG AATGGCAGGA GGCAACCCTG GCTTTCCGCG CCGCGCTAAA ACAACGCCCG GACGCGTATG ATTATGCCTG GCTTGCCGAT GCGCTTGATC GACTGCATCA ACCGGAAGAG GCCGCCGCCA TGCGGCGCGA TGGCTTGATG CTGACATTAC AGAACAATCC CCCGCAGTAA
|
Protein sequence | MMLKVLLLFV LLLAGIVVGP MIAGHQGYVL IQTDNYNIET SVTGLAIILI VAMVVLFAIE WLLRRLFRTG AHTRGWFAGR KRRRARKQTE QALLKLAEGD YQQVEKLMSK NADHAEQPVV NYLLAAEAAQ QRGDEARANQ HLERAAELAG NDTIPVEITR VRLQLARNEN HAARHGVDKL LEVTPRHPEV LRLAEQAYIR TSAWSSLLDI IPSMAKAHVG DEAHRAILEQ QAWIGLMDQA RAEQGSEGLR TWWKNQSRKT RHQVALQVAM AEHLIECDDH DMAQQIIIDG LKRQYDDRLV LPIPRLRTNN PEQLEKVLRQ QIKTVGDRPL LWSTLGQSLM KHGEWQEATL AFRAALKQRP DAYDYAWLAD ALDRLHQPEE AAAMRRDGLM LTLQNNPPQ
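The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to check this by hand, assuming the gene sequence is given 5' to 3' on the coding strand, as it appears here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built inline rather than taken from a library. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, so the standard assignments are used. Only the first 30 bases are shown, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares; '*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
prefix = "ATGATGCTAA AAGTATTATT GCTCTTTGTG"
print(translate(prefix))  # MMLKVLLLFV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.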