Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4167 |
Symbol | hemY |
ID | 6145480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4266317 |
End bp | 4267513 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618990 |
Product | putative protoheme IX biogenesis protein |
Protein accession | YP_001746118 |
Protein GI | 170682558 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | [TIGR00540] hemY protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.130048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.307244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAAAG TGTTATTGCT CTTTGTTTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG ATTGCCGGTC ATCAGGGGTA CGTATTGATT CAGACCGACA ACTACAATAT CGAAACCAGC GTCACTGGCC TCGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG CTACTGCGGC GGATCTTCCG CACAGGCGCG CACACCCGTG GGTGGTTTGT CGGACGTAAG CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTAGCGGA AGGCGATTAT CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CTGAACAACC GGTGGTGAAC TATCTGCTGG CTGCCGAAGC CGCGCAACAA CGCGGTGATG AAGCACGCGC CAACCAACAT CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACAATTC CGGTAGAAAT CACCCGTGTA CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACT GGTGCATGGA GTTCGCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA CGTTGGCGAT GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT GCCGATAACG GTAGCGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT CATCAGGTGG CGTTACAGGT GGCAATGGCG GAACACCTTA TTGAGTGTGA TGACCATGAC ACCGCCCAGC AAATTATCAT CGACGGCCTG AAACGCCAGT ATGACGATCG CCTGCTGCTG CCGATCCCGC GTCTGAAAAC CAATAACCCG GAACAGCTGG AAAAAGTGCT GCGCCAACAG ATCAAAAACG TGGGCGATCG CCCGTTGTTG TGGAGCACAC TGGGTCAGTC GCTGATGAAG CACGGCGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAACC GGAAGAAGCC GCAGCGATGC GTCGCGATGG CTTGATGCTG ACCTTACAGA ATAACCCGTC ACAGTAG
|
Protein sequence | MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPSQ
|
| |