Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4317 |
Symbol | hemY |
ID | 5585971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4306958 |
End bp | 4308154 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640927934 |
Product | putative protoheme IX biogenesis protein |
Protein accession | YP_001465283 |
Protein GI | 157157643 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | [TIGR00540] hemY protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00516083 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAG TGTTATTACT CTTTGTTTTG CTGATTGCGG GGATAGTGGT TGGCCCGATG ATTGCCGGTC ATCAGGGTTA CGTGCTGATC CAGACTGACA ACTACAATAT CGAAACCAGC GTCACGGGCC TGGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG CTACTGCGGC GGATCTTCCG CACAGGCGCA CACACCCGTG GGTGGTTTGT CGGACGTAAG CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTGGCGGA AGGCGATTAT CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CGGAACAACC GGTGGTGAAC TATCTACTGG CTGCCGAAGC CGCGCAACAA CGTGGTGATG AAGCACGCGC CAACCAACAT CTGGAACGCG CAGCAGAGCT GGCCGGCAAT GACACCATTC CGGTAGAAAT CACCCGTGTA CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACA GGTGCATGGA GTTCTCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA TGTTGGTGAT GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT GCCGATAACG GTAGTGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT ACTGCCCAGC AAATTATCAT CGATGGCCTG AAACGCCAGT ACGACGATCG CCTACTGCTG CCGATTCCTC GACTGAAAAC CAACAATCCG GAACAGCTGG AAAAAGTGCT GCGCCAGCAG ATCAAAAACG TCGGCGATCG CCCGCTGTTG TGGAGCACAC TGGGCCAGTC ACTGATGAAG CACGGAGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAGCC GGAAGAAGCT GCAGCGATGC GTCGCGACGG TTTGATGTTA ACGTTGCAGA ACAACCCACC ACAGTAG
|
Protein sequence | MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ
|
| |