Gene Information |
Locus tag | ECD_03677 |
Symbol | hemY |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 3876932 |
End bp | 3878128 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | predicted protoheme IX synthesis protein |
Protein accession | ACT45470 |
Protein GI | 253979800 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
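The length fields above follow from the coordinates by simple arithmetic. A minimal Python sketch, assuming the start and end coordinates are inclusive (an assumption, though it is consistent with the listed 1197 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # the stop codon is not translated


length = gene_length_bp(3876932, 3878128)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Both results agree with the Gene Length and Protein Length rows in the record.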
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.28898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAG TGTTATTGCT CTTTGTTTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG ATTGCCGGTC ATCAGGGGTA CGTATTGATT CAGACCGACA ACTACAATAT CGAAACCAGC GTCACTGGCC TGGCGATTAT TTTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG CTACTGCGGC GGATCTTCCG CACAGGCGCA CACACCCGTG GGTGGTTTGT CGGACGTAAG CGTCGCCGTG CCCGTAAGCA GACCGAACAG GCGCTGCTGA AACTAGCGGA AGGCGATTAT CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CTGAACAACC GGTGGTGAAC TATCTGCTGG CTGCCGAAGC CGCGCAACAA CGCGGTGATG AAGCACGCGC CAACCAACAT CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACAATTC CGGTAGAAAT CACCCGTGTA CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACT GGTGCATGGA GTTCGCTGCT GGATATCATC CCATCAATGG CGAAAGCCCA CGTTGGCGAT GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT GCCGATAACG GTAGCGAAGG CTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT ACTGCCCAGC AAATTATCAT CGACGGCCTG AAACGCCAGT ATGACGATCG CCTACTGCTG CCGATCCCGC GTCTGAAAAC CAATAACCCG GAGCAGCTGG AAAAAGTGCT GCGCCAACAG ATCAAAAACG TGGGCGATCG CCCGTTGTTG TGGAGCACAC TGGGTCAGTC GCTGATGAAG CACGGCGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCGG CGCTGAAACA ACGTCCGGAC GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAACC AGAAGAAGCT GCAGCGATGC GCCGCGACGG TTTGATGTTA ACGTTGCAGA ACAACCCGCC ACAGTAG
|
Protein sequence | MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ
|
| |
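The sequences above are printed in space-separated groups of 10, so they need cleaning before any calculation. A minimal Python sketch of that step, plus a GC fraction and a basic CDS sanity check. The helper names are hypothetical, and the ATG-only start check is a simplification: translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two groups of the gene sequence above, used only as a short demo.
demo = clean_sequence("ATGCTAAAAG TGTTATTGCT")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.3
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content row of the record.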