Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_4177 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4526588 |
End bp | 4527784 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | HemY protein |
Protein accession | ACX41777 |
Protein GI | 260451355 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.849055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAG TGTTATTGCT CTTTGTGTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG ATTGCCGGCC ATCAGGGTTA TGTGCTGATC CAGACCGACA ACTACAATAT CGAAACCAGC GTCACGGGCC TGGCGATCAT ATTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG CTACTGCGGC GGATCTTCCG CACTGGCGCG CACACCCGTG GGTGGTTTGT CGGACGTAAG CGTCGCCGTG CACGTAAGCA GACCGAACAG GCGCTGCTGA AACTGGCGGA AGGCGATTAT CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CGGAACAACC GGTGGTGAAC TATCTACTGG CTGCCGAAGC CGCGCAACAA CGTGGTGATG AAGCACGCGC CAACCAACAT CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACCATTC CGGTAGAAAT CACCCGCGTA CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACA GGTGCATGGA GTTCGCTGCT GGATATTATC CCATCAATGG CGAAAGCCCA TGTTGGTGAT GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT GCCGATAACG GTAGCGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT ACTGCCCAGC AAATTATCAT CGATGGCCTG AAACGCCAGT ACGACGATCG CCTACTGCTG CCGATTCCTC GACTGAAAAC AAACAATCCG GAACAGCTGG AAAAAGTGCT GCGCCAGCAA ATCAAAAACG TCGGCGATCG CCCGCTGTTG TGGAGCACAC TGGGCCAGTC ACTGATGAAG CACGGAGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCAG CGCTGAAACA ACGTCCGGAC GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAGCC GGAAGAAGCT GCAGCTATGC GTCGCGACGG TTTGATGTTA ACGTTGCAGA ATAACCCGCC ACAGTAG
|
Protein sequence | MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ
|
| |