Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_4176 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4525404 |
End bp | 4526585 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | protein of unknown function DUF513 hemX |
Protein accession | ACX41776 |
Protein GI | 260451354 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.743979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAC AAGAAAAAAC CTCCGCCGTG GTTGAAGAGA CCAGGGAGGC CGTGGACACC ACGTCACAAC CTGTCGCAAC AGAAAAAAAG AGTAAGAACA ATACCGCATT GATTCTCAGC GCGGTGGCTA TCGCTATTGC TCTGGCGGCG GGCATCGGTT TGTATGGCTG GGGTAAACAA CAGGCCGTCA ATCAGACCGC CACCAGCGAT GCCCTGGCTA ACCAACTGAC GGCATTGCAA AAAGCCCAGG AGAGCCAAAA AGCCGAGCTG GAAGGCATTA TTAAGCAACA AGCTGCACAA CTTAAGCAGG CGAATCGTCA GCAAGAAACG CTGGCAAAAC AGTTGGATGA AGTCCAACAA AAGGTCGCCA CCATTTCCGG CAGCGATGCT AAAACCTGGC TGCTGGCTCA GGCCGATTTT CTGGTGAAAC TCGCCGGACG GAAGCTGTGG AGCGATCAGG ACGTCACGAC CGCTGCAGCG TTGCTGAAAA GTGCAGACGC CAGCCTGGCG GATATGAATG ACCCGAGTCT GATTACCGTT CGTCGGGCAA TTACCGATGA TATCGCCAGC CTTTCTGCAG TATCGCAGGT GGATTATGAC GGCATCATCC TTAAGCTTAA TCAGCTTTCA AATCAGGTAG ATAACCTGCG TCTGGCCGAT AATGACAGCG ATGGTTCGCC GATGGATTCA GACGGTGAAG AGCTTTCCAG TTCCATCAGC GAATGGCGTA TCAATCTGCA AAAAAGCTGG CAGAACTTTA TGGACAACTT CATTACGATT CGCCGTCGTG ATGACACCGC CGTACCGCTG TTAGCGCCAA ATCAGGATAT CTATCTGCGC GAAAATATTC GCTCTCGCCT GCTGGTCGCA GCACAAGCTG TACCGCGTCA CCAGGAAGAG ACTTATCGCC AGGCGCTGGA GAACGTCTCC ACCTGGGTAC GTGCTTACTA CGATACTGAT GATGCCACCA CCAAAGCGTT CCTCGACGAG GTGGACCAGT TAAGCCAGCA AAATATCTCG ATGGATCTTC CGGAAACCCT GCAAAGCCAG GCGATGCTGG AAAAACTGAT GCAGACTCGC GTGCGTAACC TGCTGGCACA ACCGGCAGCG GGGACAACGG AAGCTAAACC TGCACCTGCA CCGCAAGCTG ATACTCCGGC AGCCGCGCCG CAAGGAGAAT AA
|
Protein sequence | MTEQEKTSAV VEETREAVDT TSQPVATEKK SKNNTALILS AVAIAIALAA GIGLYGWGKQ QAVNQTATSD ALANQLTALQ KAQESQKAEL EGIIKQQAAQ LKQANRQQET LAKQLDEVQQ KVATISGSDA KTWLLAQADF LVKLAGRKLW SDQDVTTAAA LLKSADASLA DMNDPSLITV RRAITDDIAS LSAVSQVDYD GIILKLNQLS NQVDNLRLAD NDSDGSPMDS DGEELSSSIS EWRINLQKSW QNFMDNFITI RRRDDTAVPL LAPNQDIYLR ENIRSRLLVA AQAVPRHQEE TYRQALENVS TWVRAYYDTD DATTKAFLDE VDQLSQQNIS MDLPETLQSQ AMLEKLMQTR VRNLLAQPAA GTTEAKPAPA PQADTPAAAP QGE
|
| |