Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0398 |
Symbol | hemB |
ID | 6142825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 411063 |
End bp | 412037 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615294 |
Product | delta-aminolevulinic acid dehydratase |
Protein accession | YP_001742501 |
Protein GI | 170680798 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0113] Delta-aminolevulinic acid dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT TAATCCAACG CCCTCGTCGC CTGCGCAAAT CTCCTGCGCT GCGCGCTATG TTTGAAGAGA CAACACTTAG CCTTAACGAC CTGGTGTTGC CGATCTTTGT TGAAGAAGAA ATTGACGACT ACAAAGCCGT TGAAGCCATG CCAGGCGTGA TGCGCATTCC AGAGAAACAT CTGGCACGCG AAATTGAACG TATCGCCAAC GCAGGTATTC GTTCCGTGAT GACTTTCGGC ATCTCTCACC ATACCGATGA AACCGGCAGC GATGCCTGGC GGGAAGATGG TCTGGTGGCG CGTATGTCGC GCATCTGCAA GCAGACCGTG CCAGAAATGA TCGTCATGTC AGACACCTGT TTCTGTGAAT ACACTTCTCA CGGTCACTGC GGTGTGCTGT GCGAGCATGG CGTCGACAAC GACGCGACCC TGGAAAATTT AGGCAAGCAG GCCGTGGTTG CAGCTGCTGC AGGTGCAGAC TTCATCGCCC CTTCCGCCGC AATGGATGGC CAGGTACAAG CGATTCGCCA GGCGCTGGAC GCTGCGGGCT TTAAAGATAC GGCGATTATG TCGTATTCGA CCAAGTTTGC CTCTTCCTTT TATGGCCCGT TCCGTGAAGC TGCCGGAAGC GCATTAAAAG GCGACCGCAA AAGCTATCAG ATGAACCCAA TGAACCGTCG TGAGGCGATT CATGAGTCAC TGCTGGATGA AGCCCAGGGC GCAGACTGCC TGATGGTTAA ACCTGCTGGA GCGTACCTCG ACATCGTGCG TGAGCTGCGT GAACGTACTG AATTGCCGAT TGGCGCGTAT CAGGTGAGCG GTGAGTATGC GATGATTAAG TTCGCCGCGC TGGCGGGTGC TATAGATGAA GAGAAAGTTG TGCTCGAAAG CTTAGGTTCG ATTAAGCGTG CGGGTGCGGA TCTGATTTTC AGCTACTTTG CGCTGGATTT GGCTGAGAAG AAGATTCTGC GTTAA
|
Protein sequence | MTDLIQRPRR LRKSPALRAM FEETTLSLND LVLPIFVEEE IDDYKAVEAM PGVMRIPEKH LAREIERIAN AGIRSVMTFG ISHHTDETGS DAWREDGLVA RMSRICKQTV PEMIVMSDTC FCEYTSHGHC GVLCEHGVDN DATLENLGKQ AVVAAAAGAD FIAPSAAMDG QVQAIRQALD AAGFKDTAIM SYSTKFASSF YGPFREAAGS ALKGDRKSYQ MNPMNRREAI HESLLDEAQG ADCLMVKPAG AYLDIVRELR ERTELPIGAY QVSGEYAMIK FAALAGAIDE EKVVLESLGS IKRAGADLIF SYFALDLAEK KILR
|
| |