Gene Information |
Locus tag | EcSMS35_3805 |
Symbol | |
ID | 6144110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3871574 |
End bp | 3872911 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618631 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_001745771 |
Protein GI | 170680480 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | |
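The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information record above.
START_BP = 3871574
END_BP = 3872911
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues in a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1338
    print(protein_length(GENE_LENGTH_BP))  # 445
```

Under these assumptions, both computed values match the Gene Length and Protein Length fields listed in the record.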
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0536703 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACAA ATAACACCCT GGATCTCACG CCACATTTTG CTTTAGACGG TGATCAGCCC TTTAAAGACA GGCGCGCCAT GATGCCGTTT CGTGGGGCCA TTCCGGTTGC CAAAGAGCAA CTGGCACAGA CCTGGCAAGA GATGATCAAT CAAACGGTAT CACCACGTAA GCGACTGGTT TACCTACACA TTCCTTTTTG TGCGACACAC TGCACGTTTT GTGGTTTTTA TCAGAACCGT TTTAATGAAG ATGCGTGTGC TCATTATACC GACGCCCTGA TTCGTGAAAT AGAACTGGAA GCAGACAGCG TATTGCATCA GTCTGCCCCC ATCCACGCGG TCTATTTCGG TGGCGGTACG CCTTCCGCGC TTTCGGCACA CGATTTGGCC AGAATCATCA CCACATTACG AGAAAAGCTG CCACTGGCCC CTGATTGTGA AATCACTATC GAAGGCCGGG TACTGAATTT TGACGCTGAG CGAATCGATG CTTGCCTTGA TGCTGGAGCA AACCGCTTCT CAATTGGTAT TCAGTCGTTT AACAGCAAAA TCCGCAAGAA AATGGCCCGC ACCTCGGATG GCCCAACCGC TATTGCGTTT ATGGAAAGCC TGGTTAAACG CGACCGTGCC GCGGTGGTCT GTGACCTGCT GTTTGGTCTG CCGGGTCAGG ATGCGCAAAC CTGGGGAGAA GATCTGGCTA TTGCCCGCGA TATCGGTCTC GACGGCGTCG ATCTCTATGC GCTCAATGTC CTGCCCAATA CACCGCTGGG CAAAGCCGTG GAAAATGGGC GTACTACCGT GCCCTCTCCG GCAGAACGTC GCGATCTTTA CCTGCAAGGG TGTGATTTTA TGGACGATGC CGGTTGGCGC TGCATCAGTA ACAGCCACTG GGGCCGTACC ACACGCGAAC GCAATCTCTA TAACCTGCTG ATAAAACAAG GTGCCGATTG TCTGGCCTTT GGTTCCGGAG CCGGTGGTTC GATTAATGGT TACTCCTGGA TGAACGAACG CAATCTTCAG ACCTGGCATG AATCCGTCGC GGCAGGCAAA AAACCGCTGA TGATGATCAT GCGTAACGCC GAACGTGATG CGCAATGGCG TCATACCTTG CAGTCAGGTG TTGAAACAGC GCGTGTACCG CTGGACGAAC TTACGCCACA TGCCGAAAAA CTCGCGCCGT TACTGGCTCA ATGGCACCAA AAAGGCTTAA GCCGCGATGC GTCAACTTGC CTGCGGCTGA CTAATGAAGG TCGTTTCTGG GCAAGCAATA TTTTGCAGTC TCTTAATGAA CTAATTCAGG TACTTAATGC GCCAGCGATT GTGCGTGAAA AACCATAA
|
Protein sequence | MNTNNTLDLT PHFALDGDQP FKDRRAMMPF RGAIPVAKEQ LAQTWQEMIN QTVSPRKRLV YLHIPFCATH CTFCGFYQNR FNEDACAHYT DALIREIELE ADSVLHQSAP IHAVYFGGGT PSALSAHDLA RIITTLREKL PLAPDCEITI EGRVLNFDAE RIDACLDAGA NRFSIGIQSF NSKIRKKMAR TSDGPTAIAF MESLVKRDRA AVVCDLLFGL PGQDAQTWGE DLAIARDIGL DGVDLYALNV LPNTPLGKAV ENGRTTVPSP AERRDLYLQG CDFMDDAGWR CISNSHWGRT TRERNLYNLL IKQGADCLAF GSGAGGSING YSWMNERNLQ TWHESVAAGK KPLMMIMRNA ERDAQWRHTL QSGVETARVP LDELTPHAEK LAPLLAQWHQ KGLSRDASTC LRLTNEGRFW ASNILQSLNE LIQVLNAPAI VREKP
|
| |
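The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that step together with a simple GC-fraction calculation; the helper names `ungroup` and `gc_fraction` are hypothetical, and the sample string is only the first 20 bases of the gene sequence shown above.

```python
def ungroup(sequence: str) -> str:
    """Remove the whitespace used to display the sequence in blocks of 10."""
    return "".join(sequence.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = ungroup(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence from the record above.
    sample = "ATGAACACAA ATAACACCCT"
    print(ungroup(sample))       # ATGAACACAAATAACACCCT
    print(len(ungroup(sample)))  # 20
    print(gc_fraction(sample))   # 0.35
```

Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field in the record; the figure for the 20-base sample is not representative of the whole gene.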