Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3097 |
Symbol | |
ID | 6144156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3184160 |
End bp | 3185296 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617965 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_001745116 |
Protein GI | 170681312 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.000345052 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTATGTT CAGCATCTGC TTAACGATCT GGACAACGAT GTGGCTTACG CTCAGAGCCG TGAAGTAAAG ACAATTTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCCCGGCGAT GCAAACGCTG CTGGACGGCG TGCGTGCGCG TTTGCCGCTG GCAGCGGATG CAGAAATTAC TATGGAAGCG AACCCTGGTA CGGTAGAAGC CGATCGCTTT GTCGATTATC AGCGTGCTGG TGTGAACCGC ATCTCTATTG GTGTGCAGAG TTTTAGCGAA GAAAAGCTGA AACGACTTGG GCGCATTCAT GGCCCGCAAG AAGCGAAACG AGCTGCGAAG CTGGCGAGCG GTTTAGGGTT ACGTAGCTTT AACCTCGATT TGATGCATGG GCTGCCGGAT CAATCACTGG AAGAGGCGCT AGGCGATCTG CGCCAGGCTA TTGAACTGAA TCCTCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCT AATACGCTGT TTGGCTCTCG CCCACCGGTG CTGCCGGACG ACGATGCGTT GTGGGATATA TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGCTATC AGCAGTATGA AACATCCGCT TACGCCAAAC CAGGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGATT TGGCGACTAC ATCGGTATTG GCTGCGGCGC GCACGGCAAA GTCACCTTCC TGGATGGTCG CATTCTGCGT ACCACCAAAA CGCGTCATCC GCGTGGTTTT ATGCAGGGAA GATATCTGGA AAGCCAGCGT GATGTCGATG CTGCAGATAA ACCGTTTGAG TTCTTTATGA ATCGCTTCCG TCTGCTGGAA GCCGCGCCGC GCGTGGAGTT TAGCCAGTAT ACTGGCCTTT CTGAAGAAGT TATTCGCCCA CAGTTAGAAG AGGCTATCGC TCAGGGTTAT CTCACAGAAT GTGCAGATTA CTGGCAGATA ACGGAACATG GGAAGTTGTT TTTAAATTCG TTGCTGGAGC TTTTTCTGGC GGAGTAA
|
Protein sequence | MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDND VAYAQSREVK TIFIGGGTPS LLSGPAMQTL LDGVRARLPL AADAEITMEA NPGTVEADRF VDYQRAGVNR ISIGVQSFSE EKLKRLGRIH GPQEAKRAAK LASGLGLRSF NLDLMHGLPD QSLEEALGDL RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA YAKPGYQCQH NLNYWRFGDY IGIGCGAHGK VTFLDGRILR TTKTRHPRGF MQGRYLESQR DVDAADKPFE FFMNRFRLLE AAPRVEFSQY TGLSEEVIRP QLEEAIAQGY LTECADYWQI TEHGKLFLNS LLELFLAE
|
| |