Gene EcSMS35_3097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3097 
Symbol 
ID6144156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3184160 
End bp3185296 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content52% 
IMG OID641617965 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001745116 
Protein GI170681312 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.000345052 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC 
CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTATGTT
CAGCATCTGC TTAACGATCT GGACAACGAT GTGGCTTACG CTCAGAGCCG TGAAGTAAAG
ACAATTTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCCCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGTGCGCG TTTGCCGCTG GCAGCGGATG CAGAAATTAC TATGGAAGCG
AACCCTGGTA CGGTAGAAGC CGATCGCTTT GTCGATTATC AGCGTGCTGG TGTGAACCGC
ATCTCTATTG GTGTGCAGAG TTTTAGCGAA GAAAAGCTGA AACGACTTGG GCGCATTCAT
GGCCCGCAAG AAGCGAAACG AGCTGCGAAG CTGGCGAGCG GTTTAGGGTT ACGTAGCTTT
AACCTCGATT TGATGCATGG GCTGCCGGAT CAATCACTGG AAGAGGCGCT AGGCGATCTG
CGCCAGGCTA TTGAACTGAA TCCTCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCT
AATACGCTGT TTGGCTCTCG CCCACCGGTG CTGCCGGACG ACGATGCGTT GTGGGATATA
TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGCTATC AGCAGTATGA AACATCCGCT
TACGCCAAAC CAGGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGATT TGGCGACTAC
ATCGGTATTG GCTGCGGCGC GCACGGCAAA GTCACCTTCC TGGATGGTCG CATTCTGCGT
ACCACCAAAA CGCGTCATCC GCGTGGTTTT ATGCAGGGAA GATATCTGGA AAGCCAGCGT
GATGTCGATG CTGCAGATAA ACCGTTTGAG TTCTTTATGA ATCGCTTCCG TCTGCTGGAA
GCCGCGCCGC GCGTGGAGTT TAGCCAGTAT ACTGGCCTTT CTGAAGAAGT TATTCGCCCA
CAGTTAGAAG AGGCTATCGC TCAGGGTTAT CTCACAGAAT GTGCAGATTA CTGGCAGATA
ACGGAACATG GGAAGTTGTT TTTAAATTCG TTGCTGGAGC TTTTTCTGGC GGAGTAA
 
Protein sequence
MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDND VAYAQSREVK 
TIFIGGGTPS LLSGPAMQTL LDGVRARLPL AADAEITMEA NPGTVEADRF VDYQRAGVNR
ISIGVQSFSE EKLKRLGRIH GPQEAKRAAK LASGLGLRSF NLDLMHGLPD QSLEEALGDL
RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPGYQCQH NLNYWRFGDY IGIGCGAHGK VTFLDGRILR TTKTRHPRGF MQGRYLESQR
DVDAADKPFE FFMNRFRLLE AAPRVEFSQY TGLSEEVIRP QLEEAIAQGY LTECADYWQI
TEHGKLFLNS LLELFLAE