Gene EcE24377A_3299 details

Gene Information

Locus tag: EcE24377A_3299
Symbol:
ID: 5590244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3312411
End bp: 3313547
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 54%
IMG OID: 640926936
Product: coproporphyrinogen III oxidase
Protein accession: YP_001464308
Protein GI: 157155940
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
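
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3312411
END_BP = 3313547


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.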


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC 
CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTACGTT
CAGCATCTGC TTTGCGATCT GGACAACGAT GTGGCTTACG CTCAGGGCCG TGAAGTAAAG
ACAATCTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCCCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGTGCGCG TTTGCCGCTG GCAGCGGATG CAGAAATTAC TATGGAAGCG
AACCCTGGTA CGGTAGAAAC CGATCGCTTT GTCGATTATC AGCGTGCTGG TGTGAACCGC
ATCTCTATTG GTGTGCAGAG TTTTAGCGAA GACAAGCTGA AACGACTTGG GCGTATTCAT
GGCCCGCAAG AAGCGAAACA CGCGGCAAAG CTGGCGAGCG GGCTGGGGCT GCGTAGCTTT
AACCTCGATT TGATGCATGG ACTACCGGAT CAATCACTGG AAGAGGCGCT TGGCGATCTA
CGCCAGGCCA TTGAACTGAA TCCGCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCT
AATACGCTGT TTGGTTCACG CCCGCCGGTA CTGCCGGACG ACGACGCGCT GTGGGATATA
TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGTTATC AGCAATATGA AACTTCCGCT
TACGCCAAAC CCAGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGCTT TGGCGACTAC
ATCGGTATTG GCTGCGGCGC GCACGGCAAA GTGACCTTCC CGGATGGGCG CATTCTGCGT
ACCACTAAAA CGCGTCATCC GCGTGGTTTT ATGCAAGGAA GGTATCTGGA AAGCCAGCGT
GATGTCGAAG CCGCAGATAA GCCGTTTGAG TTCTTTATGA ATCGCTTCCG TCTGCTGGAG
GCCGCGCCGC GCGTGGAGTT TATTGCGTAT ACCGGGCTTT GCGAAGATGT GATTCGCCCA
CAGTTAGACG AGGCGATTGC CCAGGGTTAT CTCACCGAAT GTGCGGATTA CTGGCAGATA
ACAGAACATG GGAAGCTGTT TTTAAATTCG CTGCTGGAGC TTTTTCTGGC TGAGTAA
 
Protein sequence
MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLCDLDND VAYAQGREVK 
TIFIGGGTPS LLSGPAMQTL LDGVRARLPL AADAEITMEA NPGTVETDRF VDYQRAGVNR
ISIGVQSFSE DKLKRLGRIH GPQEAKHAAK LASGLGLRSF NLDLMHGLPD QSLEEALGDL
RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPSYQCQH NLNYWRFGDY IGIGCGAHGK VTFPDGRILR TTKTRHPRGF MQGRYLESQR
DVEAADKPFE FFMNRFRLLE AAPRVEFIAY TGLCEDVIRP QLDEAIAQGY LTECADYWQI
TEHGKLFLNS LLELFLAE
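
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a short script. The following is a minimal sketch, not the method used to build this record: it builds the codon table from the standard NCBI base ordering (the amino acid assignments of translation table 11 are the same as in the standard code), strips the spaces and line breaks used in the display above, and stops at the first stop codon. The helper names are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing/line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 up to (not including) the first stop codon.

    Raises KeyError on codons with symbols other than A, C, G, T.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence shown above.
first_groups = "ATGGTTAAAT TACCGCCGCT GAGTCTCTAC"
print(translate(first_groups))  # MVKLPPLSLY
```

Pasting the full gene sequence into `translate` and `gc_percent` allows a comparison against the Protein sequence block and the GC content field.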