Gene EcDH1_0740 details

Gene Information

Locus tag: EcDH1_0740
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 785360
End bp: 786496
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: ACX38424
Protein GI: 260448002
COG category:
COG ID:
TIGRFAM ID:
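
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(785360, 786496)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both values agree with the Gene Length and Protein Length fields listed for this record.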


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAAAT TACCTCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC 
CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTATGTT
CAGCATCTGC TTAACGATCT GGACAACGAT GTGGCTTACG CTCAGGGCCG TGAAGTAAAG
ACAATCTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCCCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGTGCGCG TTTGCCGCTG GCAGCGGATG CAGAAATTAC TATGGAAGCG
AACCCTGGCA CGGTAGAAGC CGATCGCTTT GTCGATTATC AGCGTGCTGG TGTGAACCGC
ATCTCTATTG GTGTGCAGAG TTTTAGCGAA GAAAAGCTGA AACGACTTGG GCGTATTCAT
GGCCCGCAAG AAGCGAAACG CGCGGCGAAG CTGGCGAGCG GTTTAGGGTT ACGTAGCTTT
AACCTTGATT TGATGCATGG GCTGCCGGAT CAATCACTGG AAGAGGCGCT TGGCGATCTA
CGCCAGGCCA TTGAACTGAA TCCGCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCC
AATACGCTGT TTGGTTCGCG ACCACCGGTG CTGCCGGACG ATGACGCGTT GTGGGATATA
TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGTTATC AGCAATATGA AACTTCCGCT
TACGCCAAAC CCGGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGCTT TGGTGACTAC
ATCGGTATTG GCTGCGGCGC ACACGGCAAA GTGACCTTCC CGGATGGGCG CATTCTGCGT
ACCACCAAAA CGCGTCATCC GCGTGGTTTT ATGCAAGGAA GGTATCTGGA AAGCCAGCGT
GATGTCGAAG CCACAGATAA GCCGTTTGAG TTCTTTATGA ATCGCTTCCG TCTGCTGGAG
GCCGCGCCGC GCGTGGAGTT TATTGCGTAT ACCGGGCTTT GCGAAGATGT GATTCGCCCA
CAGTTAGACG AGGCGATTGC CCAGGGTTAT CTCACCGAAT GTGCGGATTA CTGGCAGATA
ACGGAACATG GGAAGCTGTT TTTAAATTCG CTGCTGGAGC TTTTTCTGGC TGAGTAA
 
Protein sequence
MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDND VAYAQGREVK 
TIFIGGGTPS LLSGPAMQTL LDGVRARLPL AADAEITMEA NPGTVEADRF VDYQRAGVNR
ISIGVQSFSE EKLKRLGRIH GPQEAKRAAK LASGLGLRSF NLDLMHGLPD QSLEEALGDL
RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPGYQCQH NLNYWRFGDY IGIGCGAHGK VTFPDGRILR TTKTRHPRGF MQGRYLESQR
DVEATDKPFE FFMNRFRLLE AAPRVEFIAY TGLCEDVIRP QLDEAIAQGY LTECADYWQI
TEHGKLFLNS LLELFLAE
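
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC percentage calculation. The helper names are hypothetical, and the short input shown is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small demo.
fragment = clean_sequence("ATGGTTAAAT TACCTCCGCT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # GC percentage of this fragment only
```

Applied to the full gene sequence, `len(clean_sequence(...))` should match the Gene Length field and `gc_percent` can be compared with the GC content field.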