Gene ECH74115_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0130 
SymbolcueO 
ID6968717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp141436 
End bp142986 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content54% 
IMG OID643384207 
Productmulticopper oxidase 
Protein accessionYP_002268730 
Protein GI209399964 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00889109 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGTC GTGATTTCTT GAAATATTCC GTCGCACTGG GTGTGGCTTC AGCCTTGCCA 
CTGTGGAGCC GTGCAGTCTT TGCGGCGGAA CGCCCAACGT TACCGATCCC TGATTTGCTC
ACGACCGATG CCCGTAATCG CATTCAGTTA ACTATTGGCG CAGGTCAGTC TACCTTTGGC
GAGAAAACTG CAACTACCTG GGGCTATAAC GGCAATCTGC TGGGGCCGGC GGTGAAATTA
CAGCGCGGCA AAGCGGTAAC GGTTGATATC TATAACCAAC TGACGGAAGA GACGACGTTG
CACTGGCACG GGCTGGAAGT ACCGGGTGAA GTCGACGGCG GCCCACAGGG AATTATTCCG
CCAGGTGGCA AGCGCTCGGT GACGTTGAAC GTTGATCAAC CTGCCGCTAC CTGCTGGTTC
CATCCACATC AACATGGCAA GACCGGGCGA CAGGTGGCGA TGGGGCTGGC TGGTCTGGTG
GTGATTGAAG ATGACGAGAT CCTGAAATTA ATGCTGCCAA AACAGTGGGG TATCGATGAT
GTTCCGGTGA TCGTTCAGGA TAAGAAATTT AGCGCCGACG GGCAGATTGA TTATCAACTG
GATGTGATGA CCGCCGCCGT GGGCTGGTTT GGCGATACGT TGCTGACCAA CGGTGCAATC
TACCCGCAAC ACGCTGCCCC GCGTGGTTGG CTGCGCCTGC GTTTGCTCAA TGGCTGTAAT
GCCCGCTCGC TCAATTTCGC CACCAGCGAC AATCGCCCGC TTTATGTGAT TGCCAGCGAC
GGTGGTCTGC TACCTGAACC GGTGAAGGTG AACGAGCTGC CGGTGCTGAT GGGCGAGCGT
TTTGAAGTGC TGGTGGAGGT TAACGACAAC AAACCCTTTG ACCTGGTGAC GCTGCCGGTC
AGCCAGATGG GGATGGCGAT TGCGCCGTTT GATAAGCCTC ATCCGGTAAT GCGGATTCAG
CCGATTGCTA TTAGTGCTTC CGGTGCTTTG CCAGACACAT TAAGTAGCCT GCCTGCGTTA
CCTTCGCTGG AAGGGCTGAC GGTACGCAAG CTGCAACTTT CTATGGACCC GATGCTCGAT
ATGATGGGGA TGCAGATGCT AATGGAGAAA TATGGCGATC AGGCGATGGC CGGAATGGAT
CACAGCCAGA TGATGGGCCA TATGGGGCAC GGCAATATGA ATCATATGAA CCACGGCGGG
AAGTTCGATT TCCACCATGC CAATAAAATC AACGGTCAGG CGTTTGATAT GAATAAGCCG
ATGTTTGCGG CGGCGAAAGG GCAGTACGAA CGTTGGGTTA TCTCTGGCGT GGGCGACATG
ATGCTGCATC CGTTCCATAT TCACGGCACG CAGTTCCGTA TCTTGTCAGA AAATGGCAAA
CCGCCAGCGG CTCATCGCGC GGGCTGGAAA GATACCGTTA AGGTAGAAGG CAATGTCAGT
GAAGTGCTGG TGAAGTTTAA TCACGACGCA CCGAAAGAAC GTGCTTATAT GGCGCACTGC
CATCTGCTGG AGCATGAAGA TACGGGGATG ATGTTAGGGT TTACGGTATA A
 
Protein sequence
MQRRDFLKYS VALGVASALP LWSRAVFAAE RPTLPIPDLL TTDARNRIQL TIGAGQSTFG 
EKTATTWGYN GNLLGPAVKL QRGKAVTVDI YNQLTEETTL HWHGLEVPGE VDGGPQGIIP
PGGKRSVTLN VDQPAATCWF HPHQHGKTGR QVAMGLAGLV VIEDDEILKL MLPKQWGIDD
VPVIVQDKKF SADGQIDYQL DVMTAAVGWF GDTLLTNGAI YPQHAAPRGW LRLRLLNGCN
ARSLNFATSD NRPLYVIASD GGLLPEPVKV NELPVLMGER FEVLVEVNDN KPFDLVTLPV
SQMGMAIAPF DKPHPVMRIQ PIAISASGAL PDTLSSLPAL PSLEGLTVRK LQLSMDPMLD
MMGMQMLMEK YGDQAMAGMD HSQMMGHMGH GNMNHMNHGG KFDFHHANKI NGQAFDMNKP
MFAAAKGQYE RWVISGVGDM MLHPFHIHGT QFRILSENGK PPAAHRAGWK DTVKVEGNVS
EVLVKFNHDA PKERAYMAHC HLLEHEDTGM MLGFTV