Gene EcSMS35_0133 details

Gene Information

Locus tag: EcSMS35_0133
Symbol: cueO
ID: 6144781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 147291
End bp: 148841
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 54%
IMG OID: 641615034
Product: multicopper oxidase
Protein accession: YP_001742250
Protein GI: 170682517
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
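
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 147291
END_BP = 148841
GENE_LENGTH_BP = 1551
PROTEIN_LENGTH_AA = 516


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid-encoding codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 148841 - 147291 + 1 gives 1551 bp, and 1551 / 3 - 1 gives 516 aa, which agrees with the listed lengths.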


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0240845
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGTC GTGATTTCTT AAAATATTCC GTCGCGCTGG GTGTGGCTTC GGCTTTGCCG 
CTGTGGAGCC GCGCAGTATT TGCGGCAGAA CGCCCAACGT TACCGATCCC TGATTTGCTC
ACGACCGATG CCCGTAATCG CATTCAGTTA ACTATTGGCG CAGGTCAGTC CACCTTTGGC
GGGAAAACCG CAACTACCTG GGGCTATAAC GGCAATCTGC TGGGGCCGGC GGTGAAATTA
CAGCGCGGCA AAGCGGTAAC GGTTGATATC TACAACCAAC TGACGGAAGA GACGACGCTG
CACTGGCACG GGCTGGAAGT ACCGGGTGAA GTCGACGGCG GCCCGCAGGG AATTATTCCG
CCAGGTGGCA AGCGCTCGGT GACGTTGAAC GTTGATCAAC CTGCCGCTAC CTGCTGGTTC
CATCCACATC AACATGGCAA GACCGGGCGA CAGGTGGCGA TGGGGCTGGC TGGGCTGGTG
GTAATTGAAG ATGACGAGAT CCTGAAATTA ATGCTGCCAA AACAGTGGGG TATTGATGAT
GTCCCGGTGA TTGTTCAGGA TAAAAAATTT AACGCTGACG GGCAGATTGA TTATCAACTG
GATGTGATGA CCGCCGCCGT GGGCTGGTTT GGTGATACGT TGCTGACCAA CGGCGCTATC
TACCCGCAAC ACGCTGCCCC GCGTGGCTGG CTTCGCCTGC GTTTGCTCAA TGGCTGTAAT
GCCCGCTCGC TTAATTTCGC CACCAGCGAC AATCGCCCGC TGTATGTGAT TGCCAGCGAC
GGTGGTCTGC TACCTGAACC GGTGAAGGTG AACGAGCTGC CAGTGCTGAT GGGCGAGCGT
TTTGAAGTGC TGGTGGAAGT TAACGACAAC AAACCCTTTG ACCTGGTGAC GCTGCCGGTC
AGCCAGATGG GGATGGCGAT TGCGCCGTTT GACAAGCCGC ATCCGGTAAT GCGGATTCAG
CCGATTGCCA TTAGTGCCTC TGGTGCTTTG CCAGACACAT TAAGTAGCCT GCCTGCGTTA
CCTTCGCTGG AAGGGCTGAC GGTACGCAAG CTGCAACTTT CTATGGACCC AATGCTCGAT
ATGATGGGGA TGCAGATGCT GATGGAGAAA TATGGCGATC AGGCGATGGC CGGGATGGAT
CACAGCCAGA TGATGGGCCA TATGGGGCAC GGCAATATGA ATCATATGAA CCACGGCGGG
AAGTTCGATT TCCACCATGC CAATAAAATC AACGGTCAGG CGTTTGATAT GAACAAGCCG
ATGTTTGCGG CGGCGAAAGG GCAATACGAA CGTTGGGTTA TCTCTGGCGT GGGCGACATG
ATGCTGCATC CGTTCCATAT CCACGGTACG CAGTTCCGTA TCTTGTCAGA AAATGGCAAA
CCGCCAGCGA CTCATCGCGC AGGCTGGAAA GATACCGTTA AGGTAGAAGG TAATGTCAGC
GAAGTGCTGG TGAAGTTTAA TCACGATGCA CCGAAAGAAC ATGCTTATAT GGCGCACTGC
CATCTGCTGG AGCATGAAGA TACGGGGATG ATGTTAGGGT TTACGGTATA A
 
Protein sequence
MQRRDFLKYS VALGVASALP LWSRAVFAAE RPTLPIPDLL TTDARNRIQL TIGAGQSTFG 
GKTATTWGYN GNLLGPAVKL QRGKAVTVDI YNQLTEETTL HWHGLEVPGE VDGGPQGIIP
PGGKRSVTLN VDQPAATCWF HPHQHGKTGR QVAMGLAGLV VIEDDEILKL MLPKQWGIDD
VPVIVQDKKF NADGQIDYQL DVMTAAVGWF GDTLLTNGAI YPQHAAPRGW LRLRLLNGCN
ARSLNFATSD NRPLYVIASD GGLLPEPVKV NELPVLMGER FEVLVEVNDN KPFDLVTLPV
SQMGMAIAPF DKPHPVMRIQ PIAISASGAL PDTLSSLPAL PSLEGLTVRK LQLSMDPMLD
MMGMQMLMEK YGDQAMAGMD HSQMMGHMGH GNMNHMNHGG KFDFHHANKI NGQAFDMNKP
MFAAAKGQYE RWVISGVGDM MLHPFHIHGT QFRILSENGK PPATHRAGWK DTVKVEGNVS
EVLVKFNHDA PKEHAYMAHC HLLEHEDTGM MLGFTV
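
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene sequence listed in this record. This is a minimal sketch, not the pipeline that produced the record: the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch ignores), and `translate` is a hypothetical helper.

```python
# Build a codon -> amino acid map in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
FIRST_60_BASES = (
    "ATGCAACGTC GTGATTTCTT AAAATATTCC GTCGCGCTGG GTGTGGCTTC GGCTTTGCCG"
)

if __name__ == "__main__":
    print(translate(FIRST_60_BASES))
```

The result corresponds to the first 20 residues of the protein sequence listed above (MQRRDFLKYS VALGVASALP).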