Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0133 |
Symbol | cueO |
ID | 6144781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 147291 |
End bp | 148841 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615034 |
Product | multicopper oxidase |
Protein accession | YP_001742250 |
Protein GI | 170682517 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0240845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGTC GTGATTTCTT AAAATATTCC GTCGCGCTGG GTGTGGCTTC GGCTTTGCCG CTGTGGAGCC GCGCAGTATT TGCGGCAGAA CGCCCAACGT TACCGATCCC TGATTTGCTC ACGACCGATG CCCGTAATCG CATTCAGTTA ACTATTGGCG CAGGTCAGTC CACCTTTGGC GGGAAAACCG CAACTACCTG GGGCTATAAC GGCAATCTGC TGGGGCCGGC GGTGAAATTA CAGCGCGGCA AAGCGGTAAC GGTTGATATC TACAACCAAC TGACGGAAGA GACGACGCTG CACTGGCACG GGCTGGAAGT ACCGGGTGAA GTCGACGGCG GCCCGCAGGG AATTATTCCG CCAGGTGGCA AGCGCTCGGT GACGTTGAAC GTTGATCAAC CTGCCGCTAC CTGCTGGTTC CATCCACATC AACATGGCAA GACCGGGCGA CAGGTGGCGA TGGGGCTGGC TGGGCTGGTG GTAATTGAAG ATGACGAGAT CCTGAAATTA ATGCTGCCAA AACAGTGGGG TATTGATGAT GTCCCGGTGA TTGTTCAGGA TAAAAAATTT AACGCTGACG GGCAGATTGA TTATCAACTG GATGTGATGA CCGCCGCCGT GGGCTGGTTT GGTGATACGT TGCTGACCAA CGGCGCTATC TACCCGCAAC ACGCTGCCCC GCGTGGCTGG CTTCGCCTGC GTTTGCTCAA TGGCTGTAAT GCCCGCTCGC TTAATTTCGC CACCAGCGAC AATCGCCCGC TGTATGTGAT TGCCAGCGAC GGTGGTCTGC TACCTGAACC GGTGAAGGTG AACGAGCTGC CAGTGCTGAT GGGCGAGCGT TTTGAAGTGC TGGTGGAAGT TAACGACAAC AAACCCTTTG ACCTGGTGAC GCTGCCGGTC AGCCAGATGG GGATGGCGAT TGCGCCGTTT GACAAGCCGC ATCCGGTAAT GCGGATTCAG CCGATTGCCA TTAGTGCCTC TGGTGCTTTG CCAGACACAT TAAGTAGCCT GCCTGCGTTA CCTTCGCTGG AAGGGCTGAC GGTACGCAAG CTGCAACTTT CTATGGACCC AATGCTCGAT ATGATGGGGA TGCAGATGCT GATGGAGAAA TATGGCGATC AGGCGATGGC CGGGATGGAT CACAGCCAGA TGATGGGCCA TATGGGGCAC GGCAATATGA ATCATATGAA CCACGGCGGG AAGTTCGATT TCCACCATGC CAATAAAATC AACGGTCAGG CGTTTGATAT GAACAAGCCG ATGTTTGCGG CGGCGAAAGG GCAATACGAA CGTTGGGTTA TCTCTGGCGT GGGCGACATG ATGCTGCATC CGTTCCATAT CCACGGTACG CAGTTCCGTA TCTTGTCAGA AAATGGCAAA CCGCCAGCGA CTCATCGCGC AGGCTGGAAA GATACCGTTA AGGTAGAAGG TAATGTCAGC GAAGTGCTGG TGAAGTTTAA TCACGATGCA CCGAAAGAAC ATGCTTATAT GGCGCACTGC CATCTGCTGG AGCATGAAGA TACGGGGATG ATGTTAGGGT TTACGGTATA A
|
Protein sequence | MQRRDFLKYS VALGVASALP LWSRAVFAAE RPTLPIPDLL TTDARNRIQL TIGAGQSTFG GKTATTWGYN GNLLGPAVKL QRGKAVTVDI YNQLTEETTL HWHGLEVPGE VDGGPQGIIP PGGKRSVTLN VDQPAATCWF HPHQHGKTGR QVAMGLAGLV VIEDDEILKL MLPKQWGIDD VPVIVQDKKF NADGQIDYQL DVMTAAVGWF GDTLLTNGAI YPQHAAPRGW LRLRLLNGCN ARSLNFATSD NRPLYVIASD GGLLPEPVKV NELPVLMGER FEVLVEVNDN KPFDLVTLPV SQMGMAIAPF DKPHPVMRIQ PIAISASGAL PDTLSSLPAL PSLEGLTVRK LQLSMDPMLD MMGMQMLMEK YGDQAMAGMD HSQMMGHMGH GNMNHMNHGG KFDFHHANKI NGQAFDMNKP MFAAAKGQYE RWVISGVGDM MLHPFHIHGT QFRILSENGK PPATHRAGWK DTVKVEGNVS EVLVKFNHDA PKEHAYMAHC HLLEHEDTGM MLGFTV
|
| |