Gene EcolC_3536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3536 
Symbol 
ID6065411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3860950 
End bp3862500 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content55% 
IMG OID641602953 
Productmulticopper oxidase 
Protein accessionYP_001726477 
Protein GI170021523 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.013115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGTC GTGATTTCTT AAAATATTCC GTCGCGCTGG GTGTGGCTTC GGCTTTGCCG 
CTGTGGAGCC GCGCAGTATT TGCGGCAGAA CGCCCAACGT TACCGATCCC TGATTTGCTC
ACGACCGATG CCCGTAATCG CATTCAGTTA ACTATTGGCG CAGGCCAGTC CACCTTTGGC
GGGAAAACTG CAACTACCTG GGGCTATAAC GGCAATCTGC TGGGGCCGGC GGTGAAATTA
CAGCGCGGCA AAGCGGTAAC GGTTGATATC TACAACCAAC TGACGGAAGA GACAACGTTG
CACTGGCACG GGCTGGAAGT ACCGGGTGAA GTCGACGGCG GCCCGCAGGG AATTATTCCG
CCAGGTGGCA AGCGCTCGGT GACGTTGAAC GTTGATCAAC CTGCCGCTAC CTGCTGGTTC
CATCCGCATC AGCACGGCAA AACCGGGCGA CAGGTGGCGA TGGGGCTGGC TGGGCTGGTG
GTGATTGAAG ATGACGAGAT CCTGAAATTA ATGCTGCCAA AACAGTGGGG TATCGATGAT
GTTCCGGTGA TCGTTCAGGA TAAGAAATTT AGCGCCGACG GGCAGATTGA TTATCAACTG
GATGTGATGA CCGCCGCCGT GGGCTGGTTT GGCGATACGT TGCTGACCAA CGGTGCAATC
TACCCGCAAC ACGCTGCCCC GCGTGGTTGG CTGCGCCTGC GTTTGCTCAA TGGCTGTAAT
GCCCGTTCGC TCAATTTCGC CACCAGCGAC AATCGCCCGC TGTATGTGAT TGCCAGCGAC
GGTGGTCTGC TACCTGAACC AGTGAAGGTG AGCGAACTGC CGGTGCTGAT GGGCGAGCGT
TTTGAAGTGC TGGTGGAGGT TAACGATAAC AAACCCTTTG ACCTGGTGAC GCTGCCGGTC
AGCCAGATGG GGATGGCGAT TGCGCCGTTT GATAAGCCTC ATCCGGTAAT GCGGATTCAG
CCGATTGCTA TTAGTGCCTC CGGTGCTTTG CCAGACACAT TAAGTAGCCT GCCTGCGTTA
CCTTCGCTGG AAGGGCTGAC GGTACGCAAG CTGCAACTCT CTATGGACCC GATGCTCGAT
ATGATGGGGA TGCAGATGCT AATGGAGAAA TATGGCGATC AGGCGATGGC CGGGATGGAT
CACAGCCAGA TGATGGGCCA TATGGGGCAC GGCAATATGA ATCATATGAA CCACGGCGGG
AAGTTCGATT TCCACCATGC CAACAAAATC AACGGTCAGG CGTTTGATAT GAACAAGCCG
ATGTTTGCGG CGGCGAAAGG GCAATACGAA CGTTGGGTTA TCTCTGGCGT GGGCGACATG
ATGCTGCATC CGTTCCATAT CCACGGCACG CAGTTCCGTA TCTTGTCAGA AAATGGCAAA
CCGCCAGCGG CTCATCGCGC GGGCTGGAAA GATACCGTTA AGGTAGAAGG TAATGTCAGC
GAAGTGCTGG TGAAGTTTAA TCACGATGCA CCGAAAGAAC ATGCTTATAT GGCGCACTGC
CATCTGCTGG AGCATGAAGA TACGGGGATG ATGTTAGGGT TTACGGTATA A
 
Protein sequence
MQRRDFLKYS VALGVASALP LWSRAVFAAE RPTLPIPDLL TTDARNRIQL TIGAGQSTFG 
GKTATTWGYN GNLLGPAVKL QRGKAVTVDI YNQLTEETTL HWHGLEVPGE VDGGPQGIIP
PGGKRSVTLN VDQPAATCWF HPHQHGKTGR QVAMGLAGLV VIEDDEILKL MLPKQWGIDD
VPVIVQDKKF SADGQIDYQL DVMTAAVGWF GDTLLTNGAI YPQHAAPRGW LRLRLLNGCN
ARSLNFATSD NRPLYVIASD GGLLPEPVKV SELPVLMGER FEVLVEVNDN KPFDLVTLPV
SQMGMAIAPF DKPHPVMRIQ PIAISASGAL PDTLSSLPAL PSLEGLTVRK LQLSMDPMLD
MMGMQMLMEK YGDQAMAGMD HSQMMGHMGH GNMNHMNHGG KFDFHHANKI NGQAFDMNKP
MFAAAKGQYE RWVISGVGDM MLHPFHIHGT QFRILSENGK PPAAHRAGWK DTVKVEGNVS
EVLVKFNHDA PKEHAYMAHC HLLEHEDTGM MLGFTV