Gene EcolC_3416 details


Gene Information

Locus tag: EcolC_3416
Symbol:
ID: 6067764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3731891
End bp: 3733708
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 52%
IMG OID: 641602828
Product: CopA family copper resistance protein
Protein accession: YP_001726360
Protein GI: 170021406
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01480] copper-resistance protein, CopA family
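
The coordinate and length fields above are arithmetically related. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3731891, 3733708)
print(length, protein_length_aa(length))  # 1818 605
```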


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTTGA AAACGTCTCG ACGAACTTTC CTGAAGGGGT TAACCCTCTC TGGCGTAGCC 
GGAAGTCTTG GCGTATGGAG TTTCAATGCG CGTTCCAGTC TGAGCCTGCC AGTTGCCGCA
TCCCTGCAGG GTACTCAGTT TGACCTGACC ATTGGTGAAA CGGCCGTCAA TATCACGGGC
AGTGAGCGTC AGGCCAAAAC AATCAATGGA GGCCTGCCGG GGCCCGTTCT TCGCTGGAAA
GAAGGTGACA CCATTACCCT GAAGGTCAAA AACCGTCTTA ATGAACAGAC GTCCATTCAC
TGGCACGGCA TTATTCTTCC GGCCAATATG GATGGTGTTC CGGGGCTGAG TTTTATGGGC
ATAGAGCCTG ATGATACCTA CGTTTACACC TTTAAGGTTA AGCAGAACGG GACTTACTGG
TACCACAGCC ATTCCGGTCT GCAGGAACAG GAGGGGGTAT ACGGTGCCAT TATCATCGAT
GCCAGGGAGC CAGAACCGTT TGCTTACGAT CGTGAGCATG TGGTCATGTT GTCTGACTGG
ACCGATGAAA ATCCTCACAG CCTGCTGAAA AAATTAAAAA AACAGTCGGA TTACTACAAT
TTCAATAAAC CAACCGTTGG CTCTTTTTTC CGCGACGTGA ATACCAGGGG GCTGTCAGCC
ACCATTGCCG ATCGGAAAAT GTGGGCTGAA ATGAAAATGA ATCCGACTGA CCTCGCGGAT
GTCAGTGGCT ACACCTACAC CTATCTCATG AACGGGCAGG CCCCGCTGAA AAACTGGACC
GGACTGTTCC GTCCCGGTGA AAAGATACGC TTACGGTTTA TCAACGGCTC GGCAATGACC
TATTTCGATA TCCGTATCCC CGGGCTGAAA ATGACGGTCG TGGCTGCAGA TGGCCAGTAT
GTAAACCCGG TTACCGTTGA CGAATTCAGG ATTGCCGTTG CCGAAACCTA TGATGTCATT
GTGGAGCCTC AGGGTGAGGC CTATACCATC TTCGCACAAT CCATGGACAG GACCGGTTAC
GCTCGAGGGA CACTGGCCAC GAGAGAGGGG TTAAGTGCTG CCGTTCCCCC CCTCGATCCC
CGTCCTCTGT TGACCATGGA AGATATGGGT ATGGGGGGAA TGGGACATGA TATGGCAGGA
ATGGACCACA GCCAGATGGG AGGCATGGAT AACAGCGGAG AGATGATGTC TATGGACGGT
GCTGACCTTC CGGATAGCGG GACATCCTCC GCGCCCATGG ATCACAGCAG CATGGCCGGT
ATGGATCATT CCCGGATGGC CGGAATGCCG GGTATGCAAA GTCATCCTGC GTCAGAAACG
GATAACCCAC TGGTTGATAT GCAGGCGATG AGCGTCTCTC CGAAATTAAA TGATCCGGGT
ATTGGTCTTC GAAATAACGG AAGAAAGGTT CTCACGTACG CGGATTTGAA AAGCCGCTTT
GAGGATCCTG ACGGACGTGA ACCTGGCCGT ACCATAGAAC TGCATTTAAC CGGCCACATG
GAAAAGTTTG CCTGGTCATT TAACGGAATC AAGTTTTCAG ATGCCGCACC GGTGCTGCTG
AAATACGGTG AGCGGCTCAG GATCACGCTG ATCAACGATA CCATGATGAC TCACCCCATT
CACCTGCATG GTATGTGGAG CGATCTGGAA GATGAAAACG GTAATTTCAT GGTTCGTAAA
CACACAATAG ATGTTCCCCC TGGTACAAAA CGCAGTTACA GAGTGACAGC AGATGCGCTT
GGCCGCTGGG CGTATCACTG CCATTTGCTC TATCACATGG AAATGGGAAT GTTTCGTGAA
GTCCGGGTGG AGGAATGA
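
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any per-base statistic is computed. The sketch below shows one way to do that and to compute a GC percentage that could be compared with the GC content field of this record. The helper names are hypothetical, and how the database rounds its reported value is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence above: 4 of 10 bases are G or C.
print(gc_percent("ATGCTGTTGA"))  # 40.0
```

Passing the full gene sequence, pasted exactly as it appears above, gives the figure for the whole gene.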
 
Protein sequence
MLLKTSRRTF LKGLTLSGVA GSLGVWSFNA RSSLSLPVAA SLQGTQFDLT IGETAVNITG 
SERQAKTING GLPGPVLRWK EGDTITLKVK NRLNEQTSIH WHGIILPANM DGVPGLSFMG
IEPDDTYVYT FKVKQNGTYW YHSHSGLQEQ EGVYGAIIID AREPEPFAYD REHVVMLSDW
TDENPHSLLK KLKKQSDYYN FNKPTVGSFF RDVNTRGLSA TIADRKMWAE MKMNPTDLAD
VSGYTYTYLM NGQAPLKNWT GLFRPGEKIR LRFINGSAMT YFDIRIPGLK MTVVAADGQY
VNPVTVDEFR IAVAETYDVI VEPQGEAYTI FAQSMDRTGY ARGTLATREG LSAAVPPLDP
RPLLTMEDMG MGGMGHDMAG MDHSQMGGMD NSGEMMSMDG ADLPDSGTSS APMDHSSMAG
MDHSRMAGMP GMQSHPASET DNPLVDMQAM SVSPKLNDPG IGLRNNGRKV LTYADLKSRF
EDPDGREPGR TIELHLTGHM EKFAWSFNGI KFSDAAPVLL KYGERLRITL INDTMMTHPI
HLHGMWSDLE DENGNFMVRK HTIDVPPGTK RSYRVTADAL GRWAYHCHLL YHMEMGMFRE
VRVEE
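
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes the standard codon assignments, which translation table 11 shares for elongation; table 11 differs in which codons may act as start codons, and that is not handled here (this gene starts with ATG, so it does not matter for this record). The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not supported.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTGTTGA AAACGTCTCG ACGAACTTTC"))  # MLLKTSRRTF
# Last 18 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GTCCGGGTGG AGGAATGA"))  # VRVEE
```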