Gene ECH74115_5649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5649 
SymbolcadC 
ID6968639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5290047 
End bp5291585 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content42% 
IMG OID643389283 
ProductDNA-binding transcriptional activator CadC 
Protein accessionYP_002273680 
Protein GI209398414 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000157695 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.551738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT 
AGCCGCAATG GGCGTCAACT TACCCTTGAG CCGAGATTAA TCGACCTTCT GGTTTTCTTT
GCTCAACACA GTGGCGAAGT ACTTAGCAGG GATGAACTTA TCGATAATGT CTGGAAGAGA
AGTATTGTCA CCAATCACGT TGTGACGCAG AGTATCTCAG AACTACGTAA GTCATTAAAA
GATAATGATG AAGATAGTCC TGTCTATATC GCTACTGTAC CAAAGCGCGG CTATAAATTA
ATGGTGCCGG TTATCTGGTA CAGCGAAGAA GAGGGAGAGG AAATAATGCT ATCTTCGCCT
CCCCCTATAC CAGAGGCGGT TCCTGCCACA GATTCTCCCT CCCACAGTCT TAACATTCAA
AACACCGCAA CGCCACCTGA ACAATCCCCA GTTAAAAGCA AACGATTCAC TACCTTTTGG
GTATGGTTTT TTTTCCTGTT GTCCTTAGGT ATCTGTGTAG CACTGGTAGC GTTTTCAAGT
CTTGATACAC GTCTTCCGAT GAGCAAATCG CGTATTTTGC TCAATCCACG CGATATTGAC
ATTAATATGG TAAATAAAAG TTGTAACAGC TGGAGTTCCC CGTATCAGCT CTCTTACGCG
ATAGGCGTGG GTGATTTGGT GGCGACATCA CTTAACACCT TCTCCACCTT TATGGTGCAT
GACAAAATCA ACTACAACAT TGATGAACCG AGCAGTTCCG GTAAAACATT ATCTATTGCG
TTTGTTAATC AGCGCCAATA CCGTGCTCAA CAATGCTTTA TGTCGATAAA ATTGGTAGAC
AATGCAGATG GTTCAACCAT GCTGGATAAA CGTTATGTCA TCACTAACGG TAATCAGCTG
GCGATTCAAA ATGATTTACT GGAGAGTTTA TCAAAAGCGT TAAACCAACC GTGGCCACAA
CGAATGCAGG AGACGCTCCA GCAAATTTTG CCTCATCGTG GTGCGTTATT AACTAATTTT
TATCAGGCAC ATGATTATTT ACTGCATGGC GATGATAAAT CATTGAACCG TGCCAGTGAA
TTATTAGGTG AGATTGTTCA ATCATCCCCA GAATTTACCT ACGCGAGAGC AGAAAAAGCA
TTAGTTGATA TCGTGCGCCA TTCTCAACAT CCTTTAGATG AAAAACAATT AGCAGCACTG
AACACAGAAA TAGATAACAT TGTTACACTA CCGGAATTGA ATAACCTGTC CATTATATAT
CAAATAAAAG CGGTCAGTGC TCTGGTAAAA GGTAAAACAG ATGAGTCTTA CCAGGCGATA
AATACTGGCA TTGATCTTGA AATGTCCTGG CTAAATTATG TATTGCTTGG CAAGGTTTAT
GAAATGAAGG GGATGAACCG GGAAGCGGCT GATGCATATC TCACCGCCTT TAATTTACGC
CCAGGGGCAA ACACCCTTTA CTGGATTGAA AATGGTATAT TCCAGACTTC TGTTCCTTAT
GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA
 
Protein sequence
MQQPVVRVGE WLVTPSINQI SRNGRQLTLE PRLIDLLVFF AQHSGEVLSR DELIDNVWKR 
SIVTNHVVTQ SISELRKSLK DNDEDSPVYI ATVPKRGYKL MVPVIWYSEE EGEEIMLSSP
PPIPEAVPAT DSPSHSLNIQ NTATPPEQSP VKSKRFTTFW VWFFFLLSLG ICVALVAFSS
LDTRLPMSKS RILLNPRDID INMVNKSCNS WSSPYQLSYA IGVGDLVATS LNTFSTFMVH
DKINYNIDEP SSSGKTLSIA FVNQRQYRAQ QCFMSIKLVD NADGSTMLDK RYVITNGNQL
AIQNDLLESL SKALNQPWPQ RMQETLQQIL PHRGALLTNF YQAHDYLLHG DDKSLNRASE
LLGEIVQSSP EFTYARAEKA LVDIVRHSQH PLDEKQLAAL NTEIDNIVTL PELNNLSIIY
QIKAVSALVK GKTDESYQAI NTGIDLEMSW LNYVLLGKVY EMKGMNREAA DAYLTAFNLR
PGANTLYWIE NGIFQTSVPY VVPYLDKFLA SE