Gene EcSMS35_4602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4602 
SymbolcadC 
ID6145699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4705363 
End bp4706901 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content42% 
IMG OID641619418 
ProductDNA-binding transcriptional activator CadC 
Protein accessionYP_001746530 
Protein GI170681004 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00143405 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.967142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT 
AGCCGCAATG GGCGTCAACT TACCCTTGAG CCGAGATTAA TCGATCTTCT GGTTTTCTTT
GCTCAACACA GTGGCGAAGT ACTTAGCAGG GATGAACTTA TCGATAATGT CTGGAAGAGA
AGTATTGTCA CCAATCACGT TGTGACGCAG AGTATCTCAG AACTACGTAA GTCATTAAAA
GATAATGATG AAGATAGTCC TGTCTATATC GCTACTGTAC CAAAGCGCGG CTATAAATTA
ATGGTGCCGG TTATCTGGTA CAGCGAAGAA GAGGGAGAGG AAATAATGCT ATCTTCGCCT
CCCCCTATAC CAGAGGCGGT TCCTGCCACA GATTCTCCCT CCCACAGTCT TAACATTCAA
AACACCACAG CGCCACCTGA ACAATCCCCA GTTAAAAGCA AACGATTCAC TACCTTTTGG
GTATGGTTTT TTTTCCTGTT GTCGTTAGGT ATCTGTGTCG CACTGGTAGC GTTTTCAAGT
CTTGAAACAC GTCTTCCTAT GAGCAAATCG CGTATTTTGC TCAATCCACG CGATATTGAC
ATTAATATGG TTAATAAGAG TTGTAACAGC TGGAGTTCCC CGTATCAGCT CTCTTACGCG
ATAGGCGTGG GTGATTTGGT GGCGACATCA CTTAACACCT TCTCCACCTT TATGGTGCAT
GACAAAATCA ACTACAACAT TGATGAACCG AGCAGTTCCG GTAAAACATT ATCTATTGCG
TTTGTTAATC AGCGCCAATA CCGTGCTCAA CAATGCTTTA TGTCGGTAAA ATTGGTAGAC
AATGCAGATG GTTCAACCAT GCTGGATAAA CGTTATGTCA TCACTAACGG TAATCAGCTG
GCGATTCAAA ATGATTTACT CCAGAGTTTA TCAAAAGCGT TAAACCAACC GTGGCCACAA
CGAATGCAGG AGACGCTCCA GCAAATTTTG CCGCATCGTG GTGCGTTATT AACTAATTTT
TATCAGGCTC ATGATTATTT ACTGCATGGC GATGATAAAT CATTGAATCG TGCCAGTGAA
TTATTAGGTG AGATTGTTCA ATCATCCCCA GAATTTACCT ACGCGAGAGC AGAAAAAGCA
TTAGTTGATA TCGTGCGCCA TTCTCAACAT CCTTTAGATG AAAAACAATT AGCAGCACTG
AATACAGAAA TAGATAACAT TGTTACACTG CCGGAATTGA ATAACCTGTC CATTATATAT
CAAATAAAAG CGGTCAGTGC TCTGGTAAAA GGTAAAACAG ATGAGTCTTA CCAGGCGATA
AATACTGGCA TTGATCTTGA AATGTCCTGG CTAAATTATG TGTTGCTTGG CAAGGTTTAT
GAAATGAAGG GGATGAACCG GGAAGCAGCT GATGCATATC TCACCGCCTT TAATTTACGC
CCAGGGGCAA ACACCCTTTA CTGGATTGAA AATGGTATAT TCCAGACTTC TGTTCCTTAT
GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA
 
Protein sequence
MQQPVVRVGE WLVTPSINQI SRNGRQLTLE PRLIDLLVFF AQHSGEVLSR DELIDNVWKR 
SIVTNHVVTQ SISELRKSLK DNDEDSPVYI ATVPKRGYKL MVPVIWYSEE EGEEIMLSSP
PPIPEAVPAT DSPSHSLNIQ NTTAPPEQSP VKSKRFTTFW VWFFFLLSLG ICVALVAFSS
LETRLPMSKS RILLNPRDID INMVNKSCNS WSSPYQLSYA IGVGDLVATS LNTFSTFMVH
DKINYNIDEP SSSGKTLSIA FVNQRQYRAQ QCFMSVKLVD NADGSTMLDK RYVITNGNQL
AIQNDLLQSL SKALNQPWPQ RMQETLQQIL PHRGALLTNF YQAHDYLLHG DDKSLNRASE
LLGEIVQSSP EFTYARAEKA LVDIVRHSQH PLDEKQLAAL NTEIDNIVTL PELNNLSIIY
QIKAVSALVK GKTDESYQAI NTGIDLEMSW LNYVLLGKVY EMKGMNREAA DAYLTAFNLR
PGANTLYWIE NGIFQTSVPY VVPYLDKFLA SE