Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4602 |
Symbol | cadC |
ID | 6145699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4705363 |
End bp | 4706901 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641619418 |
Product | DNA-binding transcriptional activator CadC |
Protein accession | YP_001746530 |
Protein GI | 170681004 |
COG category | [K] Transcription |
COG ID | [COG3710] DNA-binding winged-HTH domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00143405 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.967142 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT AGCCGCAATG GGCGTCAACT TACCCTTGAG CCGAGATTAA TCGATCTTCT GGTTTTCTTT GCTCAACACA GTGGCGAAGT ACTTAGCAGG GATGAACTTA TCGATAATGT CTGGAAGAGA AGTATTGTCA CCAATCACGT TGTGACGCAG AGTATCTCAG AACTACGTAA GTCATTAAAA GATAATGATG AAGATAGTCC TGTCTATATC GCTACTGTAC CAAAGCGCGG CTATAAATTA ATGGTGCCGG TTATCTGGTA CAGCGAAGAA GAGGGAGAGG AAATAATGCT ATCTTCGCCT CCCCCTATAC CAGAGGCGGT TCCTGCCACA GATTCTCCCT CCCACAGTCT TAACATTCAA AACACCACAG CGCCACCTGA ACAATCCCCA GTTAAAAGCA AACGATTCAC TACCTTTTGG GTATGGTTTT TTTTCCTGTT GTCGTTAGGT ATCTGTGTCG CACTGGTAGC GTTTTCAAGT CTTGAAACAC GTCTTCCTAT GAGCAAATCG CGTATTTTGC TCAATCCACG CGATATTGAC ATTAATATGG TTAATAAGAG TTGTAACAGC TGGAGTTCCC CGTATCAGCT CTCTTACGCG ATAGGCGTGG GTGATTTGGT GGCGACATCA CTTAACACCT TCTCCACCTT TATGGTGCAT GACAAAATCA ACTACAACAT TGATGAACCG AGCAGTTCCG GTAAAACATT ATCTATTGCG TTTGTTAATC AGCGCCAATA CCGTGCTCAA CAATGCTTTA TGTCGGTAAA ATTGGTAGAC AATGCAGATG GTTCAACCAT GCTGGATAAA CGTTATGTCA TCACTAACGG TAATCAGCTG GCGATTCAAA ATGATTTACT CCAGAGTTTA TCAAAAGCGT TAAACCAACC GTGGCCACAA CGAATGCAGG AGACGCTCCA GCAAATTTTG CCGCATCGTG GTGCGTTATT AACTAATTTT TATCAGGCTC ATGATTATTT ACTGCATGGC GATGATAAAT CATTGAATCG TGCCAGTGAA TTATTAGGTG AGATTGTTCA ATCATCCCCA GAATTTACCT ACGCGAGAGC AGAAAAAGCA TTAGTTGATA TCGTGCGCCA TTCTCAACAT CCTTTAGATG AAAAACAATT AGCAGCACTG AATACAGAAA TAGATAACAT TGTTACACTG CCGGAATTGA ATAACCTGTC CATTATATAT CAAATAAAAG CGGTCAGTGC TCTGGTAAAA GGTAAAACAG ATGAGTCTTA CCAGGCGATA AATACTGGCA TTGATCTTGA AATGTCCTGG CTAAATTATG TGTTGCTTGG CAAGGTTTAT GAAATGAAGG GGATGAACCG GGAAGCAGCT GATGCATATC TCACCGCCTT TAATTTACGC CCAGGGGCAA ACACCCTTTA CTGGATTGAA AATGGTATAT TCCAGACTTC TGTTCCTTAT GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA
|
Protein sequence | MQQPVVRVGE WLVTPSINQI SRNGRQLTLE PRLIDLLVFF AQHSGEVLSR DELIDNVWKR SIVTNHVVTQ SISELRKSLK DNDEDSPVYI ATVPKRGYKL MVPVIWYSEE EGEEIMLSSP PPIPEAVPAT DSPSHSLNIQ NTTAPPEQSP VKSKRFTTFW VWFFFLLSLG ICVALVAFSS LETRLPMSKS RILLNPRDID INMVNKSCNS WSSPYQLSYA IGVGDLVATS LNTFSTFMVH DKINYNIDEP SSSGKTLSIA FVNQRQYRAQ QCFMSVKLVD NADGSTMLDK RYVITNGNQL AIQNDLLQSL SKALNQPWPQ RMQETLQQIL PHRGALLTNF YQAHDYLLHG DDKSLNRASE LLGEIVQSSP EFTYARAEKA LVDIVRHSQH PLDEKQLAAL NTEIDNIVTL PELNNLSIIY QIKAVSALVK GKTDESYQAI NTGIDLEMSW LNYVLLGKVY EMKGMNREAA DAYLTAFNLR PGANTLYWIE NGIFQTSVPY VVPYLDKFLA SE
|
| |