Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4374 |
Symbol | cadC |
ID | 5590982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4384419 |
End bp | 4385957 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640923472 |
Product | DNA-binding transcriptional activator CadC |
Protein accession | YP_001460917 |
Protein GI | 157163599 |
COG category | [K] Transcription |
COG ID | [COG3710] DNA-binding winged-HTH domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.00319503 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT AGCCGCAATG GGCGTCAACT TACCCTTGAG CCGAGATTAA TCGATCTTCT GGTTTTCTTT GCTCAACACA GTGGCGAAGT ACTTAGCAGG GATGAACTTA TCGATAATGT CTGGAAGAGA AGTATTGTCA CCAATCATGT TGTGACGCAG AGTATCTCAG AACTACGTAA GTCATTAAAA GATAATGATG AAGATAGTCC TGTCTATATC GCTACTGTAC CAAAGCGCGG CTATAAATTA ATGGTGCCGG TTATCTGGTA CAGCGAAGAA GAGGGAGAGG AAATAATGCT ATCTTCGCCT CCCCCTATAC CAGAGGCGGT TCCTGCCACA GATTCTCCCT CCCACAGTCT TAACATTCAA AACACCGCAA CGCCACCTGA ACAATCCCCA ATTAAAAGCA AACGATTCAC TACCTTTTGG GTATGGTTTT TTTTCCTGTT GTCGTTAGGT ATCTGTGTAG CACTGGTAGC GTTTTCAACT CTTGATACAC GTCTTCCTAT GAGCAAATCG CGTATTTTGC TCAATCCACG CGATATTGAC ATTAATATGG TAAATAAAAG TTGTAACAGC TGGAGTTCCC CGTATCAGCT CTCTTACGCG ATAGGCGTGG GTAATTTGGT GGCGACATCA CTTAACACCT TCTCCACCTT TATGGTGCAT GACAAAATCA ACTACAACAT TGATGAACCG AGCAGTTCCG GTAAAACATT ATCTATTGCC TTTGTTAATC AGCGCCAATA CCGTGCTCAA CAATGCTTTA TGTCGATAAA ATTGGTAGAC AATGCAGATG GTTCAACCAT GCTGGATAAA CGTTATGTCA TCACTAACGG TAATCAGCTG GCGATTCAAA ATGATTTACT GGAGAGTTTA TCAAAAGCGT TAAACCAACC GTGGCCACAA CGAATGCAGG AGACGCTCCA GCAAATATTG CCTCATCGTG GTGCGTTATT AACTAATTTT TATCAGGCAC ATGATTATTT ACTGCATGGC GATGATAAAT CATTGAATCG TGCCAGTGAA TTATTAGGTG AGATTGTTCA ATCATCCCCA GAATTTACCT ACGCGAGAGC AGAAAAAGCA TTAGTTGATA TCGTGCGCCA TTCTCAACAT CCTTTAGATG AAAAACAATT AGCAGCACTG AACACAGAAA TAGACAACAT TGTTACACTG CCGGAATTGA ATAATCTGTC CATTATATAT CAAATAAAAG CGGTCAGTGC TCTGGTAAAA GGTAAAACAG ATGAGTCTTA CCAGGCGATA AATACTGGCA TTGATCTTGA AATGTCCTGG CTAAATTATG TGTTGCTTGG CAAGGTTTAT GAAATGAAGG GGATGAACCG GGAAGCGGCT GATGCATATC TCACCGCCTT TAATTTACGC CCAGGGGCAA ACACCCTTTA CTGGATTGAA AATGGTATAT TCCAGACTTC TGTTCCTTAT GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA
|
Protein sequence | MQQPVVRVGE WLVTPSINQI SRNGRQLTLE PRLIDLLVFF AQHSGEVLSR DELIDNVWKR SIVTNHVVTQ SISELRKSLK DNDEDSPVYI ATVPKRGYKL MVPVIWYSEE EGEEIMLSSP PPIPEAVPAT DSPSHSLNIQ NTATPPEQSP IKSKRFTTFW VWFFFLLSLG ICVALVAFST LDTRLPMSKS RILLNPRDID INMVNKSCNS WSSPYQLSYA IGVGNLVATS LNTFSTFMVH DKINYNIDEP SSSGKTLSIA FVNQRQYRAQ QCFMSIKLVD NADGSTMLDK RYVITNGNQL AIQNDLLESL SKALNQPWPQ RMQETLQQIL PHRGALLTNF YQAHDYLLHG DDKSLNRASE LLGEIVQSSP EFTYARAEKA LVDIVRHSQH PLDEKQLAAL NTEIDNIVTL PELNNLSIIY QIKAVSALVK GKTDESYQAI NTGIDLEMSW LNYVLLGKVY EMKGMNREAA DAYLTAFNLR PGANTLYWIE NGIFQTSVPY VVPYLDKFLA SE
|
| |