Gene EcHS_A4374 details


Gene Information

Locus tag: EcHS_A4374
Symbol: cadC
ID: 5590982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4384419
End bp: 4385957
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 42%
IMG OID: 640923472
Product: DNA-binding transcriptional activator CadC
Protein accession: YP_001460917
Protein GI: 157163599
COG category: [K] Transcription
COG ID: [COG3710] DNA-binding winged-HTH domains
TIGRFAM ID:
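
The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4384419
end_bp = 4385957
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
coords_consistent = span == gene_length_bp

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = gene_length_bp // 3
protein_consistent = gene_length_bp % 3 == 0 and codons - 1 == protein_length_aa

print(coords_consistent, protein_consistent)
```

Under these assumptions both checks pass, which suggests the listed coordinates, gene length, and protein length agree with one another.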


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.00319503
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT 
AGCCGCAATG GGCGTCAACT TACCCTTGAG CCGAGATTAA TCGATCTTCT GGTTTTCTTT
GCTCAACACA GTGGCGAAGT ACTTAGCAGG GATGAACTTA TCGATAATGT CTGGAAGAGA
AGTATTGTCA CCAATCATGT TGTGACGCAG AGTATCTCAG AACTACGTAA GTCATTAAAA
GATAATGATG AAGATAGTCC TGTCTATATC GCTACTGTAC CAAAGCGCGG CTATAAATTA
ATGGTGCCGG TTATCTGGTA CAGCGAAGAA GAGGGAGAGG AAATAATGCT ATCTTCGCCT
CCCCCTATAC CAGAGGCGGT TCCTGCCACA GATTCTCCCT CCCACAGTCT TAACATTCAA
AACACCGCAA CGCCACCTGA ACAATCCCCA ATTAAAAGCA AACGATTCAC TACCTTTTGG
GTATGGTTTT TTTTCCTGTT GTCGTTAGGT ATCTGTGTAG CACTGGTAGC GTTTTCAACT
CTTGATACAC GTCTTCCTAT GAGCAAATCG CGTATTTTGC TCAATCCACG CGATATTGAC
ATTAATATGG TAAATAAAAG TTGTAACAGC TGGAGTTCCC CGTATCAGCT CTCTTACGCG
ATAGGCGTGG GTAATTTGGT GGCGACATCA CTTAACACCT TCTCCACCTT TATGGTGCAT
GACAAAATCA ACTACAACAT TGATGAACCG AGCAGTTCCG GTAAAACATT ATCTATTGCC
TTTGTTAATC AGCGCCAATA CCGTGCTCAA CAATGCTTTA TGTCGATAAA ATTGGTAGAC
AATGCAGATG GTTCAACCAT GCTGGATAAA CGTTATGTCA TCACTAACGG TAATCAGCTG
GCGATTCAAA ATGATTTACT GGAGAGTTTA TCAAAAGCGT TAAACCAACC GTGGCCACAA
CGAATGCAGG AGACGCTCCA GCAAATATTG CCTCATCGTG GTGCGTTATT AACTAATTTT
TATCAGGCAC ATGATTATTT ACTGCATGGC GATGATAAAT CATTGAATCG TGCCAGTGAA
TTATTAGGTG AGATTGTTCA ATCATCCCCA GAATTTACCT ACGCGAGAGC AGAAAAAGCA
TTAGTTGATA TCGTGCGCCA TTCTCAACAT CCTTTAGATG AAAAACAATT AGCAGCACTG
AACACAGAAA TAGACAACAT TGTTACACTG CCGGAATTGA ATAATCTGTC CATTATATAT
CAAATAAAAG CGGTCAGTGC TCTGGTAAAA GGTAAAACAG ATGAGTCTTA CCAGGCGATA
AATACTGGCA TTGATCTTGA AATGTCCTGG CTAAATTATG TGTTGCTTGG CAAGGTTTAT
GAAATGAAGG GGATGAACCG GGAAGCGGCT GATGCATATC TCACCGCCTT TAATTTACGC
CCAGGGGCAA ACACCCTTTA CTGGATTGAA AATGGTATAT TCCAGACTTC TGTTCCTTAT
GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA
 
Protein sequence
MQQPVVRVGE WLVTPSINQI SRNGRQLTLE PRLIDLLVFF AQHSGEVLSR DELIDNVWKR 
SIVTNHVVTQ SISELRKSLK DNDEDSPVYI ATVPKRGYKL MVPVIWYSEE EGEEIMLSSP
PPIPEAVPAT DSPSHSLNIQ NTATPPEQSP IKSKRFTTFW VWFFFLLSLG ICVALVAFST
LDTRLPMSKS RILLNPRDID INMVNKSCNS WSSPYQLSYA IGVGNLVATS LNTFSTFMVH
DKINYNIDEP SSSGKTLSIA FVNQRQYRAQ QCFMSIKLVD NADGSTMLDK RYVITNGNQL
AIQNDLLESL SKALNQPWPQ RMQETLQQIL PHRGALLTNF YQAHDYLLHG DDKSLNRASE
LLGEIVQSSP EFTYARAEKA LVDIVRHSQH PLDEKQLAAL NTEIDNIVTL PELNNLSIIY
QIKAVSALVK GKTDESYQAI NTGIDLEMSW LNYVLLGKVY EMKGMNREAA DAYLTAFNLR
PGANTLYWIE NGIFQTSVPY VVPYLDKFLA SE
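
One way to check that the protein sequence corresponds to the gene sequence listed above is to translate the DNA codon by codon. The following is a minimal sketch, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for all codons other than alternative start codons (not relevant here, since the gene starts with ATG). The `translate` helper and the variable names are hypothetical, and only the first 30 bases of the gene are included to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and first 10 residues of the protein sequence.
gene_prefix = "ATGCAACAAC CTGTAGTTCG CGTTGGCGAA"
protein_prefix = "MQQPVVRVGE"
print(translate(gene_prefix) == protein_prefix)
```

To check the whole record, the full gene sequence can be passed to `translate` and the result compared with the full protein sequence after removing the spaces between the 10-residue blocks.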