Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0069 |
Symbol | araC |
ID | 6971804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 75163 |
End bp | 76092 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384149 |
Product | DNA-binding transcriptional regulator AraC |
Protein accession | YP_002268672 |
Protein GI | 209395726 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATATG GACAATTGGT TTCTTCTCTG AATGGCGGGA GTATGAAAAG TATGGCTGAA GCGCAAAATG ATCCCCTGCT GCCGGGATAC TCGTTTAACG CCCATCTGGT GGCGGGTTTA ACGCCGATTG AGGCCAATGG TTATCTCGAT TTTTTTATCG ACCGACCGCT GGGAATGAAA GGTTATATTC TCAATCTCAC CATTCGCGGT CAGGGGGTGG TGAAAAATCA GGGACGAGAA TTTGTCTGCC GACCGGGTGA TATTTTGCTG TTCCCGCCAG GAGAGATTCA TCACTACGGT CGTCATCCGG AGGCTCGCGA ATGGTATCAC CAGTGGGTTT ACTTTCGTCC GCGCGCCTAC TGGCATGAAT GGCTTAACTG GCCGTCAATA TTTGCCAATA CGGGTTTCTT TCGCCCGGAT GAAGCGCACC AGCCGCATTT CAGCGACCTG TTTGGGCAAA TCATTAACGC CGGGCAAGGG GAAGGGCGCT ATTCGGAGCT GCTGGCGATA AATCTGCTTG AGCAATTGTT ACTGCGGCGC ATGGAAGCGA TTAACGAGTC GCTCCATCCA CCGATGGATA ATCGGGTACG CGAGGCTTGT CAGTACATCA GCGATCACCT GGCAGACAGC AATTTTGATA TCGCCAGCGT CGCACAGCAT GTTTGCCTGT CGCCGTCGCG TCTGTCACAT CTTTTCCGCC AGCAGTTAGG GATTAGCGTC TTAAGCTGGC GCGAGGACCA ACGCATCAGC CAGGCGAAGC TGCTTTTGAG CACTACCCGG ATGCCTATCG CCACCGTCGG TCGCAATGTT GGTTTTGACG ATCAACTCTA TTTCTCGCGA GTATTTAAAA AATGCACCGG GGCCAGCCCG AGCGAGTTCC GTGCCGGTTG TGAAGAAAAA GTGAATGATG TAGCCGTCAA GTTGTCATAA
|
Protein sequence | MQYGQLVSSL NGGSMKSMAE AQNDPLLPGY SFNAHLVAGL TPIEANGYLD FFIDRPLGMK GYILNLTIRG QGVVKNQGRE FVCRPGDILL FPPGEIHHYG RHPEAREWYH QWVYFRPRAY WHEWLNWPSI FANTGFFRPD EAHQPHFSDL FGQIINAGQG EGRYSELLAI NLLEQLLLRR MEAINESLHP PMDNRVREAC QYISDHLADS NFDIASVAQH VCLSPSRLSH LFRQQLGISV LSWREDQRIS QAKLLLSTTR MPIATVGRNV GFDDQLYFSR VFKKCTGASP SEFRAGCEEK VNDVAVKLS
|
| |