Gene Information |
Locus tag | ECH74115_4320 |
Symbol | |
ID | 6966968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3997701 |
End bp | 3998600 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643388049 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002272487 |
Protein GI | 209396822 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
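The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The last codon is the stop codon, so it adds no amino acid.
    return gene_len, codons - 1


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(3997701, 3998600)
print(gene_len, protein_len)  # 900 299
```

Under these assumptions the result agrees with the listed Gene Length (900 bp) and Protein Length (299 aa).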
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTG AAGAGATTTG CCGCTTGCTG GCGGATAAAG TTAATAAACT GAAAAATAAA GAAAATAGTT TGTCAGAACT GTTGCCCGAT GTGCGTTTGT TGTATGGCGA GACGCCTTTC GCACGTACAC CGGTGATGTA CGAGCCTGGC ATCATAATTC TCTTTTCCGG ACATAAAATC GGTTATATCA ATGAACGCGT GTTTCGTTAT GATGCCAATG AATACCTGCT GCTGACGGTG CCGTTGCCGT TTGAGTGCGA AACCTATGCC ACGTCAGAGG TGCCGCTGGC AGGGTTGCGT CTCAATGTCG ATATTTTGCA GTTACAGGAA CTGTTGATGG ATATTGGCGA AGATGAGCAT TTCCAGCCGT CGATGGCAGC CAGCGGGATT AACTCCGCCA CGTTATCAGA AGAGATTTTA TGCGCGGCGG AGCGGTTACT CGACGTGATG GCGCGACCAC TGGATGCGCG TATTCTCGGC AAACAGATCA TCCGCGAAAT TCTGTACTAC GTGCTGACCG GACCTTGCGG CGGCGCGTTA CTGGCGCTGG TCAGTCGCCA GACTCACTTC AGTCTGATTA GCCGCGTGCT GAAACGGATT GAGAATAAAT ACACCGAAAA CCTGAGCGTC GAGCAACTGG CGGCAGAAGC TAATATGAGC GTATCGGCGT TCCACCATAA TTTTAAGTCT GTCACCAGCA CCTCGCCGTT GCAGTATTTG AAGAATTACC GTCTGCATAA GGCGCGGATG ATGATCATCC ATGACGGCAT GAAGGCCAGC GCAGCAGCGA TGCGCGTCGG CTATGAAAGC GCATCGCAAT TTAGCCGTGA GTTTAAACGT TACTTCGGTG TGACGCCGGG GGAAGATGCG GCAAGAATGC GGGCGATGCA GGGGAATTAA
|
Protein sequence | MKREEICRLL ADKVNKLKNK ENSLSELLPD VRLLYGETPF ARTPVMYEPG IIILFSGHKI GYINERVFRY DANEYLLLTV PLPFECETYA TSEVPLAGLR LNVDILQLQE LLMDIGEDEH FQPSMAASGI NSATLSEEIL CAAERLLDVM ARPLDARILG KQIIREILYY VLTGPCGGAL LALVSRQTHF SLISRVLKRI ENKYTENLSV EQLAAEANMS VSAFHHNFKS VTSTSPLQYL KNYRLHKARM MIIHDGMKAS AAAMRVGYES ASQFSREFKR YFGVTPGEDA ARMRAMQGN
|
| |
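The relationship between the gene and protein sequences above can be sketched in plain Python. This is a minimal illustration, not the database's own procedure: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built from the standard amino-acid assignment string, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch ignores). The gene sequence is shown starting with ATG and ending with TAA, so the sketch assumes it is already in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codon order TCAG x TCAG x TCAG, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 24 bases of the gene sequence above, with the record's spacing kept.
prefix = "ATGAAACGTG AAGAGATTTG CCGC"
print(translate(prefix))  # MKREEICR
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields of the record.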