Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1965 |
Symbol | |
ID | 6966570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1857793 |
End bp | 1858791 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643385891 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_002270380 |
Protein GI | 209396470 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.543817 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCTA CTATTTATGA TATTGCCAGG GTTGCAGGCG TATCAAAATC CACCGTATCA CGCGTGCTGA ATAAGCAAAC CAATATCTCC CCGGAAGCGC GCGAAAAAGT GTTACGGGCC ATTGAAGAAT TACAGTATCA ACCAAACAAG CTGGCCCGCG CGCTGACCTC TTCGGGTTTT GATGCCATTA TGGTGATTTC TACCCGTTCG ACCAAAACTA CGGCGGGTAA TCCGTTTTTC TCTGAAGTTT TGCATGCCAT CACCGCCAAA GCTGAAGAAG AAGGTTTCGA CGTGATATTG CAGACGTCGC ACACCCCGGC AGAAGACTTA CAAAAATGTG AAAGCAAAAT TAAGCAGAAA ATGATTAAAG GCATTATTAT GCTAAGTTCG CCAGCGGATG AGTCATTTTT TGCCCAACTT GATAAATACG ATATTCCTGT GGTGGTGATT GGCAAAGTTG AAGGTCAATA TGCCCATGTT TATTCTGTCG ATACCGATAA TTTTGGTGAC AGCATTGCGT TGACCGATGC GTTAATTGAA AGTGGACATC AAAATATTGC CTGTCTGCAT GCACCGCTTG ATGTCCATGT TTCAGTGGAT CGGGTAAATG GTTATAAGCA GAGTCTGGAA GATCATAATA TTGCAGTGCG TGATGAATGG ATTGTTGATG GCGGTTATAC CCATGAAACA GCACTGCAAG CAGCACGGCA ATTATTAAGT CAGTCACCGT TGCCTGAGGC AGTGTTTGCC ACTGACAGCC TGAAATTAAT GAGCATTTAT CGTGCGGCGG CAGAGAAAAA CATTGCTATT CCGCAGCAGT TAGCGGTGGT GGGTTATAGC AATGAAACGC TGTCATTTAT TTTAACGCCT GCACCGGGCG GCATCGATGT TCCGACGCAG GAGTTAGGTC GACAAAGCTG TGAGTTATTA TTCCGCTTAA TTGCCGGAAA ACCGTCACCA CAAAATATTA CCGTTGCCAC GCATATGACG CTGAAATAA
|
Protein sequence | MSPTIYDIAR VAGVSKSTVS RVLNKQTNIS PEAREKVLRA IEELQYQPNK LARALTSSGF DAIMVISTRS TKTTAGNPFF SEVLHAITAK AEEEGFDVIL QTSHTPAEDL QKCESKIKQK MIKGIIMLSS PADESFFAQL DKYDIPVVVI GKVEGQYAHV YSVDTDNFGD SIALTDALIE SGHQNIACLH APLDVHVSVD RVNGYKQSLE DHNIAVRDEW IVDGGYTHET ALQAARQLLS QSPLPEAVFA TDSLKLMSIY RAAAEKNIAI PQQLAVVGYS NETLSFILTP APGGIDVPTQ ELGRQSCELL FRLIAGKPSP QNITVATHMT LK
|
| |