Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3595 |
Symbol | |
ID | 6970845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3309981 |
End bp | 3310976 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387392 |
Product | sugar binding transcriptional regulator, LacI family |
Protein accession | YP_002271851 |
Protein GI | 209396088 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00083156 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTTCAT TAAAGGATGT CGCACGCCTG GCGGGAGTGT CGATGATGAC AGTCTCCCGG GTGATGCATA ATGCAGAATC TGTGCGTCCT GCAACGCGTG ACCGCGTATT GCAGGCAATC CAGACCCTGA ATTATGTTCC TGATCTTTCC GCCCGTAAGA TGCGCGCTCA AGGACGTAAG CCGTCGACTC TCGCCGTGCT GGCGCAGGAC ACTGCTACCA CTCCTTTCTC TGTTGATATT CTGCTTGCCA TTGAGCAAAC CGCCAGCGAG TTCGGCTGGA ATAGTTTTTT AATCAATATT TTTTCTGAAG ATGACGCTGC CCGCGCGGCA CGTCAGCTGC TTGCCCACCG TCCGGATGGC ATTATCTATA CTACAATGGG GCTGCGACAT ATCACGCTAC CTGAGTCTCT GTATGGTGAA AATATTGTAT TGGCGAACTG TGTGGCGGAT GACCCAGCGT TACCCAGTTA TATCCCTGAT GATTACACTG CACAATATGA ATCAACACAG CATTTGCTCG CGGCGGGCTA TCGTCAACCG TTATGCTTCT GGCTACCGGA AAGTGCGTTG GCAACAGGGT ATCGTCGGCA GGGATTTGAG CAGGCCTGGC GTGATGCTGG ACGAGATCTG GCTGAGGTGA AACAATTTCA CATGGCAACA GGTGATGATC ACTACACCGA TCTCGCAAGT TTACTCAATG ACCACTTCAA ATCGGGCAAA CCAGATTTTG ATGTTCTGAT ATGTGGTAAC GATCGCGCAG CCTTTGTGGC TTATCAGGTT CTCCTGGCTA AGGGGGTACG TATCCCGCAG GATGTCGCCG TAATGGGCTT TGATAATCTG GTTGGCGTCG GGCATCTGTT TTTACCGCCG CTGACCACAA TTCAGCTTCC ACATGACATT ATCGGGCGGG AAGCTGCATT GCATATTATT GAAGGTCGTG AAGGGGGAAG TGTGACGCGG ATCCCTTGCC CGCTGTTGAT CCGTTGTTCC ACCTGA
|
Protein sequence | MASLKDVARL AGVSMMTVSR VMHNAESVRP ATRDRVLQAI QTLNYVPDLS ARKMRAQGRK PSTLAVLAQD TATTPFSVDI LLAIEQTASE FGWNSFLINI FSEDDAARAA RQLLAHRPDG IIYTTMGLRH ITLPESLYGE NIVLANCVAD DPALPSYIPD DYTAQYESTQ HLLAAGYRQP LCFWLPESAL ATGYRRQGFE QAWRDAGRDL AEVKQFHMAT GDDHYTDLAS LLNDHFKSGK PDFDVLICGN DRAAFVAYQV LLAKGVRIPQ DVAVMGFDNL VGVGHLFLPP LTTIQLPHDI IGREAALHII EGREGGSVTR IPCPLLIRCS T
|
| |