Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2027 |
Symbol | |
ID | 6971539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1926247 |
End bp | 1927170 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385943 |
Product | transcriptional regulator, LysR family |
Protein accession | YP_002270432 |
Protein GI | 209395969 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.294769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0000000175537 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAAAA ATAGTCTGTT TAGTCAGCGC ATCCGTTTGC GCCACCTTCA TACATTCGTA GCTGTCGCAC AACAAGGAAC TTTGGGGCGC GCGGCTGAAA CCCTTAATTT GAGTCAACCT GCGCTCTCTA AGACATTAAA TGAACTGGAG CAGCTGACGG GCGCTCGCTT GTTTGAGCGT GGTCGTCAGG GGGCGCAACT TACCTTACCC GGCGAACAAT TTTTAACGCA TGCAGTCAGA GTACTTGACG CCATCAACAC TGCCGGACAG TCGCTTCATC GTAAAGAAGG TCCTAATAAT GATGTCGTCA GGGTTGGTGC ACTACCTACT GCGGCACTGG GGATATTACC TTCGGTTATA GGTCAGTTTC ATCAGCAACA AAAAGAGACG ACCTTGCAAG TTGCGACAAT GAGTAACCCT ATGATTCTGG CGGGTTTGAA AACCGGGGAA ATCGATATCG GCATTGGTCG GATGTCAGAT CCTGAACTGA TGACCGGGCT TAATTACGAA CTGCTGTTTC TTGAATCGCT GAAGCTGGTT GTCCGCCCTA ATCATCCGCT ACTTCAGGAG AACGTAACGC TAAGCCGGGT GCTGGAATGG CCGGTCGTTG TATCACCAGA AGGCACTGCG CCACGCCAGC ATTCAGATGC ATTAGTACAG AGCCAGGGAT GTAAAATTCC TTCGGGTTGT ATCGAAACGC TGTCTGCTTC GCTATCTCGT CAACTTACGG TTGAATACGA CTACGTGTGG TTTGTCCCTT CTGGCGCGGT AAAAGACGAC CTGCGTCATG CCACGCTGGT GGCCCTGCCT GTTCCGGGAC ATGGTGCAGG CGAACCGATT GGAATACTGA CCCGCGTAGA TGCGACGTTC TCTTCTGGTT GCCAGTTGAT GATTAACGCT ATTCGAAAAT CAATGCCGTT CTGA
|
Protein sequence | MEKNSLFSQR IRLRHLHTFV AVAQQGTLGR AAETLNLSQP ALSKTLNELE QLTGARLFER GRQGAQLTLP GEQFLTHAVR VLDAINTAGQ SLHRKEGPNN DVVRVGALPT AALGILPSVI GQFHQQQKET TLQVATMSNP MILAGLKTGE IDIGIGRMSD PELMTGLNYE LLFLESLKLV VRPNHPLLQE NVTLSRVLEW PVVVSPEGTA PRQHSDALVQ SQGCKIPSGC IETLSASLSR QLTVEYDYVW FVPSGAVKDD LRHATLVALP VPGHGAGEPI GILTRVDATF SSGCQLMINA IRKSMPF
|
| |