Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3783 |
Symbol | |
ID | 6968087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3507481 |
End bp | 3508674 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387570 |
Product | ROK family protein |
Protein accession | YP_002272023 |
Protein GI | 209397633 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCCT GCATTAACAA TCAACAGATT CGCCACCATA ACAAATGCGT GATTCTGGAA CTGCTGTACC GGCAAAAGCG CGCCAATAAA TCAACGCTGG CCCGGCTGGC GCAAATTTCG ATTCCGGCAG TCAGTAATAT TTTGCAGGAA CTGGAAAGCG AAAAACGGGT GGTGAATATC GACGATGAAA GCCAGACGCG CGGGCATAGT AGCGGTACAT GGCTGATTGC GCCGGAAGGT GACTGGACGC TGTGTCTGAA CGTGACGCCC ACCAGTATTG AGTGTCAGGT TGCTAATGCT TGTTTAAGTC CGAAAGGTGA ATTTGAGTAT TTACAGATTG ATGCACCGAC GCCGCAGGCG CTGCTGTCCG AAATCGAAAA ATGCTGGCAT CGCCACCGTA AATTGTGGCC GGACCGCACC ATCAATCTGG CGCTGGCAAT CCACGGTCAG GTTGATCCAG TGACTGGAGT GTCGCAAACC ATGCCGCAAG CGCCGTGGAC AACGCCGGTT GAGGTGAAGT ATCTGCTGGA AGAGAAGCTG GGCATTCGGG TGATGGTCGA TAATGACTGC GTGATGCTGG CGCTGGCGGA AAAATGGCAA AATAATTCGC AGGAACGGGA TTTCTGCGTG ATCAACGTTG ATTACGGCAT TGGCTCGTCG TTCGTGATTA ACGAGCAAAT TTATCGCGGC AGTTTGTATG GTAGCGGACA GATTGGTCAC ACCATCGTTA ATCCGGATGG CGTCGTCTGC GACTGTGGAC GTTACGGCTG CCTGGAAACC GTCGCCTCGT TAAGTGCATT AAAAAAACAG GCGCGGGTAT GGCTAAAATC ACAACCGGTT AATACTCAGC TTGATCCTGA AAAACTGACT ACAGCGCAGT TAATCGCTGC CTGGCAAAGT GGAGAACCGT GGATCACCAG CTGGGTTGAT CGCTCTGCCA ATGCCATTGG TTTGAGTCTG TATAACTTCC TCAACATCCT CAATATTAAT CAGATTTGGT TGTACGGTCG CAGTTGTGCC TTTGGTGAGC ACTGGCTTAA TACTATTATT CGCCAGACAG GATTTAACCC GTTCGACCGC GACGAAGGAC CGAGCGTGAA AGCGACGCAA ATTGGCTTCG GGCAATTAAG CCGCGCACAA CAGGTGCTGG GAATTGGCTA TTTGTATGTT GAGACGCAGT TACGACAGAT TTGA
|
Protein sequence | MRACINNQQI RHHNKCVILE LLYRQKRANK STLARLAQIS IPAVSNILQE LESEKRVVNI DDESQTRGHS SGTWLIAPEG DWTLCLNVTP TSIECQVANA CLSPKGEFEY LQIDAPTPQA LLSEIEKCWH RHRKLWPDRT INLALAIHGQ VDPVTGVSQT MPQAPWTTPV EVKYLLEEKL GIRVMVDNDC VMLALAEKWQ NNSQERDFCV INVDYGIGSS FVINEQIYRG SLYGSGQIGH TIVNPDGVVC DCGRYGCLET VASLSALKKQ ARVWLKSQPV NTQLDPEKLT TAQLIAAWQS GEPWITSWVD RSANAIGLSL YNFLNILNIN QIWLYGRSCA FGEHWLNTII RQTGFNPFDR DEGPSVKATQ IGFGQLSRAQ QVLGIGYLYV ETQLRQI
|
| |