Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2703 |
Symbol | |
ID | 6146442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2778197 |
End bp | 2779390 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617574 |
Product | ROK family protein |
Protein accession | YP_001744739 |
Protein GI | 170681604 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCCT GCATTAATAA TCAACAGATT CGCCACCATA ACAAATGCGT GATTCTGGAA CTGCTGTACC GGCAAAAGCG CGCCAATAAA TCAACGCTGG CCCGGCTGGC GCAAATTTCG ATTCCGGCAG TCAGTAATAT TTTGCAGGAA CTGGAAAGCG AAAAACGGGT GGTAAATATC GACGATGAAA GCCAGACGCG CGGGCATAGT AGCGGTACAT GGCTGATTGC GCCGGAAGGT GACTGGACGC TGTGCCTGAA CGTGACGCCC ACCAGTATTG AGTGTCAGGT CGCTAACGCC TGTTTAAGTC CGAAAGGTGA ATTTGAGTAT TTACAGATTG ATGCACCGAC GCCGCAGGCG CTGCTGTCCG AAATCGAAAA ATGCTGGCAT CGCCACCGTA AATTGTGGCC GGACCGTACC ATCAACCTGG CGCTGGCAAT CCACGGTCAG GTTGATCCGG TGACTGGCGT GTCGCAAACC ATGCCGCAAG CGCCGTGGAC AACGCCGGTT GAGGTAAAGT ATCTGCTGGA AGAGAAGCTC GGCATTCGGG TGATGGTCGA TAATGACTGC GTGATGCTGG CGCTGGCGGA GAAATGGCAA AATAATTCGC AGGAACGGGA TTTCTGCGTG ATCAACGTTG ATTACGGCAT TGGCTCGTCG TTCGTGATTA ACGAGCAAAT TTATCGCGGC AGTTTGTATG GTAGCGGGCA AATTGGTCAC ACCATCGTTA ATCCGGATGG CGTCGTCTGC GACTGCGGGC GTTACGGCTG CCTGGAAACC GTCGCCTCGT TAAGCGCATT AAAAAAACAG GCGCGGGTAT GGCTAAAATC ACAACCGGTT AATACTCAGC TTGATCCTGA AAAACTGACT ACCGCGCAGT TAATCGCTGC CTGGCAGAGT GGAGAACCGT GGATCACCAG TTGGGTTGAT CGCTCTGCCA ACGCCATTGG TTTGAGTCTG TATAACTTCC TCAACATCCT CAATATTAAT CAGATTTGGT TGTACGGTCG CAGTTGTGCC TTTGGTGAGA ACTGGCTTAA TACCATTATT CGCCAGACGG GATTTAACCC GTTCGACCGC GACGAAGGAC CGAGCGTGAA AGCGACGCAA ATTGGCTTTG GGCAATTAAG CCGCGCACAG CAGGTGCTGG GAATTGGCTA TTTGTATGTT GAGGCGCAGT TACGGCAGAT TTGA
|
Protein sequence | MRACINNQQI RHHNKCVILE LLYRQKRANK STLARLAQIS IPAVSNILQE LESEKRVVNI DDESQTRGHS SGTWLIAPEG DWTLCLNVTP TSIECQVANA CLSPKGEFEY LQIDAPTPQA LLSEIEKCWH RHRKLWPDRT INLALAIHGQ VDPVTGVSQT MPQAPWTTPV EVKYLLEEKL GIRVMVDNDC VMLALAEKWQ NNSQERDFCV INVDYGIGSS FVINEQIYRG SLYGSGQIGH TIVNPDGVVC DCGRYGCLET VASLSALKKQ ARVWLKSQPV NTQLDPEKLT TAQLIAAWQS GEPWITSWVD RSANAIGLSL YNFLNILNIN QIWLYGRSCA FGENWLNTII RQTGFNPFDR DEGPSVKATQ IGFGQLSRAQ QVLGIGYLYV EAQLRQI
|
| |