Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3992 |
Symbol | |
ID | 6144661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4070760 |
End bp | 4071692 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618818 |
Product | ROK family protein |
Protein accession | YP_001745957 |
Protein GI | 170680800 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTATC TGGGGTTGGA TATTGGTGGG ACCAAAATCG CCGCCGTCGT CATGGATGCG CATGGCTGGG AGATTCGCCG TTACCGCTGC CCGACGCAAA AGTCGACATA TCAACAATTT GTCTCATGCG TTGTGGCGCT TATCGAGCAG ATTAGGCGGG ACGTTCAACG ACCGATGCTG ACAGGGATCG CCTTACCCGG CAGTATCTCG CCACTCACTG GCCTGATTAA AAACGCGAAT ATTCAGGTGA TTAACGGTCA TGCGTTACAG GCTGATTTGC AGCAATTGCT TGGGCAACCG GTGGTGATAG CCAATGATGG TAACTGTTTC GCGCTATCAG AAGCTTGCGA CGGTGCCGGG CAAGATTATG ACGTGGTATT TGGTATTACG CTTGGTTCGG GCTGCGGCGG TGGCATTGCC ATCAAGCAAC GACCGTTTAT AGGGGCCTGG GGAAATGCTG CCGAATGCGG TCATATCACG TTGCCAGGCT ATATGGAGCA GGAAGATGGT CCATCAGTCA GTTGCTATTG CGGCAAACAC AACTGCGTGG AGTCGTTTGT TTCCGGCAGC GGTTTTAGTG AACGCTATCA ACAGATGACT GGTAACTTGC TCACTCCTGC GGCGATTGTC ACCCTGGCAC AACGTGGTGA TGCTTGTGCC ATGCAGCAGG TGGCACGTTT TCGCCAACAG CTTGCCCGCA CGCTGGCAAC CATCGTTAAC GTTGTTGACC CTGGCGTGAT TGTCATCGGC GGCGGGCTTT CGAATGTGGA ACTGCTTATC GCCGATCTGA ACACAGAAGT CGCTCCTCTG GTTTTCACCG ACCAATTCAC CACCCCCATT GTAAAAGCAC TGCACGGCGA CAGTAGCGGA ATGCGTGGCG CTGCCTGGCT TGCTATGCGC AACGGAGAAG CCAATGAAAC GTTCACAAAT TAA
|
Protein sequence | MHYLGLDIGG TKIAAVVMDA HGWEIRRYRC PTQKSTYQQF VSCVVALIEQ IRRDVQRPML TGIALPGSIS PLTGLIKNAN IQVINGHALQ ADLQQLLGQP VVIANDGNCF ALSEACDGAG QDYDVVFGIT LGSGCGGGIA IKQRPFIGAW GNAAECGHIT LPGYMEQEDG PSVSCYCGKH NCVESFVSGS GFSERYQQMT GNLLTPAAIV TLAQRGDACA MQQVARFRQQ LARTLATIVN VVDPGVIVIG GGLSNVELLI ADLNTEVAPL VFTDQFTTPI VKALHGDSSG MRGAAWLAMR NGEANETFTN
|
| |