Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4722 |
Symbol | treR |
ID | 6147153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4821519 |
End bp | 4822466 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619538 |
Product | trehalose repressor |
Protein accession | YP_001746646 |
Protein GI | 170680573 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | [TIGR02405] trehalose operon repressor, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.557711 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATC GGCTGACCAT CAAAGACATC GCGCGCTTAA GCGGCGTGGG GAAATCTACG GTTTCCCGGG TGCTGAATAA CGAAAGCGGC GTGAGCCAGC GCACGCGCGA GCGTGTTGAA GCAGTGATGA ATCAGCATGG ATTTTCCCCT TCCCGCTCTG CGCGCGCTAT GCGTGGGCAA AGCGATAAAG TGGTCGCCAT CATTGTTACC CGTCTGGATT CGTTGTCAGA AAATCTCGCC GTTCAAACCA TGCTGCCAGC GTTCTATGAA CAAGGTTACG ACCCAATCAT GATGGAAAGT CAGTTTTCCC CGCAATTAGT TGCCGAACAT TTGGGGGTCC TGAAACGGCG TAATATCGAC GGCGTAGTGC TGTTCGGTTT TACCGGCATA ACAGAAGAAA TGTTAGCCCA CTGGCAGTCA TCGCTGGTTC TGCTGGCGCG TGACGCAAAA GGCTTTGCTT CAGTCTGTTA TGACGACGAA GGGGCAATCA AGATCCTGAT GCAACGGCTG TATGACCAGG GGCATCGTAA TATCAGTTAT CTCGGCGTGC CGCATAGTGA CGTGACGACC GGTAAGCGAC GTCACGAAGC CTACCTGGCG TTCTGCAAAG CGCATAAATT GCATCCCGTT GCCGCTCTGC CAGGGCTTGC TATGAAGCAA GGCTATGAGA ACGTAGCAAA AGTGATTACG CCTGAAACTA CCGCGTTACT GTGCGCAACC GACACGCTGG CACTTGGCGC AAGTAAATAC CTGCAAGAGC AACGCATCGA CACCTTGCAA CTGGCGAGCG TCGGTAATAC ACCGTTAATG AAATTCCTCC ATCCGGAGAT CGTAACCGTA GATCCCGGTT ACGCCGAAGC TGGACGCCAG GCGGCCTGCC AGTTGATCGC GCAGGTCACC GGGCGCAGCG AACCGCAACA AATCATCATC CCCGCCACCC TGTCCTGA
|
Protein sequence | MQNRLTIKDI ARLSGVGKST VSRVLNNESG VSQRTRERVE AVMNQHGFSP SRSARAMRGQ SDKVVAIIVT RLDSLSENLA VQTMLPAFYE QGYDPIMMES QFSPQLVAEH LGVLKRRNID GVVLFGFTGI TEEMLAHWQS SLVLLARDAK GFASVCYDDE GAIKILMQRL YDQGHRNISY LGVPHSDVTT GKRRHEAYLA FCKAHKLHPV AALPGLAMKQ GYENVAKVIT PETTALLCAT DTLALGASKY LQEQRIDTLQ LASVGNTPLM KFLHPEIVTV DPGYAEAGRQ AACQLIAQVT GRSEPQQIII PATLS
|
| |