Gene Information |
Locus tag | EcSMS35_2909 |
Symbol | scrR |
ID | 6144872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2982908 |
End bp | 2983915 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641617778 |
Product | sucrose operon repressor |
Protein accession | YP_001744933 |
Protein GI | 170682803 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | [TIGR02417] D-fructose-responsive transcription factor |
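The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2982908
end_bp = 2983915

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1008, matching "Gene Length | 1008 bp"
print(protein_length)   # 335, matching "Protein Length | 335 aa"
```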
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00007545 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAAAAA CCAAACGCGT TACTATTAAA GACATTGCTG AACTGGCGGG AGTGTCGAAA GCAACCGCCA GTCTGGTTCT CAATGGTCGT GGCAAAGAGT TACGTGTTGC GCAGGAAACG CGTGAGCGTG TGCTGAGTAT TGCCCAACAG CAGCATTATC AGCCCAGCAT TCATGCCCGC TCATTACGGG ACAACCGCAG TCACACCATT GGTCTCGTCG TTCCTGAAAT CACTAACTAC GGATTTGCAG TATTTTCCCA TGAACTGGAA ACATTGTGTC GGGAAGCGGG AGTCCAACTC CTCATATCCT GTACTGATGA AAATCCTTCA CAGGAAACAA TGGTGGTGAA CAATATGATT GCCCGCCAGG TTGATGGGTT AATCGTCGCA TCCAGTATGC TGCATGATAA TGACTATCAA AAACTTAGCG AGCAATTACC TGTTGTGTTG TTCGATCGCC ACATGAACGG CAGCACATTG CCATTGGTCA TTACTGACTC TGTTTCTCCC ACCGCCGCTC TGGTGGCAGA TATTGCACGG AGTCACCCGG ATGAATTCTA TTTTCTGGGC GGACAACCCC GTTTGTCACC GACTCGCGAT CGACTGGAAG GATTTACACA AGGCTTGCAG CAAGCCGGAG TAACGCTACA ACCTGAATGG ATTATTCACG GTAATTATCA TCCGAGCTCA GGCTATGAAA TGTTTGCCGC GCTATGCGCT CGTTTAGGAC GTCCCCCTAA AGCCTTATTC ACTGCCGCTT GTGGATTACT GGAAGGCGTT CTGCGCTATA TGAGCCAATA TAATTTACTC GACAGTAAAA TCCACCTGGC GAGTTTTGAT GATCATTACT TGTATGATTC ACTTTCGGTA AGAATCGACA CAATACAACA GGATAATCGC CAACTTGCTT TTCACTGCTT TGAACTTATT TCGCAACTGA TTGAAGGTGA AACGCCATCC CCATTACAGC GTTATCTTCC TGCCAGTCTA CAAAAACGCT ACCGATAA
|
Protein sequence | MRKTKRVTIK DIAELAGVSK ATASLVLNGR GKELRVAQET RERVLSIAQQ QHYQPSIHAR SLRDNRSHTI GLVVPEITNY GFAVFSHELE TLCREAGVQL LISCTDENPS QETMVVNNMI ARQVDGLIVA SSMLHDNDYQ KLSEQLPVVL FDRHMNGSTL PLVITDSVSP TAALVADIAR SHPDEFYFLG GQPRLSPTRD RLEGFTQGLQ QAGVTLQPEW IIHGNYHPSS GYEMFAALCA RLGRPPKALF TAACGLLEGV LRYMSQYNLL DSKIHLASFD DHYLYDSLSV RIDTIQQDNR QLAFHCFELI SQLIEGETPS PLQRYLPASL QKRYR
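The gene sequence starts with GTG while the protein starts with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch, not the database's own tooling: the function names `clean`, `gc_percent` and `translate_cds` are hypothetical, and the amino acid assignments are written out by hand from the table 11 code, which matches the standard code for all 64 codons.

```python
from itertools import product

# Codons enumerated with bases in T, C, A, G order (third position fastest),
# paired with the amino acids assigned by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; at position 1 they are read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon, if any
    protein = []
    for i in range(0, usable, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "GTGAGAAAAA CCAAACGCGT TACTATTAAA"
print(translate_cds(prefix))  # MRKTKRVTIK, the first block of the protein
```

Passing the full gene sequence to `gc_percent` and `translate_cds` should allow the GC content and protein sequence fields to be compared against the values listed in this record.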