Gene EcSMS35_2909 details

Gene Information

Locus tag: EcSMS35_2909
Symbol: scrR
ID: 6144872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2982908
End bp: 2983915
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 47%
IMG OID: 641617778
Product: sucrose operon repressor
Protein accession: YP_001744933
Protein GI: 170682803
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID: [TIGR02417] D-fructose-responsive transcription factor
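
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes, as the arithmetic suggests but the record does not state, that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2982908, 2983915)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1008 335
```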


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00007545
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAAAAA CCAAACGCGT TACTATTAAA GACATTGCTG AACTGGCGGG AGTGTCGAAA 
GCAACCGCCA GTCTGGTTCT CAATGGTCGT GGCAAAGAGT TACGTGTTGC GCAGGAAACG
CGTGAGCGTG TGCTGAGTAT TGCCCAACAG CAGCATTATC AGCCCAGCAT TCATGCCCGC
TCATTACGGG ACAACCGCAG TCACACCATT GGTCTCGTCG TTCCTGAAAT CACTAACTAC
GGATTTGCAG TATTTTCCCA TGAACTGGAA ACATTGTGTC GGGAAGCGGG AGTCCAACTC
CTCATATCCT GTACTGATGA AAATCCTTCA CAGGAAACAA TGGTGGTGAA CAATATGATT
GCCCGCCAGG TTGATGGGTT AATCGTCGCA TCCAGTATGC TGCATGATAA TGACTATCAA
AAACTTAGCG AGCAATTACC TGTTGTGTTG TTCGATCGCC ACATGAACGG CAGCACATTG
CCATTGGTCA TTACTGACTC TGTTTCTCCC ACCGCCGCTC TGGTGGCAGA TATTGCACGG
AGTCACCCGG ATGAATTCTA TTTTCTGGGC GGACAACCCC GTTTGTCACC GACTCGCGAT
CGACTGGAAG GATTTACACA AGGCTTGCAG CAAGCCGGAG TAACGCTACA ACCTGAATGG
ATTATTCACG GTAATTATCA TCCGAGCTCA GGCTATGAAA TGTTTGCCGC GCTATGCGCT
CGTTTAGGAC GTCCCCCTAA AGCCTTATTC ACTGCCGCTT GTGGATTACT GGAAGGCGTT
CTGCGCTATA TGAGCCAATA TAATTTACTC GACAGTAAAA TCCACCTGGC GAGTTTTGAT
GATCATTACT TGTATGATTC ACTTTCGGTA AGAATCGACA CAATACAACA GGATAATCGC
CAACTTGCTT TTCACTGCTT TGAACTTATT TCGCAACTGA TTGAAGGTGA AACGCCATCC
CCATTACAGC GTTATCTTCC TGCCAGTCTA CAAAAACGCT ACCGATAA
 
Protein sequence
MRKTKRVTIK DIAELAGVSK ATASLVLNGR GKELRVAQET RERVLSIAQQ QHYQPSIHAR 
SLRDNRSHTI GLVVPEITNY GFAVFSHELE TLCREAGVQL LISCTDENPS QETMVVNNMI
ARQVDGLIVA SSMLHDNDYQ KLSEQLPVVL FDRHMNGSTL PLVITDSVSP TAALVADIAR
SHPDEFYFLG GQPRLSPTRD RLEGFTQGLQ QAGVTLQPEW IIHGNYHPSS GYEMFAALCA
RLGRPPKALF TAACGLLEGV LRYMSQYNLL DSKIHLASFD DHYLYDSLSV RIDTIQQDNR
QLAFHCFELI SQLIEGETPS PLQRYLPASL QKRYR
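
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial code given in the record), GTG is an accepted initiation codon and is read as methionine at the start of a CDS. The sketch below illustrates this on the first 30 bases of the gene sequence. The codon table is built from the standard amino-acid string in TCAG order, the set of start codons is the one listed for table 11, and the name `translate` is hypothetical. This is a minimal illustration, not the pipeline that produced the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiation codon is read as methionine
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


print(translate("GTGAGAAAAA CCAAACGCGT TACTATTAAA"))  # prints: MRKTKRVTIK
```

The output matches the first ten residues of the protein sequence above.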