Gene EcSMS35_3113 details


Gene Information

Locus tag: EcSMS35_3113
Symbol: rafR
ID: 6143982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3198745
End bp: 3199755
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 48%
IMG OID: 641617980
Product: HTH-type transcriptional regulator RafR
Protein accession: YP_001745130
Protein GI: 170682779
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
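
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The record does not state either convention, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3198745
end_bp = 3199755

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1011 bp) and Protein Length (336 aa).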


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.021598
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTTA AGGCGATTGC CACTGCACTC GGAGTTTCTG TCACCACTGT CAGTCGGGCT 
CTTGGAGGCT ATGCAGATGT GTCAGCTTCC ACCCGTGCAC GGGTGGAAGC TGAAGCTCGT
CGACGCGGTT ACCGTCCTAA TACTCAGGCA AGAAGACTCA AAACCGGTAA AACCGATGCT
GTCGGTCTGG TTTATCCTGG ACGTGATGTG CCATTTAACA GTGGTGTTTT TATGGATATG
GTCAGTTGCA TCAGCCGGGA ACTTGCTCAC CATGATATTG ACTTACTACT GATAGCTGAT
GATGAGCATG CCGATTGCCA CAGCTATATG CGGCTTGTTG AAAGTCGCAG GGTTGATGCT
CTCATCATTG CACACACTCT GGATGACGAT CCCCGTATCA CACACCTTCA TAAAGCAGGT
ATTCCATTTC TGGCTCTTGG TCGGGTACCA TCTGGTCTGC CCTGTGCGTG GTTTGACTTT
GATAATCATG CCGGGACTGG ACAGGCAACC CAGAAACTGA TTGCTTTGGG ACATAAACGT
ATTGCATTGT TGAGTGAAGA CACTTCACAT AGCTATGTTG TTGCAAGACG TCAGGGATGG
CTTGATGCAC TGAATGAGCA TGAACTGGAA GATACATGGC TGCGGCTGGT TTCTCCCACA
CGGCGAGCAG GATATCAGGC CGTGATGGAG TTAATGTCAC TACCGATACC ACCAACAGCT
ATTATTACTG ACAATGACCT GAGTGGAGAT GGTGCTGCCA TGGCACTGCA GTTGAGCGGT
CGTCTTTCAG GAAAAGATGC GGTATCTCTG GTTGTATATG ATGGTTTGCC ACAGGACAGT
ATTATTGAGC TGGATGTGGC TGCAGTTATT CAGTCAACAC GAAGCCTCGT TGGTCGCCAG
ATTTCCGAGA TGATATATCA GATAATTACT GATTCATCAT CAAAACCACT ACAGGTTGTC
TGGAACCCGG TATTTTATCC GGGAAAGACG ATTCATCCTC CTTGCTCCTG A
 
Protein sequence
MSLKAIATAL GVSVTTVSRA LGGYADVSAS TRARVEAEAR RRGYRPNTQA RRLKTGKTDA 
VGLVYPGRDV PFNSGVFMDM VSCISRELAH HDIDLLLIAD DEHADCHSYM RLVESRRVDA
LIIAHTLDDD PRITHLHKAG IPFLALGRVP SGLPCAWFDF DNHAGTGQAT QKLIALGHKR
IALLSEDTSH SYVVARRQGW LDALNEHELE DTWLRLVSPT RRAGYQAVME LMSLPIPPTA
IITDNDLSGD GAAMALQLSG RLSGKDAVSL VVYDGLPQDS IIELDVAAVI QSTRSLVGRQ
ISEMIYQIIT DSSSKPLQVV WNPVFYPGKT IHPPCS
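
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the method used to produce this record. The `translate` function and the codon lookup are written for this example. The amino acid assignments used are those of the standard genetic code, which table 11 shares; table 11 differs only in the alternative start codons it permits, and those are not modelled here. Only the first 60 bases and the last 12 bases of the gene sequence are used as input.

```python
# Codon table in the conventional TCAG ordering (standard genetic code;
# translation table 11 uses the same amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Drop the spaces and line breaks used to group bases on the page.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence above (60 bases).
first_60 = "ATGTCACTTA AGGCGATTGC CACTGCACTC GGAGTTTCTG TCACCACTGT CAGTCGGGCT"
# Last 12 bases of the gene sequence above, including the stop codon.
last_12 = "CCTTGCTCCTGA"

print(translate(first_60))  # MSLKAIATALGVSVTTVSRA
print(translate(last_12))   # PCS*
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final three residues followed by the stop codon.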