Gene EcSMS35_3703 details


Gene Information

Locus tag: EcSMS35_3703
Symbol: rtcR
ID: 6144039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3765799
End bp: 3767397
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 55%
IMG OID: 641618529
Product: sigma-54 dependent transcriptional regulator RtcR
Protein accession: YP_001745669
Protein GI: 170680280
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG4650] Sigma54-dependent transcription regulator containing an AAA-type ATPase domain and a DNA-binding domain
TIGRFAM ID:
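
The coordinates and lengths in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3765799
END_BP = 3767397


def gene_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the reported Gene Length (1599 bp) and Protein Length (532 aa).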


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.973563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.184346
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAA CAGTGGCTTT TGGTTTTGTC GGAACGGTGT TGGATTATGC CGGGCGCGGC 
AGTCAGCGCT GGTCAAAATG GCGTCCGTCA CTCTGTCTAT GCCAGCAAGA ATCGTTAGTC
ATCAATCGAC TGGAATTGTT GCACGACGCC CGCTCACGCT CGCTGTTTGA AACGCTTAAA
CGCGATATCG CCAATGTTTC GCCAGAAACA AAAGTGGTGG GCGTTGAGAT TGAACTGCAT
AACCCATGGG ATTTCGAGGA GGTTTACGCC TGCCTGCATG ATTTCGCCCG TGGTTACGCG
TTCGAACCTG ATAAAGAAGA CTATTTAATT CACATCACCA CCGGTACCCA CGTCGCGCAG
ATTTGCTGGT TTCTGCTGGC GGAAGCACGT TACCTGCCCG CCAGGTTAAT ACAATCTTCG
CCACCGCGTA AAAAAGAACA GCCGCGCGGT GCAGGTGAAG TGACGATTAT CGATCTCGAT
TTAAGCCGTT ATAACGCCAT CGCCAGCCGC TTTGCCGAGG AACGCCAGCA AACGCTCGAT
TTTCTTAAGT CCGGCATTGC CACGCGTAAC GCCCACTTCA ACCGCATGAT TGAGCAGATC
GAAAAAGTGG CGATCAAATC CCGCGCACCG ATCCTGCTCA ACGGCCCAAC CGGTGCGGGA
AAGTCGTTTC TGGCACGACG CATCTTCGAG TTAAAACAGG CGCGGCATCA GTTTAGCGGC
GCGTTTGTGG AGGTGAACTG TGCCACTCTG CGCGGCGATA CCGCCATGTC AGCGCTGTTT
GGTCATGTGA AAGGCGCGTT TACCGGGGCG CGGGAGTCGC GGGAAGGATT ATTACGCAGC
GCCAACGGCG GGATGTTATT TCTTGATGAG ATTGGCGAAC TGGGCGCAGA CGAACAGGCA
ATGCTGCTGA AAGCCATTGA AGAGAAAACC TTTTATCCGT TTGGCAGCGA TCGCCAGGTG
AGTAGCGACT TTCAGCTGAT CGCCGGAACG GTGCGCGATT TGCGTCAGCT GGTTGCCGAA
GGCAAATTCC GCGAAGATCT GTACGCGCGG ATCAATCTCT GGACCTTCAC CCTGCCGGGG
CTACGCCAGC GCCAGGAAGA TATTGAACCT AACCTGGATT ATGAAGTGGA GCGCCACGCC
TCACTTACCG GCGACAGCGT GCGTTTTAAC ACCGAAGCGC GGCGCGCCTG GCTGGCCTTT
GCAACATCAT CTCAGGCGGC ATGGCGCGGT AACTTTCGCG AGCTTTCTGC CAGCGTCACG
CGAATGGCAA CCTTTGCCAC TAGCGGACGC ATCACTCTGG AAGTGGTTGG AGATGAGATA
AACCGTCTGC GCTATAACTG GCAGGAGAGT CGTCCTTCCG CGCTTACGGC GTTGCTGGGC
GCTGAGGCAG AAAACATCGA CCTCTTTGAC CGTATGCAAC TGGAACATGT TATAGCTATC
TGCCGGCAGG CAAAGTCGCT TTCCGCTGCC GGACGCCAGC TTTTTGACGT TTCGCGCCAG
GGGAAAGCCA GCGTCAACGA CGCGGATCGG CTACGCAAAT ACCTGGCGCG TTTTAATCTG
ACGTGGGAAG CCGTGCAGGA TCAGCACAGC TCCAGTTGA
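
The gene sequence above is displayed in space-separated groups of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCGTAAAA CAGTGGCTTT TGGTTTTGTC GGAACGGTGT TGGATTATGC CGGGCGCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1599 bp sequence through the same two functions gives a value that can be compared against the GC content reported in the record.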
 
Protein sequence
MRKTVAFGFV GTVLDYAGRG SQRWSKWRPS LCLCQQESLV INRLELLHDA RSRSLFETLK 
RDIANVSPET KVVGVEIELH NPWDFEEVYA CLHDFARGYA FEPDKEDYLI HITTGTHVAQ
ICWFLLAEAR YLPARLIQSS PPRKKEQPRG AGEVTIIDLD LSRYNAIASR FAEERQQTLD
FLKSGIATRN AHFNRMIEQI EKVAIKSRAP ILLNGPTGAG KSFLARRIFE LKQARHQFSG
AFVEVNCATL RGDTAMSALF GHVKGAFTGA RESREGLLRS ANGGMLFLDE IGELGADEQA
MLLKAIEEKT FYPFGSDRQV SSDFQLIAGT VRDLRQLVAE GKFREDLYAR INLWTFTLPG
LRQRQEDIEP NLDYEVERHA SLTGDSVRFN TEARRAWLAF ATSSQAAWRG NFRELSASVT
RMATFATSGR ITLEVVGDEI NRLRYNWQES RPSALTALLG AEAENIDLFD RMQLEHVIAI
CRQAKSLSAA GRQLFDVSRQ GKASVNDADR LRKYLARFNL TWEAVQDQHS SS
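
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a sketch, not the pipeline used to produce the record: it assumes translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons), and the `translate` helper is hypothetical. The sample inputs are the first and last displayed lines of the gene sequence.

```python
# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(grouped_dna):
    """Translate a DNA string (spaces allowed) in frame 1 up to the first stop."""
    seq = "".join(grouped_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGCGTAAAA CAGTGGCTTT TGGTTTTGTC GGAACGGTGT TGGATTATGC CGGGCGCGGC"
last_line = "ACGTGGGAAG CCGTGCAGGA TCAGCACAGC TCCAGTTGA"
print(translate(first_line))  # MRKTVAFGFVGTVLDYAGRG
print(translate(last_line))   # TWEAVQDQHSSS
```

Both outputs match the start and end of the protein sequence listed above. The last line can be translated on its own because the 26 preceding lines hold 1560 bases, a multiple of 3, so it begins in frame.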