Gene EcSMS35_0361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0361 
SymbolprpR 
ID6143945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp371781 
End bp373367 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content55% 
IMG OID641615258 
Productpropionate catabolism operon regulatory protein PrpR 
Protein accessionYP_001742465 
Protein GI170684111 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR02329] propionate catabolism operon regulatory protein PrpR 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.785854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACATC CACCACGGCT GAATGACGAC AAACCGGTTA TCTGGACAGT ATCGGTGACG 
CGCCTGTTCG AGCTGTTTCG CGATATCAGC CTCGAGTTTG ATCACCTGGC GAATATCACC
CCTATTCAGC TTGGCTTTGA AAAAGCGGTG ACCTACATCC GCAAGAAACT GGCAAACGAA
CGCTGCGATG CCATCATCGC TGCTGGCTCT AACGGCGCGT ACCTGAAAAG CCGCCTGTCA
GTGCCAGTTA TTTTGATTAA ACCGAGCGGC TACGATGTAT TACAGGCACT GGCAAAAGCC
GGAAAACTCA CCTCTTCTAT CGGCGTTGTC ACTTATCAGG AAACTATTCC GGCACTGGTG
GCGTTTCAAA AAACCTTTAA TTTGCGCCTC GACCAACGTA GCTACATTAC CGAAGAAGAC
GCGCGCGGGC AGATTAACGA GCTGAAAGCT AACGGCACCG AAGCGGTGGT CGGCGCGGGG
CTGATTACCG ATCTGGCAGA AGAAGCCGGA ATGACCGGAA TTTTTATCTA TTCCGCCGCC
ACCGTACGCC AGGCATTCAG CGATGCGCTG GATATGACGC GCATGTCGTT ACGCCATAAC
ACTCACGATG CCACCCGCAA CGCCCTGCGT ACTCGTTACG TGCTGGGCGA TATGCTCGGT
CAATCACCAC AGATGGAACA GGTACGGCAG ACTATTTTGC TGTATGCCCG CTCCAGTGCG
GCGGTGTTGA TTGAGGGGGA AACGGGGACG GGCAAAGAGC TGGCGGCCCA GGCGATTCAT
CGGGAATATT TTGCCCGCCA CGATGCGCGA CAGGGCAAAA AGTCGCATCC GTTTGTTGCC
GTCAACTGCG GGGCGATTGC TGAATCTCTG CTGGAAGCGG AGCTGTTTGG CTATGAGGAA
GGAGCGTTTA CCGGCTCGCG ACGTGGAGGT CGCGCCGGGC TGTTCGAAAT TGCCCACGGC
GGTACGCTGT TTCTGGATGA GATTGGCGAA ATGCCGCTAC CTTTGCAGAC TCGCCTGTTA
CGGGTGCTGG AAGAAAAAGA GGTCACCCGC GTCGGCGGGC ATCAGCCTGT TCCGGTGGAT
GTGCGAGTCA TTAGCGCCAC TCACTGCAAT CTGGAAGAAG ATATGCGGCA AGGGGAGTTT
CGCCGTGATC TGTTTTATCG GCTGAGTATT TTGCGTCTGC AATTGCCACC ACTGCGCGAG
CGGGTGACGG ATATTCTGCC GCTGGCGGAA AGCTTTTTGA AAGTGTCTCT GGCGGCGCTC
TCCGCCCCGT TTTCTGCCGC ATTACGCCAG GGGTTACAGG CAAGCGAAAC CGTGCTGGTG
CACTACGACT GGCCGGGCAA TATTCGTGAA CTGCGCAATA TGATGGAGCG ACTGGCGCTG
TTTTTAAGTG TGGAACCGAC GCCGGATTTA ACGCCGCAAT TTTTGCAGCT GCTACTGCCG
GAACTGGCGC GCGAGTCGGC GAAAACTCCC GCTCCACGCT TACTGACACC ACAACAGGCA
CTGGAGAAAT TTAATGGCGA TAAAACAGCA GCGGCGAATT ATTTAGGTAT CAGCCGGACG
ACGTTCTGGC GGCGGCTGAA AAGCTGA
 
Protein sequence
MAHPPRLNDD KPVIWTVSVT RLFELFRDIS LEFDHLANIT PIQLGFEKAV TYIRKKLANE 
RCDAIIAAGS NGAYLKSRLS VPVILIKPSG YDVLQALAKA GKLTSSIGVV TYQETIPALV
AFQKTFNLRL DQRSYITEED ARGQINELKA NGTEAVVGAG LITDLAEEAG MTGIFIYSAA
TVRQAFSDAL DMTRMSLRHN THDATRNALR TRYVLGDMLG QSPQMEQVRQ TILLYARSSA
AVLIEGETGT GKELAAQAIH REYFARHDAR QGKKSHPFVA VNCGAIAESL LEAELFGYEE
GAFTGSRRGG RAGLFEIAHG GTLFLDEIGE MPLPLQTRLL RVLEEKEVTR VGGHQPVPVD
VRVISATHCN LEEDMRQGEF RRDLFYRLSI LRLQLPPLRE RVTDILPLAE SFLKVSLAAL
SAPFSAALRQ GLQASETVLV HYDWPGNIRE LRNMMERLAL FLSVEPTPDL TPQFLQLLLP
ELARESAKTP APRLLTPQQA LEKFNGDKTA AANYLGISRT TFWRRLKS