Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0361 |
Symbol | prpR |
ID | 6143945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 371781 |
End bp | 373367 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615258 |
Product | propionate catabolism operon regulatory protein PrpR |
Protein accession | YP_001742465 |
Protein GI | 170684111 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR02329] propionate catabolism operon regulatory protein PrpR |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.785854 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACATC CACCACGGCT GAATGACGAC AAACCGGTTA TCTGGACAGT ATCGGTGACG CGCCTGTTCG AGCTGTTTCG CGATATCAGC CTCGAGTTTG ATCACCTGGC GAATATCACC CCTATTCAGC TTGGCTTTGA AAAAGCGGTG ACCTACATCC GCAAGAAACT GGCAAACGAA CGCTGCGATG CCATCATCGC TGCTGGCTCT AACGGCGCGT ACCTGAAAAG CCGCCTGTCA GTGCCAGTTA TTTTGATTAA ACCGAGCGGC TACGATGTAT TACAGGCACT GGCAAAAGCC GGAAAACTCA CCTCTTCTAT CGGCGTTGTC ACTTATCAGG AAACTATTCC GGCACTGGTG GCGTTTCAAA AAACCTTTAA TTTGCGCCTC GACCAACGTA GCTACATTAC CGAAGAAGAC GCGCGCGGGC AGATTAACGA GCTGAAAGCT AACGGCACCG AAGCGGTGGT CGGCGCGGGG CTGATTACCG ATCTGGCAGA AGAAGCCGGA ATGACCGGAA TTTTTATCTA TTCCGCCGCC ACCGTACGCC AGGCATTCAG CGATGCGCTG GATATGACGC GCATGTCGTT ACGCCATAAC ACTCACGATG CCACCCGCAA CGCCCTGCGT ACTCGTTACG TGCTGGGCGA TATGCTCGGT CAATCACCAC AGATGGAACA GGTACGGCAG ACTATTTTGC TGTATGCCCG CTCCAGTGCG GCGGTGTTGA TTGAGGGGGA AACGGGGACG GGCAAAGAGC TGGCGGCCCA GGCGATTCAT CGGGAATATT TTGCCCGCCA CGATGCGCGA CAGGGCAAAA AGTCGCATCC GTTTGTTGCC GTCAACTGCG GGGCGATTGC TGAATCTCTG CTGGAAGCGG AGCTGTTTGG CTATGAGGAA GGAGCGTTTA CCGGCTCGCG ACGTGGAGGT CGCGCCGGGC TGTTCGAAAT TGCCCACGGC GGTACGCTGT TTCTGGATGA GATTGGCGAA ATGCCGCTAC CTTTGCAGAC TCGCCTGTTA CGGGTGCTGG AAGAAAAAGA GGTCACCCGC GTCGGCGGGC ATCAGCCTGT TCCGGTGGAT GTGCGAGTCA TTAGCGCCAC TCACTGCAAT CTGGAAGAAG ATATGCGGCA AGGGGAGTTT CGCCGTGATC TGTTTTATCG GCTGAGTATT TTGCGTCTGC AATTGCCACC ACTGCGCGAG CGGGTGACGG ATATTCTGCC GCTGGCGGAA AGCTTTTTGA AAGTGTCTCT GGCGGCGCTC TCCGCCCCGT TTTCTGCCGC ATTACGCCAG GGGTTACAGG CAAGCGAAAC CGTGCTGGTG CACTACGACT GGCCGGGCAA TATTCGTGAA CTGCGCAATA TGATGGAGCG ACTGGCGCTG TTTTTAAGTG TGGAACCGAC GCCGGATTTA ACGCCGCAAT TTTTGCAGCT GCTACTGCCG GAACTGGCGC GCGAGTCGGC GAAAACTCCC GCTCCACGCT TACTGACACC ACAACAGGCA CTGGAGAAAT TTAATGGCGA TAAAACAGCA GCGGCGAATT ATTTAGGTAT CAGCCGGACG ACGTTCTGGC GGCGGCTGAA AAGCTGA
|
Protein sequence | MAHPPRLNDD KPVIWTVSVT RLFELFRDIS LEFDHLANIT PIQLGFEKAV TYIRKKLANE RCDAIIAAGS NGAYLKSRLS VPVILIKPSG YDVLQALAKA GKLTSSIGVV TYQETIPALV AFQKTFNLRL DQRSYITEED ARGQINELKA NGTEAVVGAG LITDLAEEAG MTGIFIYSAA TVRQAFSDAL DMTRMSLRHN THDATRNALR TRYVLGDMLG QSPQMEQVRQ TILLYARSSA AVLIEGETGT GKELAAQAIH REYFARHDAR QGKKSHPFVA VNCGAIAESL LEAELFGYEE GAFTGSRRGG RAGLFEIAHG GTLFLDEIGE MPLPLQTRLL RVLEEKEVTR VGGHQPVPVD VRVISATHCN LEEDMRQGEF RRDLFYRLSI LRLQLPPLRE RVTDILPLAE SFLKVSLAAL SAPFSAALRQ GLQASETVLV HYDWPGNIRE LRNMMERLAL FLSVEPTPDL TPQFLQLLLP ELARESAKTP APRLLTPQQA LEKFNGDKTA AANYLGISRT TFWRRLKS
|
| |