Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0396 |
Symbol | prpR |
ID | 5591597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 412848 |
End bp | 414434 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919581 |
Product | propionate catabolism operon regulatory protein PrpR |
Protein accession | YP_001457166 |
Protein GI | 157159848 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR02329] propionate catabolism operon regulatory protein PrpR |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACATC CACCACGGCT TAATGACGAC AAACCGGTTA TCTGGACGGT ATCTGTAACG CGCCTGTTCG AGCTGTTTCG CGATATCAGC CTCGAGTTTG ATCACCTGGC GAACATTACC CCTATCCAGC TTGGCTTTGA AAAAGCAGTG ACCTATATCC GCAAGAAACT GGCAAACGAA CGCTGCGACG CCGTCATCGC CGCTGGCTCT AACGGCGCGT ACCTGAAAAG CCGCCTGTCA GTGCCAGTTA TTTTGATTAA ACCGAGCGGC TACGATGTGT TACAGGCACT GGCAAAAGCC GGAAAGCTCA CCTCTTCTAT CGGCGTTGTC ACTTATCAGG AAACTATTCC GGCACTGGTG GCGTTTCAAA AAACCTTTAA TTTGCGCCTC GATCAACGTA GCTACATTAC CGAAGAAGAC GCCCGCGGGC AGATTAACGA GCTCAAAGCT AACGGCACCG AAGCGGTGGT CGGTGCGGGG CTGATTACCG ATCTGGCAGA AGAAGCCGGA ATGACCGGAA TTTTTATCTA TTCCGCCGTC ACCGTGCGCC AGGCGTTCAG CGATGCGCTG GATATGACGC GCATGTCGTT ACGCCATAAC ACTCACGATG CCACCCGCAA CGCCCTGAGA ACTCGTTACG TGCTGGGCGA TATGCTCGGT CAATCACCAC AGATGGAACA AGTACGGCAG ACTATTTTGC TGTATGCCCG CTCCAGTGCG GCGGTGTTGA TTGAAGGGGA AACGGGGACG GGCAAAGAGC TGGCGGCCCA GGCGATTCAT CGGGAATATT TTGCCCGCCA CGATGCGCGA CAGGGCAAAA AGTCGCATCC GTTTGTTGCC GTCAACTGCG GGGCGATAGC CGAATCGCTG CTGGAAGCGG AGCTGTTTGG CTATGAGGAA GGGGCGTTTA CCGGCTCGCG ACGCGGAGGT CGCGCCGGGC TGTTCGAAAT TGCCCACGGC GGTACGCTGT TTCTGGATGA GATTGGCGAA ATGCCGCTAC CTTTGCAGAC TCGCCTGTTA CGGGTGCTGG AAGAAAAAGA GGTTACTCGC GTCGGCGGGC ATCAGCCTGT TCCGGTGGAT GTGCGGGTCA TTAGCGCCAC TCACTGCAAT CTGGAAGAAG ATATGCGGCA AGGGCAGTTT CGCCGTGATC TGTTTTATCG GCTGAGTATT TTGCGTCTGC AATTGCCACC ACTGCGCGAG CGGGTGGCGG ATATTCTTCC GCTGGCGGAA AGCTTTTTGA AAGTGTCTCT GGCGGCGCTC TCCGCCCCGT TTTCTGCCGC ATTACGCCAG GGATTACAGG CAAGCGAAAC CGTGCTGGTG CACTACGACT GGCCAGGCAA TATTCGTGAA CTGCGCAATA TGATGGAACG ACTGGCGCTG TTTTTAAGTG TGGAACCGAC GCCGGATTTA ACGCCGCAGT TTATGCAACT GCTACTGCCG GAGCTGGCGC GCGAGTCGGC GAAAATTCCC GCTCCACGCT TACTGACACC ACAACAGGCA CTGGAGAAAT TTAAAGGCGA TAAAACAGCA GCGGCGAATT ATTTAGGCAT AAGCCGGACG ACGTTCTGGC GGCGGCTGAA AAACTGA
|
Protein sequence | MAHPPRLNDD KPVIWTVSVT RLFELFRDIS LEFDHLANIT PIQLGFEKAV TYIRKKLANE RCDAVIAAGS NGAYLKSRLS VPVILIKPSG YDVLQALAKA GKLTSSIGVV TYQETIPALV AFQKTFNLRL DQRSYITEED ARGQINELKA NGTEAVVGAG LITDLAEEAG MTGIFIYSAV TVRQAFSDAL DMTRMSLRHN THDATRNALR TRYVLGDMLG QSPQMEQVRQ TILLYARSSA AVLIEGETGT GKELAAQAIH REYFARHDAR QGKKSHPFVA VNCGAIAESL LEAELFGYEE GAFTGSRRGG RAGLFEIAHG GTLFLDEIGE MPLPLQTRLL RVLEEKEVTR VGGHQPVPVD VRVISATHCN LEEDMRQGQF RRDLFYRLSI LRLQLPPLRE RVADILPLAE SFLKVSLAAL SAPFSAALRQ GLQASETVLV HYDWPGNIRE LRNMMERLAL FLSVEPTPDL TPQFMQLLLP ELARESAKIP APRLLTPQQA LEKFKGDKTA AANYLGISRT TFWRRLKN
|
| |