Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1744 |
Symbol | purR |
ID | 6792918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 1707039 |
End bp | 1708064 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642775979 |
Product | DNA-binding transcriptional repressor PurR |
Protein accession | YP_002146615 |
Protein GI | 197249303 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000285436 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACAA TTAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTATCACAC GTAATCAACA AAACGCGTTT TGTCGCTGAA GAAACGCGTA ACGCGGTCTG GGCGGCAATT AAAGAGCTGC ACTACTCTCC CAGCGCCGTC GCGCGTAGCC TGAAGGTTAA CCATACCAAG TCGATAGGCT TACTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAAAT TATCGAGGCA GTTGAAAAAA ACTGTTTCCA GAAAGGCTAT ACGCTGATTT TAGGCAACGC CTGGAATAAC CTGGAAAAAC AGCGCGCCTA CCTGTCCATG ATGGCGCAAA AGCGCGTGGA TGGCCTACTG GTGATGTGTT CTGAGTATCC AGAACCTCTG CTTTCCATGC TGGAAGAGTA TCGCCATATT CCGATGGTGG TGATGGACTG GGGTGAAGCG AAGGCCGATT TTACCGACAC GGTGATTGAT AACGCCTTTG CAGGCGGCTA TATGGCGGGT CGTTATCTGG TTGAACGCGG CCACCGGGAT ATCGGCGTTA TTCCCGGCCC GCTGGAGCGC AACACCGGCG CGGGCCGGCT GGCAGGCTTT ATGAAAGCTA TGGAGGAGGC GCTGATCAAC GTGCCGGACA ACTGGATTGT TCAGGGCGAC TTCGAGCCGG AATCCGGTTA CCACGCGATG CAGCAAATCT TATCGCAGTC ACATCGCCCT ACCGCCGTTT TCTGCGGCGG CGATATTATG GCGATGGGCG CGCTTTGCGC GGCTGACGAA ATGGGGCTTC GCGTACCGCA GGACGTTTCG GTGATCGGTT ATGACAATGT GCGTAACGCC CGTTTCTTTA CCCCGGCGCT GACGACGATT CACCAGCCCA AAGACTCTTT AGGCGAAACC GCATTTAATA TGCTACTGGA TCGCATCGTC AATAAGCGTG AAGAGTCACA GTCTATTGAA GTTCATCCAC GCCTGGTTGA GCGTCGCTCG GTCGCTGACG GCCCGTTCCG CGACTATCGG CGTTAA
|
Protein sequence | MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL VMCSEYPEPL LSMLEEYRHI PMVVMDWGEA KADFTDTVID NAFAGGYMAG RYLVERGHRD IGVIPGPLER NTGAGRLAGF MKAMEEALIN VPDNWIVQGD FEPESGYHAM QQILSQSHRP TAVFCGGDIM AMGALCAADE MGLRVPQDVS VIGYDNVRNA RFFTPALTTI HQPKDSLGET AFNMLLDRIV NKREESQSIE VHPRLVERRS VADGPFRDYR R
|
| |