Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1540 |
Symbol | purR |
ID | 6483754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1507360 |
End bp | 1508385 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642736927 |
Product | DNA-binding transcriptional repressor PurR |
Protein accession | YP_002040679 |
Protein GI | 194443895 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000065632 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAA TTAAAGATGT AGCGAAACGG GCAAACGTTT CCACTACAAC TGTATCACAC GTAATCAACA AAACGCGTTT TGTCGCTGAA GAAACGCGTA ACGCGGTCTG GGCGGCAATT AAAGAGCTGC ACTACTCTCC CAGCGCCGTC GCGCGTAGCC TGAAGGTTAA CCATACCAAG TCGATAGGCT TACTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAAAT TATCGAGGCA GTTGAAAAAA ACTGTTTCCA GAAAGGCTAT ACGCTGATTT TAGGCAACGC CTGGAATAAC CTGGAAAAAC AGCGCGCCTA CCTGTCCATG ATGGCGCAAA AGCGCGTGGA TGGCCTGCTG GTGATGTGTT CTGAGTATCC AGAACCTCTG CTTTCCATGC TGGAAGAGTA TCGCCATATT CCGATGGTGG TGATGGACTG GGGTGAAGCG AAGGCCGATT TTACCGACAC GGTGATTGAT AACGCCTTTG CAGGCGGCTA TATGGCGGGT CGTTATCTGG TTGAACGCGG CCACCGGGAT ATCGGCGTTA TTCCCGGCCC GCTGGAGCGC AACACCGGCG CGGGGCGGCT GGCAGGCTTT ATGAAAGCCA TGGAGGAGGC GCTGATCAAC GTGCCGGACA ACTGGATTGT TCAGGGCGAC TTCGAGCCGG AATCCGGTTA CCACGCGATG CAGCAAATCT TATCGCAGTC ACATCGCCCT ACCGCCGTTT TCTGCGGCGG CGATATTATG GCGATGGGCG CGCTTTGCGC GGCTGACGAA ATGGGGCTTC GCGTACCGCA GGACGTTTCG GTGATCGGTT ATGACAATGT GCGTAACGCC CGTTTCTTTA CCCCGGCGCT GACGACGATT CACCAGCCCA AAGACTCTTT AGGTGAAACC GCATTTAATA TGCTATTGGA TCGCATCGTC AATAAGCGTG AAGAGTCACA GTCTATTGAA GTTCATCCAC GCCTGGTTGA GCGTCGCTCG GTCGCTGACG GCCCGTTCCG CGACTATCGG CGTTAA
|
Protein sequence | MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL VMCSEYPEPL LSMLEEYRHI PMVVMDWGEA KADFTDTVID NAFAGGYMAG RYLVERGHRD IGVIPGPLER NTGAGRLAGF MKAMEEALIN VPDNWIVQGD FEPESGYHAM QQILSQSHRP TAVFCGGDIM AMGALCAADE MGLRVPQDVS VIGYDNVRNA RFFTPALTTI HQPKDSLGET AFNMLLDRIV NKREESQSIE VHPRLVERRS VADGPFRDYR R
|
| |