Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1857 |
Symbol | purR |
ID | 6268494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1699013 |
End bp | 1700038 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725921 |
Product | DNA-binding transcriptional repressor PurR |
Protein accession | YP_001880419 |
Protein GI | 187731351 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000000046163 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC GTAATCAACA AAACACGTTT CGTCGCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATT AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG TCTATCGGTT TGCTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCG GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGTAATGC GTGGAACAAT CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TTGAACGCGG TCACCGCGAA ATCGGCGTCA TCCCCGGCCC GCTGGAACGT AACACCGGCG CAGGCCGCCT TGCCGGTTTT ATGAAGGCGA TGGAAGAAGC GATGATCAAG GTGCCGGAAA GCTGGATTGT TCAGGGTGAC TTTGAACCCG AATCTGGTTA TCGCGCCATG CAGCAAATAC TGTCGCAGCC GCATCGCCCT ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA ATGGGTCTGC GCGTCCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC CGCTATTTTA CGCCGGCGCT GACCACGATC CACCAGCCAA AAGATTCGCT GGGTGAAACA GCGTTCAACA TGCTGTTGGA TCGTATCGTC AACAAACGTG AAGAACCGCA GTCCATTGAA GTGCATCCGC GCTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT CGTTAA
|
Protein sequence | MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R
|
| |