Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1538 |
Symbol | purR |
ID | 6147195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1523990 |
End bp | 1525015 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616416 |
Product | DNA-binding transcriptional repressor PurR |
Protein accession | YP_001743594 |
Protein GI | 170683174 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0686294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0781462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC GTGATCAACA AAACACGTTT CGTCGCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATT AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG TCTATCGGTT TGCTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCG GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGCAATGC GTGGAACAAT CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TAGAACGCGG TCACCGCGAA ATCGGCGTTA TCCCTGGCCC GCTGGAACGT AACACCGGCG CAGGCCGCCT TGCCGGTTTT ATGAAGGCGA TGGAAGAAGC GATGATCAAA GTGCCGGAAA GCTGGATTGT ACAGGGTGAC TTTGAACCTG AATCCGGTTA TCGCGCCATG CAGCAAATAC TGTCGCAACC GCATCGCCCT ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA ATGGGTCTGC GCGTGCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC CGCTATTTTA CGCCAGCGCT GACCACGATC CATCAGCCAA AAGATTCGCT GGGCGAAACG GCGTTCAACA TGCTGTTGGA TCGTATCGTC AATAAACGCG AAGAACCGCA ATCCATTGAA GTGCATCCGC GGTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT CGTTAA
|
Protein sequence | MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R
|
| |