Gene ECH74115_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2372 
SymbolpurR 
ID6970030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2243850 
End bp2244875 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content53% 
IMG OID643386244 
ProductDNA-binding transcriptional repressor PurR 
Protein accessionYP_002270728 
Protein GI209400108 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000002203 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.181026 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC 
GTGATCAACA AAACACGTTT CGTCGCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATC
AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG
TCTATCGGTT TGCTAGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCA
GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGTAATGC GTGGAACAAT
CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG
GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC
CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT
AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TTGAACGCGG TCACCGCGAA
ATCGGCGTTA TCCCCGGCCC GCTGGAACGT AACACTGGCG CAGGCCGCCT TGCCGGTTTT
ATGAAGGCGA TGGAAGAAGC AATGATCAAG GTGCCGGAAA GCTGGATTGT GCAGGGTGAC
TTTGAACCTG AATCCGGTTA TCGCGCCATG CAGCAAATCC TGTCGCAGCC GCATCGCCCT
ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA
ATGGGGCTGC GCGTCCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC
CGCTATTTTA CGCCGGCGCT GACAACAATC CATCAGCCAA AAGATTCGTT GGGTGAAACA
GCGTTCAACA TGTTGTTGGA TCGTATAGTC AACAAACGTG AAGAACCGCA GTCCATTGAA
GTGCATCCGC GCTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT
CGTTAA
 
Protein sequence
MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL
VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE
IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP
TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET
AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R