Gene EcSMS35_1538 details

Gene Information

Locus tag: EcSMS35_1538
Symbol: purR
ID: 6147195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1523990
End bp: 1525015
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 53%
IMG OID: 641616416
Product: DNA-binding transcriptional repressor PurR
Protein accession: YP_001743594
Protein GI: 170683174
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
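
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp copied from the record above.
gene_len, protein_len = cds_lengths(1523990, 1525015)
print(gene_len, protein_len)  # 1026 341
```

Both values agree with the Gene Length and Protein Length fields of the record.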


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0686294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0781462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC 
GTGATCAACA AAACACGTTT CGTCGCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATT
AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG
TCTATCGGTT TGCTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCG
GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGCAATGC GTGGAACAAT
CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG
GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC
CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT
AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TAGAACGCGG TCACCGCGAA
ATCGGCGTTA TCCCTGGCCC GCTGGAACGT AACACCGGCG CAGGCCGCCT TGCCGGTTTT
ATGAAGGCGA TGGAAGAAGC GATGATCAAA GTGCCGGAAA GCTGGATTGT ACAGGGTGAC
TTTGAACCTG AATCCGGTTA TCGCGCCATG CAGCAAATAC TGTCGCAACC GCATCGCCCT
ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA
ATGGGTCTGC GCGTGCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC
CGCTATTTTA CGCCAGCGCT GACCACGATC CATCAGCCAA AAGATTCGCT GGGCGAAACG
GCGTTCAACA TGCTGTTGGA TCGTATCGTC AATAAACGCG AAGAACCGCA ATCCATTGAA
GTGCATCCGC GGTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT
CGTTAA
 
Protein sequence
MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL
VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE
IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP
TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET
AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R
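
The relationship between the two sequences above can be illustrated with a minimal sketch that translates the gene sequence and computes its GC fraction. It is not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, but this gene starts with ATG, so no special start handling is included.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for internal codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-letter blocks of the gene sequence above.
first_blocks = "ATGGCAACAA TAAAAGATGT AGCGAAACGA"
print(translate(first_blocks))  # MATIKDVAKR
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field of the record.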