Gene EcE24377A_1873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1873 
SymbolpurR 
ID5585976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1863662 
End bp1864687 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content54% 
IMG OID640925548 
ProductDNA-binding transcriptional repressor PurR 
Protein accessionYP_001462953 
Protein GI157157434 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000326079 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC 
GTAATCAACA AAACACGTTT CGTCTCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATT
AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG
TCTATCGGTT TGCTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCG
GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGTAATGC GTGGAACAAT
CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG
GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC
CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT
AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TTGAACGCGG TCACCGCGAA
ATCGGCGTCA TCCCCGGCCC GCTGGAACGT AACACCGGCG CAGGCCGCCT TGCCGGTTTT
ATGAAGGCGA TGGAAGAAGC GATGATCAAG GTGCCGGAAA GCTGGATTGT TCAGGGTGAC
TTTGAACCCG AATCTGGTTA TCGCGCCATG CAGCAAATAC TGTCGCAGCC GCATCGCCCT
ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA
ATGGGTCTGC GCGTCCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC
CGCTATTTTA CGCCGGCGCT GACCACGATC CACCAGCCAA AAGATTCGCT GGGTGAAACA
GCGTTCAACA TGCTGTTGGA TCGTATCGTC AACAAACGTG AAGAACCGCA GTCCATTGAA
GTGCATCCGC GCTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT
CGTTAA
 
Protein sequence
MATIKDVAKR ANVSTTTVSH VINKTRFVSE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL
VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE
IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP
TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET
AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R