Gene SbBS512_E1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1857 
SymbolpurR 
ID6268494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1699013 
End bp1700038 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content54% 
IMG OID641725921 
ProductDNA-binding transcriptional repressor PurR 
Protein accessionYP_001880419 
Protein GI187731351 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000046163 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC 
GTAATCAACA AAACACGTTT CGTCGCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATT
AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG
TCTATCGGTT TGCTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCG
GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGTAATGC GTGGAACAAT
CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG
GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC
CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT
AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TTGAACGCGG TCACCGCGAA
ATCGGCGTCA TCCCCGGCCC GCTGGAACGT AACACCGGCG CAGGCCGCCT TGCCGGTTTT
ATGAAGGCGA TGGAAGAAGC GATGATCAAG GTGCCGGAAA GCTGGATTGT TCAGGGTGAC
TTTGAACCCG AATCTGGTTA TCGCGCCATG CAGCAAATAC TGTCGCAGCC GCATCGCCCT
ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA
ATGGGTCTGC GCGTCCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC
CGCTATTTTA CGCCGGCGCT GACCACGATC CACCAGCCAA AAGATTCGCT GGGTGAAACA
GCGTTCAACA TGCTGTTGGA TCGTATCGTC AACAAACGTG AAGAACCGCA GTCCATTGAA
GTGCATCCGC GCTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT
CGTTAA
 
Protein sequence
MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL
VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE
IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP
TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET
AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R