Gene SeHA_C1563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1563 
SymbolpurR 
ID6489086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1513435 
End bp1514460 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content55% 
IMG OID642741786 
ProductDNA-binding transcriptional repressor PurR 
Protein accessionYP_002045431 
Protein GI194447411 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00208837 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA TTAAAGATGT AGCGAAACGG GCAAACGTTT CCACTACAAC TGTATCACAC 
GTAATCAACA AAACGCGTTT TGTCGCTGAA GAAACGCGTA ACGCGGTCTG GGCGGCAATT
AAAGAGCTGC ACTACTCTCC CAGCGCCGTC GCGCGTAGCC TGAAGGTTAA CCATACCAAG
TCGATAGGCT TACTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAAAT TATCGAGGCA
GTTGAAAAAA ACTGTTTCCA GAAAGGCTAT ACGCTGATTT TAGGCAACGC CTGGAATAAC
CTGGAAAAAC AGCGCGCCTA CCTGTCCATG ATGGCGCAAA AGCGCGTGGA TGGCCTGCTG
GTGATGTGTT CTGAGTATCC AGAACCTCTG CTTTCCATGC TGGAAGAGTA TCGCCATATT
CCGATGGTGG TGATGGACTG GGGTGAAGCG AAGGCCGATT TTACCGACAC GGTGATTGAT
AACGCCTTTG CAGGCGGCTA TATGGCGGGT CGTTATCTGG TTGAGCGCGG CCACCGGGAT
ATCGGCGTTA TTCCCGGCCC GCTGGAGCGC AACACCGGCG CGGGGCGGCT GGCAGGCTTT
ATGAAAGCCA TGGAGGAGGC GCTGATCAAC GTGCCGGACA ACTGGATTGT TCAGGGCGAC
TTCGAGCCGG AGTCCGGTTA CCACGCGATG CAGCAAATCT TATCGCAGTC ACATCGCCCT
ACCGCCGTTT TCTGCGGCGG CGATATTATG GCGATGGGCG CGCTTTGCGC GGCTGACGAA
ATGGGGCTTC GCGTACCGCA GGACGTTTCG GTGATCGGTT ATGACAATGT GCGTAACGCC
CGTTACTTTA CCCCGGCGCT GACGACGATT CACCAGCCCA AAGACTCTTT AGGCGAAACC
GCATTTAATA TGCTACTGGA TCGCATCGTC AATAAGCGTG AAGAGTCACA GTCTATTGAA
GTTCATCCAC GCCTGGTTGA GCGTCGCTCG GTCGCTGACG GCCCGTTCCG CGACTATCGG
CGTTAA
 
Protein sequence
MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL
VMCSEYPEPL LSMLEEYRHI PMVVMDWGEA KADFTDTVID NAFAGGYMAG RYLVERGHRD
IGVIPGPLER NTGAGRLAGF MKAMEEALIN VPDNWIVQGD FEPESGYHAM QQILSQSHRP
TAVFCGGDIM AMGALCAADE MGLRVPQDVS VIGYDNVRNA RYFTPALTTI HQPKDSLGET
AFNMLLDRIV NKREESQSIE VHPRLVERRS VADGPFRDYR R