Gene SeHA_C1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1869 
Symbol 
ID6489875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1823993 
End bp1825534 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content54% 
IMG OID642742082 
ProductDNA-binding transcriptional regulator TyrR 
Protein accessionYP_002045727 
Protein GI194451846 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG3283] Transcriptional regulator of aromatic amino acids metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.148583 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTTGGTCTGA CCCGCGAATT ACTTGATTTA 
CTGGTGTTAC GTAGCATTGA TTTACGCGGA ATCGAGATTG ATCCCATTGG GCGAATTTAT
CTTAATTTTG CTGAGCTGGA ATTCACCGAC TTCAGCAGCC TGATGGCCGA AATCCGCCGT
ATTTCCGGCG TAACGGATGT CCGTACCGTT CCCTGGATGC CGTCCGAACG TGAACATCTG
GCCCTGAGCG CGCTGCTTGA GGCGTTGCCG GAGCCGGTGC TCTCATTGGA TATGAAGAGT
AAAGTGGAGA TGGCGAACCC GGCGAGTTGT CAACTTTTTG CCCAGAGCCA GGAGCGAATG
CGGCACCATA CCGCCGCACA ATTAATCAAC GGCTTCAATT TTCAGCGCTG GCTGGACGGT
AACCCGCAAA GCTCCCATAA CGAACATGTC GTGATCAACG GGCAAAACTT CCTGATGGAG
ATTACGCCGG TACATTTACA AAACGAAAAT GACGAATACG TGTTGACCGG GGCGGTCGTG
ATGTTGCGTT CCACGATTCG TATGGGGCAG CAGCTACAGA ATTTGTCCAC GCAGGATCTG
AGCGCGTTTA GTCAGATTAT TGCCGTGAGC GCAAAGATGA AGCACGTCGT TGAGCAGGCG
CGCAAACTGG CGATGCTCAG CGCGCCGCTG CTGATTACCG GCGATACGGG AACCGGCAAA
GATCTTTTCG CCTATGCCTG TCACCAGGCA AGCCCTCGTT CAGCGAAACC GTATCTGGCG
CTCAACTGCG CTTCAATCCC GGAAGATGCG GTAGAAAGCG AACTATTTGG CCATGCGCCG
GAAGGTAAAA AAGGTTTCTT TGAACAGGCG AATGGCGGTT CGGTGCTGCT GGATGAAATT
GGCGAAATGT CGCCGCGTAT GCAGGCGAAG CTGCTGCGTT TTCTCAACGA TGGTACGTTC
CGTCGCGTCG GCGAAGATCA CGAAATTCAT GTTGATGTCC GCGTTATCTG CGCCACGCAG
AAAAATCTGG TGGAGCTGGT GCAAAAAGGA CTGTTCCGCG AAGATCTCTA TTATCGACTT
AACGTTCTGA CGCTTAATTT GCCGCCGTTG CGCGATTGTC CGCAGGATAT TATGCCGTTG
ACCGAACTGT TCGTGGCGCG TTTTGCCGAC GAACAGGGCG TTCCGCGACC GAAACTGTCT
GCCGATCTGA GTACGGTCCT CACTCGTTAC GGCTGGCCGG GTAACGTTCG CCAGCTTAAA
AATGCGATTT ACCGGGCGCT GACGCAACTG GAAGGGTATG AGCTGCGTCC GCAGGATATC
CTGCTGCCTG ACTACGATGC CGCGACGGTG GCAGTCGGCG AGGATGCGAT GGAAGGCTCG
CTGGATGACA TTACCAGTCG TTTTGAACGT TCTGTCCTGA CCCAGCTTTA TCGTAGCTAT
CCGAGTACGC GTAAACTGGC GAAACGGTTG GGGGTATCGC ACACCGCGAT TGCCAATAAG
CTGCGTGAAT ATGGTCTGAG CCAGAAGAAG GGTGAAGAGT AG
 
Protein sequence
MRLEVFCEDR LGLTRELLDL LVLRSIDLRG IEIDPIGRIY LNFAELEFTD FSSLMAEIRR 
ISGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSLDMKS KVEMANPASC QLFAQSQERM
RHHTAAQLIN GFNFQRWLDG NPQSSHNEHV VINGQNFLME ITPVHLQNEN DEYVLTGAVV
MLRSTIRMGQ QLQNLSTQDL SAFSQIIAVS AKMKHVVEQA RKLAMLSAPL LITGDTGTGK
DLFAYACHQA SPRSAKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI
GEMSPRMQAK LLRFLNDGTF RRVGEDHEIH VDVRVICATQ KNLVELVQKG LFREDLYYRL
NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLS ADLSTVLTRY GWPGNVRQLK
NAIYRALTQL EGYELRPQDI LLPDYDAATV AVGEDAMEGS LDDITSRFER SVLTQLYRSY
PSTRKLAKRL GVSHTAIANK LREYGLSQKK GEE