Gene SNSL254_A1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1807 
Symbol 
ID6486324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1772536 
End bp1774077 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content54% 
IMG OID642737183 
ProductDNA-binding transcriptional regulator TyrR 
Protein accessionYP_002040935 
Protein GI194443344 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG3283] Transcriptional regulator of aromatic amino acids metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0010497 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTTGGTCTGA CCCGCGAATT ACTTGATTTA 
CTGGTGTTAC GTAGCATTGA TTTACGCGGA ATCGAGATTG ATCCCATTGG GCGAATTTAT
CTTAATTTTG CTGAGCTGGA ATTCACCGAC TTCAGCAGCC TGATGGCCGA AATCCGCCGT
ATTTCCGGCG TAACGGATGT CCGTACCGTT CCCTGGATGC CGTCCGAACG TGAACATCTG
GCCCTGAGCG CGCTGCTTGA GGCGTTGCCG GAGCCGGTGC TCTCATTGGA TATGAAGAGT
AAAGTGGAGA TGGCGAACCC GGCGAGTTGT CAGCTTTTTG CCCAGAGCCA GGAGCGAATG
CGGCACCATA CCGCCGCACA ATTAATCAAC GGCTTCAATT TTCAGCGCTG GCTGGACGGT
AACCCGCAAA GCTCCCATAA CGAACATGTC GTGATCAACG GGCAAAACTT CCTGATGGAG
ATTACGCCGG TACATTTACA AAACGAAAAT GACGAATACG TGTTGACCGG GGCGGTCGTG
ATGTTGCGTT CCACGATTCG TATGGGGCAG CAGCTACAGA ATTTGTCCAC GCAGGATCTG
AGCGCGTTTA GTCAGATTAT TGCCGTGAGC GCAAAGATGA AGCACGTCGT TGAGCAGGCG
CGCAAACTGG CGATGCTCAG CGCGCCGCTG CTGATTACCG GCGATACGGG AACCGGCAAA
GATCTTTTCG CCTATGCCTG TCACCAGGCA AGCCCTCGTT CAGCGAAACC GTATCTGGCG
CTCAACTGCG CTTCAATCCC GGAAGATGCG GTAGAAAGCG AACTATTTGG CCATGCGCCG
GAAGGTAAAA AAGGCTTTTT TGAACAGGCG AATGGCGGTT CGGTGCTGCT GGATGAAATT
GGCGAAATGT CGCCGCGTAT GCAGGCGAAG CTGCTGCGTT TTCTCAACGA TGGTACGTTC
CGTCGCGTCG GCGAAGATCA CGAAATTCAT GTTGATGTCC GCGTTATCTG CGCCACGCAG
AAAAATCTGG TGGAGCTGGT GCAAAAAGGA CTGTTCCGCG AAGATCTCTA TTATCGACTT
AACGTTCTGA CGCTTAATTT GCCGCCGTTG CGCGATTGCC CGCAGGATAT TATGCCGTTG
ACCGAACTGT TCGTGGCGCG TTTTGCCGAC GAACAGGGCG TTCCGCGACC GAAACTGTCT
GCCGATCTGA GTACGGTCCT CACTCGTTAC GGCTGGCCGG GTAACGTTCG CCAGCTTAAA
AATGCGATAT ACCGGGCGCT GACGCAACTG GAAGGGTATG AGCTGCGTCC GCAGGATATC
CTGCTGCCTG ACTACGATGC CGCGACGGTG GCAGTCGGCG AGGATGCGAT GGAAGGTTCG
CTGGATGACA TTACCAGTCG TTTTGAACGT TCTGTCCTTA CTCAGCTTTA TCGTAGCTAT
CCGAGTACGC GTAAACTGGC GAAACGGTTG GGGGTATCGC ATACCGCGAT TGCCAATAAG
CTGCGTGAAT ATGGTCTGAG CCAGAAGAAG GGTGAAGAGT AG
 
Protein sequence
MRLEVFCEDR LGLTRELLDL LVLRSIDLRG IEIDPIGRIY LNFAELEFTD FSSLMAEIRR 
ISGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSLDMKS KVEMANPASC QLFAQSQERM
RHHTAAQLIN GFNFQRWLDG NPQSSHNEHV VINGQNFLME ITPVHLQNEN DEYVLTGAVV
MLRSTIRMGQ QLQNLSTQDL SAFSQIIAVS AKMKHVVEQA RKLAMLSAPL LITGDTGTGK
DLFAYACHQA SPRSAKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI
GEMSPRMQAK LLRFLNDGTF RRVGEDHEIH VDVRVICATQ KNLVELVQKG LFREDLYYRL
NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLS ADLSTVLTRY GWPGNVRQLK
NAIYRALTQL EGYELRPQDI LLPDYDAATV AVGEDAMEGS LDDITSRFER SVLTQLYRSY
PSTRKLAKRL GVSHTAIANK LREYGLSQKK GEE