Gene SNSL254_A0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0407 
SymbolprpR 
ID6483472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp420544 
End bp422169 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content55% 
IMG OID642735831 
Productpropionate catabolism operon regulatory protein PrpR 
Protein accessionYP_002039605 
Protein GI194442843 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1221] Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain 
TIGRFAM ID[TIGR02329] propionate catabolism operon regulatory protein PrpR 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000000136332 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACTG CTCACAGCGC TCCGCGCGAT AATAGCGATA AACCGGTGAT CTGGACGGTC 
TCCGTAACGC GTCTGTTCGA ACTGTTTCGG GATATCAGCC TGGAATTCGA TCATCTGGCG
ACCATCACGC CTATTCAACT CGGCTTTGAA AAGGCGGTGA CCTACATTCG CAAAAAACTG
GCGACCGAGC GCTGCGACGC GATTATCGCG GCGGGTTCGA ATGGGGCCTA TTTAAAAAGT
CGCCTGTCAA TACCGGTGAT CCTCATCAAG CCCAGCGGAT TCGATGTATT ACAGGCGCTG
GCGAAAGCGG GAAAGCTCAC CTCGTCTATC GGTATCGTGA CCTATCAGGA GACCATTCCG
GCTTTACTTG CCTTTCAGAA AACGTTTCAC CTCCGTCTTG AACAGCGAAG CTATGTCACC
GAAGAGGACG CGCGCGGGCA AATTAACGAA CTTAAGGCCA ACGGTATTGA AGCCGTCGTC
GGCGCGGGAT TAATTACCGA TCTGGCGGAA GAAGCGGGAA TGACCGCCAT CTTTATTTAT
TCCGCGGCGA CCGTTCGTCA GGCTTTCCAT GATGCGCTGG ATATGACCCG TCTGACACGG
CGACAGCGCG TGGATTACTC ATCCGGCAAG GGATTACAAA CCCGGTATGA ACTGGGCGAT
ATACGCGGCC AGTCGCCGCA AATGGAGCAG CTCCGCCAGA CGATTACGCT CTATGCCCGC
TCCCGTGCGG CAGTGCTGAT TCAGGGGGAA ACAGGGACCG GAAAAGAGCT GGCGGCGCAG
GCGATTCACC AGACGTTCTT TCACCGCCAG CCTCACCGTC AGAATAAGCC ATCCCCTCCC
TTTGTCGCCG TCAATTGCGG CGCGATTACC GAGTCGTTGC TGGAAGCGGA ACTGTTTGGT
TATGAAGAGG GCGCGTTTAC CGGTTCACGA CGAGGAGGCC GGGCGGGGCT GTTTGAAATC
GCACACGGCG GCACGCTGTT TCTGGATGAA ATCGGCGAAA TGCCCTTGCC GTTACAAACC
CGACTCTTGC GCGTACTGGA GGAAAAAGCC GTCACTCGCG TTGGCGGACA CCAGCCGATC
CCGGTGGAGG TCCGGGTGAT CAGCGCTACG CATTGCGATC TGGATCGGGA AATAATGCAA
GGACGTTTTC GCCCCGATCT CTTTTATCGC CTGAGTATTC TGCGTCTGAC GCTTCCTCCT
TTGCGCGAGC GGCAGGCTGA TATTTTGCCG CTGGCGGAAA GCTTTTTAAA ACAGTCGCTG
GCAGCGATGG AAATTCCGTT TACCGAATCG ATACGTCATG GATTGACACA GTGTCAGCCG
CTTTTGCTGG CCTGGCGCTG GCCCGGTAAT ATTCGCGAAC TGCGTAATAT GATGGAGCGC
CTGGCGCTTT TTTTAAGCGT CGATCCCGCG CCAACGCTGG ACAGGCAATT TATGCGGCAG
TTATTACCTG AGCTTATGGT GAACACAGCA GAGCTGACGC CTTCAACCGT GGATGCGCAC
ACGTTACAGG ATGTACTGGC GCGCTTTAAG GGCGATAAGA CCGCGGCGGC GCGTTATCTG
GGGATTAGCC GCACCACTCT GTGGCGTCGT TTAAAAGCAG GAGCCAAAGA CCAGTCGGAT
AATTAA
 
Protein sequence
MTTAHSAPRD NSDKPVIWTV SVTRLFELFR DISLEFDHLA TITPIQLGFE KAVTYIRKKL 
ATERCDAIIA AGSNGAYLKS RLSIPVILIK PSGFDVLQAL AKAGKLTSSI GIVTYQETIP
ALLAFQKTFH LRLEQRSYVT EEDARGQINE LKANGIEAVV GAGLITDLAE EAGMTAIFIY
SAATVRQAFH DALDMTRLTR RQRVDYSSGK GLQTRYELGD IRGQSPQMEQ LRQTITLYAR
SRAAVLIQGE TGTGKELAAQ AIHQTFFHRQ PHRQNKPSPP FVAVNCGAIT ESLLEAELFG
YEEGAFTGSR RGGRAGLFEI AHGGTLFLDE IGEMPLPLQT RLLRVLEEKA VTRVGGHQPI
PVEVRVISAT HCDLDREIMQ GRFRPDLFYR LSILRLTLPP LRERQADILP LAESFLKQSL
AAMEIPFTES IRHGLTQCQP LLLAWRWPGN IRELRNMMER LALFLSVDPA PTLDRQFMRQ
LLPELMVNTA ELTPSTVDAH TLQDVLARFK GDKTAAARYL GISRTTLWRR LKAGAKDQSD
N