Gene SNSL254_A1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1104 
Symbol 
ID6485271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1114399 
End bp1115652 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID642736507 
Productparaquat-inducible protein A 
Protein accessionYP_002040266 
Protein GI194446385 
COG category[S] Function unknown 
COG ID[COG2995] Uncharacterized paraquat-inducible protein A 
TIGRFAM ID[TIGR00155] integral membrane protein, PqiA family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.163733 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGAAC ATCACCATGC TGCGAAGCAC ATATTGTGCC CGCAATGTGA CATGCTGGTG 
GCTTTACCCC GCCTCTCGCA TGGGCAAAAA GCAGCATGTC CACGATGCGG CGCAACGTTA
ACGACCGAGT GGGACGCGCC CCGGCAGCGT CCTACCGCCT ATGCGTTAGC GGCACTATTT
ATGTTGCTGC TTTCTAACCT TTTTCCCTTC GTGAATATGA ATGTCGCGGG CGTCACCAGC
GAAGTCACGC TTCTGGAAAT CCCCGGCGTC ATGTTTTCTG AAGATTACGC CAGCCTCGGC
ACGTTCTTTT TATTATTTGT CCAACTGGTG CCGGCATTTT GCCTGGTGAC CATTCTATTA
CTTGTCAACC GCGCCAGTCT GCCGTTGTCG GTAAAAAAAA CGCTGGCAAG GATCTTTTTC
CTCCTCAAAT CGTGGGGAAT GGCGGAAATA TTTCTCGCCG GGGTGTTGGT CAGTTTTGTT
AAGCTGATGG CCTACGGCGA TATCGGCATC GGCAGCAGTT TTATTCCCTG GTGCTTATTT
TGTCTCGTTC AACTGCGGGC GTTCCAGTGT GTCGATCGCC GCTGGTTGTG GGATGATATC
GCCCCGCAGC CTGCACTGGC GCAGCCGTTA ACGCCGGGGA TTACCGGTAT CCGGCAGTCT
TTGCGTTCTT GCGCCTGCTG TACGGCGATC CTGCCTGCGG AGAGTCTCGT CTGTCCGCGT
TGCCATACGA AAGGCTATGT CCGGCGTAAA AACAGTTTGC AGTGGACGCT GGCGTTATTA
TTTACCTCGA TCATGCTTTA TCTGCCCGCC AATATTTTGC CGATTATGAT TACCGACTTA
CTGGGCTCGA AAATGCCGTC GACCATTCTG GCTGGCGTGA TTTTGCTGTG GAGCGAGGGG
TCTTATCCGG TGGCGGCGGT TATCTTTCTC GCCAGTATTA TGGTGCCGAC GCTAAAAATG
ATCGCCATTG CCTGGCTTTG TTGGGATGCG AAAGGCCACG GTAAGCGCGA CAGTGAACGG
ATGCATTTTA TTTATGAAGT AGTGGAGTTT GTGGGGCGCT GGTCAATGAT TGATGTCTTT
GTCATTGCCG TACTCTCTGC GCTGGTGCGT ATGGGAGGGT TAATGAATAT TTATCCTGCG
ATGGGTGCGT TGATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC GATGACCTTT
GATCCGCGTC TGTCGTGGGA TCGTGAGTAC GAACCAGGCC ATGAGGAGTC CTGA
 
Protein sequence
MCEHHHAAKH ILCPQCDMLV ALPRLSHGQK AACPRCGATL TTEWDAPRQR PTAYALAALF 
MLLLSNLFPF VNMNVAGVTS EVTLLEIPGV MFSEDYASLG TFFLLFVQLV PAFCLVTILL
LVNRASLPLS VKKTLARIFF LLKSWGMAEI FLAGVLVSFV KLMAYGDIGI GSSFIPWCLF
CLVQLRAFQC VDRRWLWDDI APQPALAQPL TPGITGIRQS LRSCACCTAI LPAESLVCPR
CHTKGYVRRK NSLQWTLALL FTSIMLYLPA NILPIMITDL LGSKMPSTIL AGVILLWSEG
SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHFIYEVVEF VGRWSMIDVF
VIAVLSALVR MGGLMNIYPA MGALMFALVV IMTMFSAMTF DPRLSWDREY EPGHEES