Gene SNSL254_A0098 details

Gene Information

Locus tag: SNSL254_A0098
Symbol: imp
ID: 6484537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 105775
End bp: 108135
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 53%
IMG OID: 642735541
Product: organic solvent tolerance protein
Protein accession: YP_002039323
Protein GI: 194445274
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
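
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the record does not state either point, but both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(105775, 108135)
print(length)                     # 2361
print(protein_length_aa(length))  # 786
```

Both values agree with the Gene Length and Protein Length fields of the record.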


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00167743
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.0233205
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC GTATTCCCAC TCTTCTGGCC ACCATGATCG CCAGCGCCCT TTATAGTCAT 
CAGGGGCTGG CAGCCGATCT CGCCTCACAG TGTATGTTGG GCGTGCCGAG CTACGATCGT
CCTCTGGTAA AAGGCGATAC CAACGATCTG CCGGTTACTA TCAATGCCGA TAACGCTAAA
GGTAACTACC CGGACGATGC CGTTTTTACC GGCAACGTGG ACATTATGCA GGGGAATAGC
CGCCTGCAAG CGGATGAAGT GCAGCTTCAT CAGAAGCAGG CGGAAGGTCA GCCGGAACCT
GTACGCACCG TCGATGCGCT GGGTAATGTG CATTATGATG ATAATCAGGT CATCCTTAAA
GGGCCGAAGG GCTGGGCGAA CCTGAACACC AAAGACACGA ACGTCTGGGA AGGCGATTAC
CAGATGGTGG GCCGTCAGGG GCGCGGTAAA GCCGATCTCA TGAAGCAGCG TGGCGAAAAC
CGCTATACCA TTCTGGAAAA CGGCAGCTTT ACCTCCTGTC TGCCTGGCTC CGATACCTGG
AGCGTGGTGG GGAGTGAAGT CATCCATGAC CGTGAAGAAC AGGTTGCGGA GATCTGGAAC
GCCCGGTTTA AAGTAGGTCC GGTTCCGATC TTTTATAGCC CCTATTTACA GCTCCCCGTC
GGTGACAAAC GTCGCTCAGG TTTCCTGATC CCGAACGCGA AATACACGAC CAAGAACTAT
TTCGAGTTCT ACTTACCGTA TTACTGGAAC ATCGCGCCCA ATATGGACGC CACCATCACC
CCGCACTATA TGCACCGCCG CGGCAATATT ATGTGGGAGA ACGAATTCCG TTATCTCACG
CAGGCAGGCG CCGGGTTGAT GGAATTAGAT TATCTGCCTT CTGATAAAGT CTACGAGGAC
GATCATCCCA AAGAGGGCGA TAAGCACCGC TGGTTATTCT ACTGGCAGCA CTCAGGCGTG
ATGGATCAGG TGTGGCGTTT TAACGTCGAT TACACCAAAG TCAGCGACTC CAGCTACTTT
AACGATTTCG ACAGTAAGTA CGGTTCCAGT ACCGACGGCT ACGCAACGCA GAAATTCAGC
GTCGGCTACG CCGTACAAAA CTTTGACGCT ACGGTGTCGA CCAAACAATT CCAGGTCTTT
AACGATCAAA ACACCAGCAG CTACTCTGCG GAGCCGCAGT TAGACGTTAA CTACTACCAT
AACGATCTCG GGCCGTTTGA TACCCGGATT TACGGCCAGG CGGTACATTT CGTCAACACC
AAAGACAATA TGCCGGAAGC GACCCGCGTC CACCTGGAGC CAACCATTAA TTTGCCGCTC
TCCAACCGCT GGGGCAGCCT GAACACCGAA GCGAAGCTGA TGGCGACGCA CTATCAGCAA
ACGAATCTGG ACAGCTATAA CAGCGATCCA AACAATAAAA ATAAGCTGGA AGATTCGGTT
AACCGCGTCA TGCCGCAGTT TAAAGTCGAC GGTAAGCTCA TCTTCGAACG CGATATGGCG
ATGCTGGCGC CGGGGTATAC CCAGACGCTG GAACCACGCG TGCAGTACCT GTATGTGCCG
TACCGCGACC AGAGCGGCAT CTATAACTAC GATTCTTCTT TGCTGCAATC CGACTATAAC
GGCCTGTTCC GCGACCGCAC TTATGGCGGT CTCGACCGTA TTGCTTCCGC CAACCAGGTC
ACGACCGGCG TCACAACACG CATTTATGAT GATGCCGCCG TTGAACGTTT TAACGTTTCT
GTTGGTCAAA TCTACTATTT CACGGAGTCT CGCACCGGCG ATGACAACAT TAAATGGGAG
AATGACGACA AAACCGGTTC GCTGGTTTGG GCAGGCGACA CTTACTGGCG TATTTCAGAA
CGCTGGGGGC TGCGTAGCGG AGTGCAGTAC GATACCCGTC TGGATAGCGT CGCTACCAGC
AGCAGCAGCC TCGAATACCG TCGGGATCAG GATCGTCTGG TACAGTTGAA CTACCGCTAT
GCCAGCCCGG AATATATTCA GGCTACGTTG CCTTCGTATT ATTCCACGGC AGAGCAGTAT
AAAAACGGCA TCAACCAGGT GGGCGCGGTG GCAAGTTGGC CGATTGCCGA TCGCTGGTCG
ATTGTCGGCG CGTACTACTT CGATACCAAT TCGAGCAAAC CTGCAGACCA GATGCTCGGC
TTGCAGTACA ACTCTTGCTG CTATGCGATC CGCGTCGGAT ACGAACGTAA GCTGAACGGT
TGGGATAACG ATAAACAACA CGCGATTTAT GATAACGCGA TTGGCTTCAA CATTGAGCTG
CGCGGTTTGA GCTCTAACTA CGGCCTCGGC ACGCAAGAAA TGTTGCGTTC GAACATTCTG
CCGTACCAAA GCTCTATGTA A
 
Protein sequence
MKKRIPTLLA TMIASALYSH QGLAADLASQ CMLGVPSYDR PLVKGDTNDL PVTINADNAK 
GNYPDDAVFT GNVDIMQGNS RLQADEVQLH QKQAEGQPEP VRTVDALGNV HYDDNQVILK
GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILENGSF TSCLPGSDTW
SVVGSEVIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTKNY
FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLT QAGAGLMELD YLPSDKVYED
DHPKEGDKHR WLFYWQHSGV MDQVWRFNVD YTKVSDSSYF NDFDSKYGSS TDGYATQKFS
VGYAVQNFDA TVSTKQFQVF NDQNTSSYSA EPQLDVNYYH NDLGPFDTRI YGQAVHFVNT
KDNMPEATRV HLEPTINLPL SNRWGSLNTE AKLMATHYQQ TNLDSYNSDP NNKNKLEDSV
NRVMPQFKVD GKLIFERDMA MLAPGYTQTL EPRVQYLYVP YRDQSGIYNY DSSLLQSDYN
GLFRDRTYGG LDRIASANQV TTGVTTRIYD DAAVERFNVS VGQIYYFTES RTGDDNIKWE
NDDKTGSLVW AGDTYWRISE RWGLRSGVQY DTRLDSVATS SSSLEYRRDQ DRLVQLNYRY
ASPEYIQATL PSYYSTAEQY KNGINQVGAV ASWPIADRWS IVGAYYFDTN SSKPADQMLG
LQYNSCCYAI RVGYERKLNG WDNDKQHAIY DNAIGFNIEL RGLSSNYGLG TQEMLRSNIL
PYQSSM
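
The protein sequence can be related to the gene sequence by translating codons under translation table 11. The following is a minimal sketch, not the database's own procedure: it builds the codon table from the NCBI-style compact amino-acid string (table 11 shares its codon assignments with the standard code and differs in the permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to contain only A, C, G, T and whitespace.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAAAC GTATTCCCAC TCTTCTGGCC"))  # MKKRIPTLLA
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` should, under the same assumptions, reproduce the whole protein.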