Gene SeD_A0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0097 
Symbolimp 
ID6873465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp101309 
End bp103669 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content53% 
IMG OID642783351 
Productorganic solvent tolerance protein 
Protein accessionYP_002214045 
Protein GI198242523 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0159547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.564558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC GTATTCCCAC TCTTCTGGCC ACCATGATCG CCAGCGCCCT TTATAGTCAT 
CAGGGGCTGG CAGCCGATCT CGCCTCACAG TGTATGTTGG GCGTGCCGAG CTACGATCGT
CCTCTGGTAA AAGGCGATAC CAACGATCTG CCGGTTACTA TCAATGCCGA TAACGCTAAA
GGTAACTACC CGGACGATGC CGTTTTTACC GGCAACGTGG ACATTATGCA GGGGAATAGC
CGCCTGCAAG CGGATGAAGT GCAGCTTCAT CAGAAGCAGG CGGAAGGTCA GCCGGAACCT
GTACGCACCG TCGATGCGCT GGGTAATGTG CATTATGATG ATAATCAGGT CATCCTTAAA
GGGCCGAAGG GCTGGGCGAA CCTGAACACC AAAGACACGA ACGTCTGGGA AGGCGATTAC
CAGATGGTGG GCCGTCAGGG GCGCGGTAAA GCCGATCTCA TGAAGCAGCG CGGCGAAAAC
CGTTATACCA TTCTGGAAAA CGGCAGCTTT ACCTCCTGTC TGCCTGGCTC CGATACCTGG
AGCGTGGTGG GGAGTGAAGT CATCCATGAC CGTGAAGAAC AGGTTGCGGA GATCTGGAAC
GCCCGGTTTA AAGTAGGTCC GGTTCCGATC TTTTATAGCC CCTATTTACA GCTACCCGTC
GGTGACAAAC GTCGCTCAGG TTTCCTGATC CCGAACGCGA AATACACGAC CAAGAACTAT
TTCGAGTTCT ACTTACCGTA TTACTGGAAC ATCGCGCCCA ATATGGACGC CACCATCACC
CCGCACTATA TGCACCGCCG CGGCAATATT ATGTGGGAGA ACGAATTCCG TTATCTCACG
CAGGCAGGCG CCGGGTTGAT GGAATTAGAT TATCTGCCTT CTGATAAAGT CTACGAGGAC
GATCATCCCA AAGAGGGCGA TAAGCACCGC TGGTTATTCT ACTGGCAGCA CTCAGGCGTG
ATGGATCAGG TGTGGCGTTT TAACGTCGAT TACACCAAAG TCAGCGACTC CAGCTACTTT
AACGATTTCG ACAGTAAGTA CGGTTCCAGT ACCGACGGTT ACGCAACGCA GAAATTCAGC
GTCGGCTACG CCGTACAAAA CTTTGACGCT ACGGTGTCGA CCAAACAATT CCAGGTCTTT
AACGATCAAA ACACCAGCAG CTATTCAGCG GAGCCGCAGT TAGACGTTAA CTACTACCAT
AACGATCTCG GCCCGTTTGA TACCCGGATT TACGGCCAGG CGGTGCACTT TGTTAACACC
AAAGACAATA TGCCGGAAGC AACCCGCGTC CACCTGGAGC CAACCATCAA TTTGCCGCTC
TCCAACCGCT GGGGCAGCCT GAACACCGAA GCGAAGCTGA TGGCGACGCA CTATCAGCAA
ACGAATCTGG ACAGCTATAA CAGCGATCCA AACAATAAAA ATAAGCTGGA AGATTCGGTT
AACCGCGTCA TGCCGCAGTT TAAAGTCGAC GGTAAGCTCA TCTTCGAACG CGATATGGCG
ATGCTGGCGC CGGGGTATAC CCAGACGCTG GAACCACGCG TGCAGTACCT GTATGTGCCG
TACCGCGACC AGAGCGGCAT CTATAACTAC GATTCTTCTT TGCTGCAATC CGACTATAAC
GGCCTGTTCC GCGACCGCAC TTATGGCGGT CTCGACCGTA TTGCTTCCGC CAACCAGGTC
ACGACCGGCG TCACAACACG CATTTATGAT GATGCCGCCG TTGAACGTTT TAACGTTTCT
GTTGGTCAAA TCTACTATTT CACGGAGTCT CGCACCGGCG ATGACAACAT TAAATGGGAG
AATGACGACA AAACCGGTTC GCTGGTTTGG GCAGGCGACA CTTACTGGCG TATTTCAGAA
CGCTGGGGGC TGCGTAGCGG AGTGCAGTAC GATACCCGTC TGGATAGCGT CGCTACCAGC
AGCAGCAGCC TCGAATACCG TCGGGATCAG GATCGTCTGG TACAGTTGAA CTACCGCTAT
GCCAGCCCGG AATATATTCA GGCTACGTCG CCTTCGTATT ATTCCACGGC AGAGCAGTAT
AAAAACGGCA TCAACCAGGT GGGCGCGGTG GCAAGTTGGC CGATTGCCGA TCGCTGGTCG
ATTGTCGGCG CGTACTACTT CGATACCAAT TCGAGCAAAC CTGCAGACCA GATGCTCGGC
TTGCAGTACA ACTCTTGCTG CTATGCGATC CGCGTCGGAT ACGAACGTAA GCTGAACGGT
TGGGATAACG ATAAACAACA CGCGATTTAT GATAACGCGA TTGGCTTCAA CATTGAGCTG
CGCGGTTTGA GCTCTAACTA CGGCCTCGGC ACGCAAGAAA TGTTGCGTTC GAACATTCTG
CCGTACCAAA GCTCTATGTA A
 
Protein sequence
MKKRIPTLLA TMIASALYSH QGLAADLASQ CMLGVPSYDR PLVKGDTNDL PVTINADNAK 
GNYPDDAVFT GNVDIMQGNS RLQADEVQLH QKQAEGQPEP VRTVDALGNV HYDDNQVILK
GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILENGSF TSCLPGSDTW
SVVGSEVIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTKNY
FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLT QAGAGLMELD YLPSDKVYED
DHPKEGDKHR WLFYWQHSGV MDQVWRFNVD YTKVSDSSYF NDFDSKYGSS TDGYATQKFS
VGYAVQNFDA TVSTKQFQVF NDQNTSSYSA EPQLDVNYYH NDLGPFDTRI YGQAVHFVNT
KDNMPEATRV HLEPTINLPL SNRWGSLNTE AKLMATHYQQ TNLDSYNSDP NNKNKLEDSV
NRVMPQFKVD GKLIFERDMA MLAPGYTQTL EPRVQYLYVP YRDQSGIYNY DSSLLQSDYN
GLFRDRTYGG LDRIASANQV TTGVTTRIYD DAAVERFNVS VGQIYYFTES RTGDDNIKWE
NDDKTGSLVW AGDTYWRISE RWGLRSGVQY DTRLDSVATS SSSLEYRRDQ DRLVQLNYRY
ASPEYIQATS PSYYSTAEQY KNGINQVGAV ASWPIADRWS IVGAYYFDTN SSKPADQMLG
LQYNSCCYAI RVGYERKLNG WDNDKQHAIY DNAIGFNIEL RGLSSNYGLG TQEMLRSNIL
PYQSSM