Gene SeAg_B0102 details

Gene Information

Locus tag: SeAg_B0102
Symbol: imp
ID: 6796635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 105767
End bp: 108127
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 53%
IMG OID: 642774413
Product: organic solvent tolerance protein
Protein accession: YP_002145077
Protein GI: 197250730
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
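
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 105767
end_bp = 108127

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (2361 bp) and Protein Length (786 aa).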


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000015449
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC GTATTCCCAC TCTTCTGGCC ACCATGATCG CCAGCGCCCT TTATAGTCAT 
CAGGGGCTGG CAGCCGATCT CGCCTCACAG TGTATGTTGG GCGTGCCGAG CTACGATCGT
CCTCTGGTAA AAGGCGATAC CAACGATCTG CCGGTTACTA TCAATGCCGA TAACGCTAAA
GGTAACTACC CGGACGATGC CGTTTTTACC GGCAACGTGG ACATTATGCA GGGGAATAGC
CGCCTGCAAG CGGATGAAGT GCAGCTTCAT CAGAAGCAGG CGGAAGGTCA GCCGGAACCT
GTACGCACCG TCGATGCGCT GGGTAATGTG CATTATGATG ACAATCAGGT CATCCTTAAA
GGGCCGAAGG GCTGGGCGAA CCTGAACACC AAAGACACGA ACGTCTGGGA AGGCGATTAC
CAGATGGTGG GCCGTCAGGG GCGCGGTAAA GCCGATCTCA TGAAGCAGCG CGGCGAAAAC
CGTTATACCA TTCTGGAAAA CGGCAGCTTT ACCTCCTGTC TGCCTGGCTC CGATACCTGG
AGCGTGGTGG GGAGTGAAGT CATCCATGAC CGTGAAGAAC AGGTTGCGGA GATCTGGAAC
GCCCGGTTTA AAGTAGGTCC GGTTCCGATC TTTTATAGCC CCTATTTACA GCTACCCGTC
GGTGACAAAC GTCGCTCAGG TTTCCTGATC CCGAACGCGA AATACACGAC CAAGAACTAT
TTCGAGTTCT ACTTACCGTA TTACTGGAAC ATCGCGCCCA ATATGGACGC CACCATCACC
CCGCACTATA TGCACCGCCG CGGCAATATT ATGTGGGAGA ACGAATTCCG TTATCTCACG
CAGGCAGGCG AGGGAGTGAT GGAATTAGAT TATCTGCCTT CTGATAAAGT CTACGAGGAC
GATCACCCCA AAGAGGGCGA TAAGCACCGC TGGTTATTCT ACTGGCAGCA CTCAGGCGTG
ATGGATCAGG TGTGGCGTTT TAACGTCGAT TACACCAAAG TCAGCGACTC CAGCTACTTT
AACGATTTCG ACAGTAAGTA CGGTTCCAGT ACCGACGGCT ACGCAACGCA GAAATTCAGC
GTCGGCTACG CCGTACAAAA CTTTGACGCT ACGGTGTCGA CCAAACAATT CCAGGTCTTT
AACGATCAAA ACACCAGCAG CTACTCTGCG GAGCCGCAGT TAGACGTTAA CTACTACCAT
AACGATCTCG GGCCGTTTGA TACCCGGATT TACGGCCAGG CGGTACATTT CGTCAACACC
AAAGACAATA TGCCGGAAGC GACCCGCGTC CACCTGGAGC CAACCATTAA TTTGCCGCTC
TCCAACCGCT GGGGCAGCCT GAACACCGAA GCGAAGCTGA TGGCGACGCA CTATCAGCAA
ACGAATCTGG ACAGCTATAA CAGCGATCCA AACAATAAAA ATAAGCTGAA AGATTCGGTT
AACCGCGTCA TGCCGCAGTT TAAAGTCGAC GGTAAGCTCA TCTTCGAACG CGATATGGCG
ATGCTGGCGC CGGGGTATAC CCAGACGCTG GAACCACGCG TGCAGTACCT GTATGTGCCG
TACCGCGACC AGAGCGGCAT CTATAACTAC GATTCTTCTT TGCTGCAATC CGACTATAAC
GGCCTGTTCC GCGACCGCAC TTATGGCGGT CTCGACCGTA TTGCTTCCGC CAACCAGGTC
ACGACAGGCG TCACAACACG CATTTATGAT GATGCCGCCG TTGAACGTTT TAACGTTTCT
GTTGGTCAAA TCTACTATTT CACGGAGTCT CGCACCGGCG ATGACAACAT TAAATGGGAG
AATGACGACA AAACCGGTTC GCTGGTTTGG GCAGGCGACA CTTACTGGCG TATTTCAGAA
CGCTGGGGGC TGCGTAGCGG AGTGCAGTAC GATACCCGTC TGGATAGCGT CGCTACCAGC
AGCAGCAGCC TCGAATACCG TCGGGATCAG GATCGTCTGG TACAGTTGAA CTACCGCTAT
GCCAGCCCGG AATATATTCA GGCTACGTTG CCTTCGTATT ATTCCACGGC AGAGCAGTAT
AAAAACGGCA TCAACCAGGT GGGTGCGGTG GCAAGTTGGC CGATTGCCGA TCGCTGGTCG
ATTGTCGGCG CGTACTACTT CGATACCAAT TCGAGCAAAC CTGCAGACCA GATGCTCGGC
TTGCAGTACA ACTCTTGCTG CTATGCGATC CGCGTCGGAT ACGAACGTAA GCTGAACGGT
TGGGATAACG ATAAACAACA CGCGATTTAT GATAACGCGA TTGGCTTCAA CATTGAGCTG
CGCGGTTTGA GCTCTAACTA CGGCCTCGGC ACGCAAGAAA TGTTGCGTTC GAACATTCTG
CCGTACCAAA GCTCTATGTA A
 
Protein sequence
MKKRIPTLLA TMIASALYSH QGLAADLASQ CMLGVPSYDR PLVKGDTNDL PVTINADNAK 
GNYPDDAVFT GNVDIMQGNS RLQADEVQLH QKQAEGQPEP VRTVDALGNV HYDDNQVILK
GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILENGSF TSCLPGSDTW
SVVGSEVIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTKNY
FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLT QAGEGVMELD YLPSDKVYED
DHPKEGDKHR WLFYWQHSGV MDQVWRFNVD YTKVSDSSYF NDFDSKYGSS TDGYATQKFS
VGYAVQNFDA TVSTKQFQVF NDQNTSSYSA EPQLDVNYYH NDLGPFDTRI YGQAVHFVNT
KDNMPEATRV HLEPTINLPL SNRWGSLNTE AKLMATHYQQ TNLDSYNSDP NNKNKLKDSV
NRVMPQFKVD GKLIFERDMA MLAPGYTQTL EPRVQYLYVP YRDQSGIYNY DSSLLQSDYN
GLFRDRTYGG LDRIASANQV TTGVTTRIYD DAAVERFNVS VGQIYYFTES RTGDDNIKWE
NDDKTGSLVW AGDTYWRISE RWGLRSGVQY DTRLDSVATS SSSLEYRRDQ DRLVQLNYRY
ASPEYIQATL PSYYSTAEQY KNGINQVGAV ASWPIADRWS IVGAYYFDTN SSKPADQMLG
LQYNSCCYAI RVGYERKLNG WDNDKQHAIY DNAIGFNIEL RGLSSNYGLG TQEMLRSNIL
PYQSSM
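
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any analysis. The following is a minimal sketch, not the method used to produce this record: it strips the block formatting, computes a GC fraction, and translates with the amino-acid assignments of the standard genetic code, which translation table 11 shares. The function names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table built from the NCBI base order T, C, A, G.
# Assumption: amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAAAAAAC GTATTCCCAC TCTTCTGGCC")
print(translate(fragment))
print(round(gc_fraction(fragment), 2))
```

Applied to the first thirty bases, the translation should correspond to the first ten residues of the protein sequence listed above. Pasting the full gene sequence into `clean_sequence` would allow the same comparison against the complete protein and the listed GC content.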