Gene SeHA_C0099 details

Gene Information

Locus tag: SeHA_C0099
Symbol: imp
ID: 6491909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 101210
End bp: 103570
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 53%
IMG OID: 642740387
Product: organic solvent tolerance protein
Protein accession: YP_002044061
Protein GI: 194450910
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
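
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 101210
end_bp = 103570

# Assumption: coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2361, matching "Gene Length"
print(protein_length_aa)   # 786, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.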


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0097442
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC GTATTCCCAC TCTTCTGGCC ACCATGATCG CCAGCGCCCT TTATAGTCAT 
CAGGGGCTGG CAGCCGATCT CGCCTCACAG TGTATGTTGG GCGTGCCGAG CTACGATCGT
CCTCTGGTAA AAGGCGATAC CAACGATCTG CCGGTTACTA TCAATGCCGA TAACGCTAAA
GGTAACTACC CGGACGATGC CGTTTTTACC GGCAACGTGG ACATTATGCA GGGGAATAGC
CGCCTGCAAG CGGATGAAGT GCAGCTTCAT CAGAAGCAGG CGGAAGGTCA GCCGGAACCT
GTACGCACCG TCGATGCGCT GGGTAATGTG CATTATGATG ACAATCAGGT CATCCTTAAA
GGGCCGAAGG GCTGGGCGAA CCTGAACACC AAAGACACGA ACGTCTGGGA AGGCGATTAC
CAGATGGTGG GCCGTCAGGG GCGCGGTAAA GCCGATCTCA TGAAGCAGCG CGGCGAAAAC
CGCTATACCA TTCTGGAAAA CGGCAGCTTT ACCTCCTGTC TGCCTGGCTC CGATACCTGG
AGCGTGGTGG GGAGTGAAGT CATCCATGAC CGCGAAGAAC AGGTTGCGGA GATCTGGAAC
GCCCGGTTTA AAGTAGGTCC GGTTCCGATC TTTTATAGCC CCTATTTACA GCTACCCGTC
GGTGACAAGC GTCGCTCAGG TTTCCTGATC CCGAACGCGA AATACACGAC CAAGAACTAT
TTCGAGTTCT ACTTACCGTA TTACTGGAAC ATCGCGCCCA ATATGGACGC CACCATCACC
CCGCACTATA TGCACCGCCG CGGCAATATT ATGTGGGAGA ACGAATTCCG TTATCTCACG
CAGGCAGGCG AGGGAGTGAT GGAATTAGAT TATCTGCCTT CTGATAAAGT CTACGAGGAC
GATCACCCCA AAGAGGGCGA TAAGCACCGC TGGTTATTCT ACTGGCAGCA CTCAGGCGTG
ATGGATCAGG TGTGGCGTTT TAACGTCGAT TACACCAAAG TCAGCGACTC CAGCTACTTT
AACGATTTCG ACAGTAAGTA CGGTTCCAGT ACCGACGGCT ACGCAACGCA GAAATTCAGC
GTCGGCTACG CCGTACAAAA CTTTGACGCT ACGGTGTCGA CCAAACAATT CCAGGTCTTT
AACGATCAAA ACACCAGCAG CTATTCAGCG GAGCCGCAGT TAGACGTTAA CTACTACCAT
AACGATCTCG GCCCGTTTGA TACCCGGATT TACGGCCAGG CGGTGCACTT TGTTAACACC
AAAGACAATA TGCCGGAAGC GACCCGCGTC CACCTGGAGC CAACCATCAA TTTGCCGCTC
TCCAACCGCT GGGGCAGCCT GAACACCGAA GCGAAGCTGA TGGCGACGCA CTATCAGCAA
ACGAATCTGG ACAGCTATAA CAGCGATCCA AACAATAAAA ATAAGCTGGA AGATTCGGTT
AACCGCGTCA TGCCGCAGTT TAAAGTCGAC GGTAAGCTCA TCTTCGAACG CGATATGGCG
ATGCTGGCGC CGGGGTATAC CCAGACGCTG GAACCACGCG TGCAGTACCT GTATGTGCCG
TACCGCGACC AGAGCGGCAT CTATAACTAC GATTCTTCTT TGCTGCAATC CGACTATAAC
GGCCTGTTCC GCGACCGCAC TTATGGCGGT CTCGACCGTA TTGCTTCCGC CAACCAGGTC
ACAACCGGCG TCACAACACG CATTTATGAT GATGCCGCCG TTGAACGTTT TAACGTTTCT
GTTGGTCAAA TCTACTATTT CACGGAGTCT CGCACCGGCG ATGACAACAT TAAATGGGAG
AATGACGACA AAACCGGTTC GCTGGTTTGG GCAGGCGACA CTTACTGGCG TATTTCAGAA
CGCTGGGGGC TGCGTAGCGG AGTGCAGTAC GATACCCGTC TGGATAGCGT CGCTACCAGC
AGCAGCAGCC TCGAATACCG TCGGGATCAG GATCGTCTGG TACAGTTGAA CTACCGCTAT
GCCAGCCCGG AATATATTCA GGCTACGTTG CCTTCGTATT ATTCCACGGC AGAGCAGTAT
AAAAACGGCA TCAATCAGGT GGGCGCGGTG GCAAGTTGGC CGATTGCCGA TCGCTGGTCG
ATTGTCGGCG CGTACTACTT CGATACCAAT TCGAGCAAAC CTGCAGACCA GATGCTCGGC
TTGCAGTACA ACTCTTGCTG CTATGCGATC CGCGTCGGAT ACGAACGTAA GCTGAACGGT
TGGGATAACG ATAAACAACA CGCGATTTAT GATAACGCGA TTGGCTTCAA CATTGAGCTG
CGCGGTTTGA GCTCTAACTA CGGCCTCGGC ACGCAAGAAA TGTTGCGTTC GAACATTCTG
CCGTACCAAA GCTCTATGTA A
 
Protein sequence
MKKRIPTLLA TMIASALYSH QGLAADLASQ CMLGVPSYDR PLVKGDTNDL PVTINADNAK 
GNYPDDAVFT GNVDIMQGNS RLQADEVQLH QKQAEGQPEP VRTVDALGNV HYDDNQVILK
GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILENGSF TSCLPGSDTW
SVVGSEVIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTKNY
FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLT QAGEGVMELD YLPSDKVYED
DHPKEGDKHR WLFYWQHSGV MDQVWRFNVD YTKVSDSSYF NDFDSKYGSS TDGYATQKFS
VGYAVQNFDA TVSTKQFQVF NDQNTSSYSA EPQLDVNYYH NDLGPFDTRI YGQAVHFVNT
KDNMPEATRV HLEPTINLPL SNRWGSLNTE AKLMATHYQQ TNLDSYNSDP NNKNKLEDSV
NRVMPQFKVD GKLIFERDMA MLAPGYTQTL EPRVQYLYVP YRDQSGIYNY DSSLLQSDYN
GLFRDRTYGG LDRIASANQV TTGVTTRIYD DAAVERFNVS VGQIYYFTES RTGDDNIKWE
NDDKTGSLVW AGDTYWRISE RWGLRSGVQY DTRLDSVATS SSSLEYRRDQ DRLVQLNYRY
ASPEYIQATL PSYYSTAEQY KNGINQVGAV ASWPIADRWS IVGAYYFDTN SSKPADQMLG
LQYNSCCYAI RVGYERKLNG WDNDKQHAIY DNAIGFNIEL RGLSSNYGLG TQEMLRSNIL
PYQSSM
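
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch using only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and are not part of any database tooling. The codon table is built from the standard amino-acid string for bases in T, C, A, G order. To my knowledge translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so this sketch is only appropriate when the first codon is ATG, as it is here.

```python
from itertools import product

# Codon table for bases in T, C, A, G order (third base varies fastest).
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAAAAAC GTATTCCCAC TCTTCTGGCC"
print(translate(first_blocks))  # MKKRIPTLLA, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_content` would allow comparison with the listed protein sequence and the GC content field.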