Gene SbBS512_E0048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0048 
Symbolimp 
ID6269611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp47296 
End bp49650 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content51% 
IMG OID641724307 
Productorganic solvent tolerance protein 
Protein accessionYP_001878867 
Protein GI187733413 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000576585 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC GTATCCCCAC TCTCCTGGCC ACCATGATTG CCACCGCCCT TTATAGTCAA 
CAGGGACTGG CAGCCGACCT CGCCTCACAG TGCATGTTGG GCGTGCCAAG CTATGACCGT
CCTCTGGTAC AGGGCGATAC CAATGACTTA CCCGTGACTA TCAATGCTGA CCACGCGAAA
GGGGACTACC CGGATGACGC CGTGTTTACT GGCAGCGTGG ATATCATGCA GGGTAACAGC
CGTCTGCAGG CCGACGAAGT GCCGCTCCAT CAAAAAGAGG CACCAGGACA ACCGGAGCCG
GTACGAACCG TTGATGCGCT CGGTAATGTC CATTACGACG ATAACCAGGT GATCCTCAAA
GGGCCGAAAG GCTGGGCGAA TCTGAACACC AAAGATACCA ACGTCTGGGA AGGTGATTAC
CAGATGGTGG GTCGCCAGGG CCGCGGTAAA GCGGACCTGA TGAAACAACG TGGTGAAAAC
CGCTATACCA TTCTGGATAA CGGTAGCTTT ACCTCCTGTC TGCCGGGTTC TGACACCTGG
AGCGTGGTAG GTAGCGAAAT TATTCATGAC CGCGAAGAAC AAGTTGCGGA GATCTGGAAC
GCCCGCTTTA AGGTGGGTCC AGTACCGATC TTTTATAGCC CCTATTTGCA GTTGCCGGTG
GGTGACAAAC GTCGCTCTGG TTTCTTGATC CCGAACGCCA AGTACACCAC CACCAACTAC
TTTGAGTTCT ACCTGCCATA TTACTGGAAC ATCGCGCCAA ATATGGATGC CACCATCACG
CCGCATTATA TGCATCGTCG TGGCAACATC ATGTGGGAGA ACGAATTCCG CTACCTCTCC
CAGGCGGGCG CTGGCTTGAT GGAACTGGAC TATCTGCCTT CAGATAAAGT CTATGAAGAT
GAACACCCGA ACGATGACAG TTCACGTCGT TGGTTGTTCT ACTGGCAACA CTCCGGGGTC
ATGGATCAAG TGTGGCGTTT CAACGTCGAC TACACCAAGG TCAGCGATCC TAGCTACTTC
AATGATTTCG ATAACAAGTA CGGTTCCAGT ACTGACGGCT ACGCAACGCA AAAATTCAGC
GTTGGCTATG CGGTGCAAAA CTTCAATGCC ACCGTTTCAA CCAAGCAGTT CCAGGTCTTT
AGCGAGCAGA ACACCAGTAG CTACTCGGCA GAGCCGCAGT TAGACGTTAA CTACTACCAG
AATGATGTTG GTCCGTTTGA TACGCGTATT TACGGCCAGG CAGTGCACTT TGTTAACACC
AGAGACGACA TGCCTGAAGC AACCCGTGTT CACCTGGAAC CGACCATCAA TTTGCCGCTC
TCTAATAACT GGGGCAGCAT CAATACCGAA GCGAAGTTGC TGGCAACCCA TTATCAGCAA
ACCAATCTTG ACTGGTATAA CTCCAGAAAC ACGACCAAGC TGGACGAATC CGTTAACCGC
GTTATGCCGC AATTCAAAGT TGACGGCAAA ATGGTCTTTG AACGCGATAT GGAAATGCTG
GCTCCGGGTT ATACCCAAAC GCTGGAACCG CGCGCGCAGT ATTTGTACGT GCCGTATCGC
GATCAGAGCG ACATCTATAA CTACGACTCG TCTCTACTGC AATCTGACTA CTCTGGCCTG
TTCCGGGACC GGACTTACGG CGGTCTTGAC CGTATTGCCT CCGCTAACCA GGTGACGACC
GGTGTCACAT CTCGCATATA TGATGATGCT GCCGTTGAAC ATTTTAATAT TTCCGTTGGT
CAAATCTACT ATTTCACGGA GTCTCGCACT GGCGATGACA ACATAACATG GGAGAATGAC
GACAAAACGG GCTCACTGGT GTGGGCAGGC GATACTTACT GGCGTATCTC CGAGCGTTGG
GGATTGCGTG GCGGGATTCA GTACGATACA CGTCTGGATA ACGTAGCGAC CAGTAACTCC
AGCATTGAAT ACCGTCGGGA TGAAGACCGT CTGGTACAGC TGAATTACCG TTACGCCAGC
CCGGAATATA TTCAGGCTAC GCTGCCTAAG TACTATTCCA CTGCTGAGCA ATATAAGAAT
GGTATTTCGC AGGTAGGTGC TGTCGCCAGC TGGCCAATTG CCGATCGTTG GTCCATTGTT
GGGGCCTACT ACTACGACAC CAATGCGAAC AAGCAAGCCG ACTCTATGTT AGGTGTGCAA
TACAGCTCCT GCTGCTATGC AATTCGCGTC GGTTACGAGC GGAAGCTGAA CGGTTGGGAT
AACGATAAAC AACATGCGGT ATATGACAAC GCAATCGGCT TTAACATCGA ACTTCGCGGC
CTGAGCTCCA ACTACGGTCT GGATACGCAA GAGATGCTGC GTTCGAACAT TCTGCCGTAT
CAAAACACTT TGTGA
 
Protein sequence
MKKRIPTLLA TMIATALYSQ QGLAADLASQ CMLGVPSYDR PLVQGDTNDL PVTINADHAK 
GDYPDDAVFT GSVDIMQGNS RLQADEVPLH QKEAPGQPEP VRTVDALGNV HYDDNQVILK
GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILDNGSF TSCLPGSDTW
SVVGSEIIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTTNY
FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLS QAGAGLMELD YLPSDKVYED
EHPNDDSSRR WLFYWQHSGV MDQVWRFNVD YTKVSDPSYF NDFDNKYGSS TDGYATQKFS
VGYAVQNFNA TVSTKQFQVF SEQNTSSYSA EPQLDVNYYQ NDVGPFDTRI YGQAVHFVNT
RDDMPEATRV HLEPTINLPL SNNWGSINTE AKLLATHYQQ TNLDWYNSRN TTKLDESVNR
VMPQFKVDGK MVFERDMEML APGYTQTLEP RAQYLYVPYR DQSDIYNYDS SLLQSDYSGL
FRDRTYGGLD RIASANQVTT GVTSRIYDDA AVEHFNISVG QIYYFTESRT GDDNITWEND
DKTGSLVWAG DTYWRISERW GLRGGIQYDT RLDNVATSNS SIEYRRDEDR LVQLNYRYAS
PEYIQATLPK YYSTAEQYKN GISQVGAVAS WPIADRWSIV GAYYYDTNAN KQADSMLGVQ
YSSCCYAIRV GYERKLNGWD NDKQHAVYDN AIGFNIELRG LSSNYGLDTQ EMLRSNILPY
QNTL