Gene EcSMS35_0058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0058 
Symbolimp 
ID6146361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp61291 
End bp63648 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content51% 
IMG OID641614959 
Productorganic solvent tolerance protein 
Protein accessionYP_001742175 
Protein GI170681284 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000119427 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.365449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC GTATCCCCAC TCTCCTGGCC ACCATGATTG CCACCGCCCT TTATAGTCAA 
CAGGGACTGG CAGCCGACCT CGCCTCACAG TGCATGTTGG GCGTGCCAAG CTATGACCGT
CCTCTGGTAC AGGGCGATAC CAATGACTTA CCCGTGACTA TCAATGCTGA CCACGCGAAA
GGGGACTACC CGGATGACGC CGTGTTTACT GGCAGCGTGG ATATCATGCA GGGTAACAGC
CGTCTGCAGG CCGACGAAGT GCAGCTCCAT CAAAAAGAGG CACCAGGACA ACCGGAGCCG
GTACGTACCG TTGATGCGCT CGGTAATGTC CATTACGACG ATAACCAGGT GATCCTCAAA
GGGCCGAAAG GCTGGGCGAA TCTGAACACC AAAGATACCA ACGTCTGGGA GGGTGATTAC
CAGATGGTGG GTCGCCAGGG TCGCGGTAAA GCGGACCTGA TGAAACAACG TGGCGAAAAC
CGCTATACCA TTCTGGATAA CGGTAGCTTT ACCTCCTGTC TACCGGGTTC TGACACCTGG
AGCGTGGTAG GTAGCGAAAT TATTCATGAC CGCGAAGAAC AAGTTGCGGA GATCTGGAAC
GCCCGCTTTA AGGTGGGTCC GGTACCGATC TTTTATAGCC CATATTTGCA GTTGCCGGTG
GGTGACAAAC GTCGCTCTGG TTTCTTGATC CCGAACGCCA AGTACACCAC CACCAACTAC
TTTGAGTTCT ACCTGCCGTA TTACTGGAAC ATCGCGCCAA ATATGGATGC CACCATCACG
CCGCATTATA TGCATCGTCG TGGCAACATT ATGTGGGAAA ACGAATTCCG TTATCTCACC
CAGGCGGGCG CAGGCTTGAT GGAACTGGAC TATCTGCCTT CGGATAAAGT CTATGAAGAT
GAACATCCAG AAGATAGCAA TTCACGCCGT TGGTTGTTCT ACTGGCAACA CTCCGGAGTG
ATGGATCAGG TTTGGCGCTT TAACATCGAC TACACCAAAG TCAGCGACCC GACCTATTTC
AATGATTTTG ATAATAAATA TGGTTCCAGT ACCGATGGCT ACGCCACGCA AAAATTCAGC
GTGGGCTATG CGGTGCAGAA CTTTGACGCC ACGCTTTCGA CTAAACAATT CCAGGTTTTT
GATGATACCA GTGGCAACAG CTATGCCGCT GAACCGCAGT TAGACGTCAA CTACTACCAT
AACGATCTCG GCCCGTTTGA TACCCGTGTT TATGGTCAGG CAGTGCATTT TGTTAATACA
AACAGCAATA TGCCGGAGGC TACTCGCGTT CATCTGGAGC CGACGATAAA CCTGCCGTTA
TCTAACACCT GGGGCAGTAT CAATACCGAA GCGAAGCTGC TGGCGACCCA TTATCAGCAG
ACCAATCTCG ACTGGTATAA CAGCAACCCA CAAAACAATA AGCTGGCTGA TTCTGTTAAC
CGCGTAATGC CGCAATTTAA AGTTGACGGC AAAATGGTCT TCGAACGTGA TATGGAAATG
CTGGCGCCGG GTTATACCCA AACGCTGGAG CCGCGCGCGC AATACCTGTA TGTACCGTAT
CGCGATCAGA GCGACATCTA TAACTACGAC TCCTCCTTGC TGCAATCCGA TTACTCGGGC
CTGTTCCGGG ACCGGACTTA TGGCGGTCTT GACCGTATAG CTTCCGCGAA CCAGGTCACG
ACCGGTATCA CAACTCGCGT ATATGATGAT GCAGCCGTTG AACGTTTTAA TATTTCCGTT
GGTCAAATCT ACTATTTCAC GGAGTCTCGC ACTGGCGATG ACAACATAAC ATGGGAGAAT
GACGACAAAA CGGGCTCGCT GGTGTGGGCA GGCGATACTT ACTGGCGTAT CTCCGAGCGT
TGGGGATTGC GTGGCGGGAT TCAGTACGAT ACACGTCTGG ATAACGTAGC GACCAGTAAC
TCCAGCATTG AATACCGTCG GGATGAAGAC CGTCTGGTAC AGCTGAATTA CCGTTACGCC
AGCCCGGAAT ATATTCAGGC TACGCTGCCT AAGTACTATT CCACAGCTGA GCAATATAAG
AATGGTATTT CACAGGTAGG TGCCGTCGCC AGCTGGCCAA TTGCCGATCG TTGGTCAATT
GTTGGGGCCT ACTATTACGA CACCAATGCG AACAAGCAAG CCGACTCTAT GTTAGGTGTG
CAATACAGCT CCTGCTGCTA TGCAATTCGC GTCGGTTACG AGCGGAAGCT GAACGGTTGG
GATAACGATA AACAACATGC GGTATATGAC AACGCAATCG GCTTTAACAT CGAACTTCGC
GGCCTGAGCT CCAACTACGG TCTGGGTACG CAAGAGATGC TGCGTTCGAA CATTCTGCCG
TATCAAAACA CTTTGTGA
 
Protein sequence
MKKRIPTLLA TMIATALYSQ QGLAADLASQ CMLGVPSYDR PLVQGDTNDL PVTINADHAK 
GDYPDDAVFT GSVDIMQGNS RLQADEVQLH QKEAPGQPEP VRTVDALGNV HYDDNQVILK
GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILDNGSF TSCLPGSDTW
SVVGSEIIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTTNY
FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLT QAGAGLMELD YLPSDKVYED
EHPEDSNSRR WLFYWQHSGV MDQVWRFNID YTKVSDPTYF NDFDNKYGSS TDGYATQKFS
VGYAVQNFDA TLSTKQFQVF DDTSGNSYAA EPQLDVNYYH NDLGPFDTRV YGQAVHFVNT
NSNMPEATRV HLEPTINLPL SNTWGSINTE AKLLATHYQQ TNLDWYNSNP QNNKLADSVN
RVMPQFKVDG KMVFERDMEM LAPGYTQTLE PRAQYLYVPY RDQSDIYNYD SSLLQSDYSG
LFRDRTYGGL DRIASANQVT TGITTRVYDD AAVERFNISV GQIYYFTESR TGDDNITWEN
DDKTGSLVWA GDTYWRISER WGLRGGIQYD TRLDNVATSN SSIEYRRDED RLVQLNYRYA
SPEYIQATLP KYYSTAEQYK NGISQVGAVA SWPIADRWSI VGAYYYDTNA NKQADSMLGV
QYSSCCYAIR VGYERKLNGW DNDKQHAVYD NAIGFNIELR GLSSNYGLGT QEMLRSNILP
YQNTL