Gene Information |
Locus tag | EcSMS35_0058 |
Symbol | imp |
ID | 6146361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 61291 |
End bp | 63648 |
Gene Length | 2358 bp |
Protein Length | 785 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641614959 |
Product | organic solvent tolerance protein |
Protein accession | YP_001742175 |
Protein GI | 170681284 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
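The coordinates and lengths listed above agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch under two assumptions the record does not state: that Start bp and End bp are 1-based inclusive positions, and that Protein Length excludes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for EcSMS35_0058
length = gene_length_bp(61291, 63648)
print(length)                     # 2358
print(protein_length_aa(length))  # 785
```

Both results match the Gene Length and Protein Length fields, which supports (but does not prove) the two assumptions.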
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000119427 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.365449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC GTATCCCCAC TCTCCTGGCC ACCATGATTG CCACCGCCCT TTATAGTCAA CAGGGACTGG CAGCCGACCT CGCCTCACAG TGCATGTTGG GCGTGCCAAG CTATGACCGT CCTCTGGTAC AGGGCGATAC CAATGACTTA CCCGTGACTA TCAATGCTGA CCACGCGAAA GGGGACTACC CGGATGACGC CGTGTTTACT GGCAGCGTGG ATATCATGCA GGGTAACAGC CGTCTGCAGG CCGACGAAGT GCAGCTCCAT CAAAAAGAGG CACCAGGACA ACCGGAGCCG GTACGTACCG TTGATGCGCT CGGTAATGTC CATTACGACG ATAACCAGGT GATCCTCAAA GGGCCGAAAG GCTGGGCGAA TCTGAACACC AAAGATACCA ACGTCTGGGA GGGTGATTAC CAGATGGTGG GTCGCCAGGG TCGCGGTAAA GCGGACCTGA TGAAACAACG TGGCGAAAAC CGCTATACCA TTCTGGATAA CGGTAGCTTT ACCTCCTGTC TACCGGGTTC TGACACCTGG AGCGTGGTAG GTAGCGAAAT TATTCATGAC CGCGAAGAAC AAGTTGCGGA GATCTGGAAC GCCCGCTTTA AGGTGGGTCC GGTACCGATC TTTTATAGCC CATATTTGCA GTTGCCGGTG GGTGACAAAC GTCGCTCTGG TTTCTTGATC CCGAACGCCA AGTACACCAC CACCAACTAC TTTGAGTTCT ACCTGCCGTA TTACTGGAAC ATCGCGCCAA ATATGGATGC CACCATCACG CCGCATTATA TGCATCGTCG TGGCAACATT ATGTGGGAAA ACGAATTCCG TTATCTCACC CAGGCGGGCG CAGGCTTGAT GGAACTGGAC TATCTGCCTT CGGATAAAGT CTATGAAGAT GAACATCCAG AAGATAGCAA TTCACGCCGT TGGTTGTTCT ACTGGCAACA CTCCGGAGTG ATGGATCAGG TTTGGCGCTT TAACATCGAC TACACCAAAG TCAGCGACCC GACCTATTTC AATGATTTTG ATAATAAATA TGGTTCCAGT ACCGATGGCT ACGCCACGCA AAAATTCAGC GTGGGCTATG CGGTGCAGAA CTTTGACGCC ACGCTTTCGA CTAAACAATT CCAGGTTTTT GATGATACCA GTGGCAACAG CTATGCCGCT GAACCGCAGT TAGACGTCAA CTACTACCAT AACGATCTCG GCCCGTTTGA TACCCGTGTT TATGGTCAGG CAGTGCATTT TGTTAATACA AACAGCAATA TGCCGGAGGC TACTCGCGTT CATCTGGAGC CGACGATAAA CCTGCCGTTA TCTAACACCT GGGGCAGTAT CAATACCGAA GCGAAGCTGC TGGCGACCCA TTATCAGCAG ACCAATCTCG ACTGGTATAA CAGCAACCCA CAAAACAATA AGCTGGCTGA TTCTGTTAAC CGCGTAATGC CGCAATTTAA AGTTGACGGC AAAATGGTCT TCGAACGTGA TATGGAAATG CTGGCGCCGG GTTATACCCA AACGCTGGAG CCGCGCGCGC AATACCTGTA TGTACCGTAT CGCGATCAGA GCGACATCTA TAACTACGAC TCCTCCTTGC TGCAATCCGA TTACTCGGGC CTGTTCCGGG ACCGGACTTA TGGCGGTCTT GACCGTATAG CTTCCGCGAA CCAGGTCACG ACCGGTATCA CAACTCGCGT ATATGATGAT GCAGCCGTTG AACGTTTTAA TATTTCCGTT GGTCAAATCT ACTATTTCAC GGAGTCTCGC ACTGGCGATG ACAACATAAC ATGGGAGAAT 
GACGACAAAA CGGGCTCGCT GGTGTGGGCA GGCGATACTT ACTGGCGTAT CTCCGAGCGT TGGGGATTGC GTGGCGGGAT TCAGTACGAT ACACGTCTGG ATAACGTAGC GACCAGTAAC TCCAGCATTG AATACCGTCG GGATGAAGAC CGTCTGGTAC AGCTGAATTA CCGTTACGCC AGCCCGGAAT ATATTCAGGC TACGCTGCCT AAGTACTATT CCACAGCTGA GCAATATAAG AATGGTATTT CACAGGTAGG TGCCGTCGCC AGCTGGCCAA TTGCCGATCG TTGGTCAATT GTTGGGGCCT ACTATTACGA CACCAATGCG AACAAGCAAG CCGACTCTAT GTTAGGTGTG CAATACAGCT CCTGCTGCTA TGCAATTCGC GTCGGTTACG AGCGGAAGCT GAACGGTTGG GATAACGATA AACAACATGC GGTATATGAC AACGCAATCG GCTTTAACAT CGAACTTCGC GGCCTGAGCT CCAACTACGG TCTGGGTACG CAAGAGATGC TGCGTTCGAA CATTCTGCCG TATCAAAACA CTTTGTGA
|
Protein sequence | MKKRIPTLLA TMIATALYSQ QGLAADLASQ CMLGVPSYDR PLVQGDTNDL PVTINADHAK GDYPDDAVFT GSVDIMQGNS RLQADEVQLH QKEAPGQPEP VRTVDALGNV HYDDNQVILK GPKGWANLNT KDTNVWEGDY QMVGRQGRGK ADLMKQRGEN RYTILDNGSF TSCLPGSDTW SVVGSEIIHD REEQVAEIWN ARFKVGPVPI FYSPYLQLPV GDKRRSGFLI PNAKYTTTNY FEFYLPYYWN IAPNMDATIT PHYMHRRGNI MWENEFRYLT QAGAGLMELD YLPSDKVYED EHPEDSNSRR WLFYWQHSGV MDQVWRFNID YTKVSDPTYF NDFDNKYGSS TDGYATQKFS VGYAVQNFDA TLSTKQFQVF DDTSGNSYAA EPQLDVNYYH NDLGPFDTRV YGQAVHFVNT NSNMPEATRV HLEPTINLPL SNTWGSINTE AKLLATHYQQ TNLDWYNSNP QNNKLADSVN RVMPQFKVDG KMVFERDMEM LAPGYTQTLE PRAQYLYVPY RDQSDIYNYD SSLLQSDYSG LFRDRTYGGL DRIASANQVT TGITTRVYDD AAVERFNISV GQIYYFTESR TGDDNITWEN DDKTGSLVWA GDTYWRISER WGLRGGIQYD TRLDNVATSN SSIEYRRDED RLVQLNYRYA SPEYIQATLP KYYSTAEQYK NGISQVGAVA SWPIADRWSI VGAYYYDTNA NKQADSMLGV QYSSCCYAIR VGYERKLNGW DNDKQHAVYD NAIGFNIELR GLSSNYGLGT QEMLRSNILP YQNTL
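The gene and protein sequences can be cross-checked by translating a few codons. The sketch below assumes the gene sequence is written 5' to 3' on the coding strand (the ATG start suggests it has already been reverse-complemented for the minus strand). It uses the codon assignments of NCBI translation table 11, which are the same as the standard table's; the two tables differ only in permitted start codons. `translate` is a hypothetical helper, not part of any library.

```python
# Codon table built in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 21 bases and last 18 bases of the gene sequence in this record
print(translate("ATGAAAAAAC GTATCCCCAC T"))  # MKKRIPT
print(translate("TATCAAAACA CTTTGTGA"))      # YQNTL*
```

The translated ends match the start (MKKRIPT) and end (YQNTL) of the listed protein sequence, with the final TGA read as the stop codon.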