Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0947 |
Symbol | |
ID | 4077341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1009832 |
End bp | 1012027 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006250 |
Product | organic solvent tolerance protein |
Protein accession | YP_612942 |
Protein GI | 99080788 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0025554 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.755835 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGA CGGTTGAATT GAAGAAACGG CGGCGAGGCG CGCTGCTGCG GACGGTGGTA CCCCTGTGGT TGGCGCTTGC GGCAGCGCCC GCCCTGGCGC AAAACGCGAC CTCTGCACAG GAAACCCCAG CCCCGGCGAT GCTGGTGGCG GACCGGGTAT TTGTATCCCC CGACCGCAAG CTGGTAGCCG AAGGAAATGT CGAGGCATTT CAGGGCGATA TCCGCCTTCA GGCGCGGCGC ATCTCTTATG ATCGCCAAAC CGGCATCCTT CAGATGGAGG GGCCGATCCG CATAGACCAG AGCGGCTCGA TCACGGTACT CGCCAATGCC GCCGAGCTGG ACAGTGAATT GCGCAATGGC ATCCTGCGTG GGGCGCGGAT GGTCTTTGAT CAGCAGGTGC AGCTCGCAGC CCTGCAGATG ACCCGCGTCG AGGGGCGCTA TTCGCAGCTC TACAAGACCG CCGTGACCTC TTGCCATGTT TGCGAAGATG GCAGCCCCCC GCTCTGGCAG ATCCGCGCCG AGCGCATCAC CCATGATCAG GCTGAGCGCC AGCTCTATCT GGAGGGTGCG CAGCTGATGG TGAAAGATGT TCCGGTCTTT TACTTTCCCG CACTGCGCCT GCCAGACCCA ACACTCGAGC GGGCTGATGG CTTTCTGGTG CCCTCTCTCT CCAGTTCCTC GACGCTTGGC GTAGGGGTGA AGATCCCCTA TTTCAAAACA ATCGGCCCGC ACAGAGACCT CACCATCACG CCCTACCTGT CCGAGAAGAC CAGAACGCTC GACCTGCGCT ACCGTCAGGC CTTTCGCAAC GGCCAGATCG AGATCACCGG CGCCTTCAGC CGCGATGACA TTCAGCCTGA TGATGGCCGG GGCTACCTCA GCCTGCGCGG CGCCTTCGAC ATCCCGCGAG ACTTCAAGCT GAGCTTTGAT CTGAACACTG TCTCGGATGA CGGATACTAC GCTGACTATG ACATCTCTGA CACCGACCGC ATTCGCTCCG AAATCTCGCT GATCCGGGTG CGGCGCGACC AGCTGATCGA AGGCAAGATC TCCAACTACA AAACCCTGCG CGACGCCGAG AACCAGGACT TCATTCCCTC GACCATCGTG ACCGGCACCT TCGAGCAGCG CCTCTTTCCC AAGGCCGTTG GCGGCGAGCT CCGTCTGCGG CTCAACGCCT CGCAATTCAG ACGCGAATCC TCGCTTGATG CCACCCTCAC AGATGCCAAC GGGCGCGACA TGAGCCGGGT TTCCGCCGAT GCCACCTGGC TGCGCAGCTG GATTTTGCCC TGGGGGATTG AATCGGTCTG GACCGCCGGG ATCGGGATCG ACAGTTTTGC GCTGTCAGAT GATGCAGCCT TTGACAATGA TGCCACCCGC GTCACGCCCA AGGCGGCGCT GACCCTGCGC CGCCCGATGA CACGCCAGAC CGCATCAGGG GCGGTTCAGG TGCTCGAGCC CATCGTGCAG CTTGGCTGGA CCCATGTGAA TGGCGACGAC ACCCCGAATG AGGCCAGCAA TATCTCCGAG TTCGATCAGG GCAACCTGAT GGCGCTCTCG CGGTTCCCCG AATCCGACGT GCGCGAGGAT GGCGAGACGT TTGTTTACGG CGTGAACTTT GCGCATTTCG ACACCTCCGG CTGGTTTGCA ACAGGTACCA TTGCGCAGAT CCACCGCGAT CAAGCGCAAT CGGGCTTTAC CTCCTCCTCG GGGCTGGATG GCCGCAACTC CAACGTCTTG GTGGCGGGGC AGCTGGGCCT GCGCAACGAC CTGACGCTGA CCGCCCGCAC CCTCTTTGAC GAGGAGTGGT CGGTGACCAA GGCAGAGTTT CGCGGCGATC TGGAACGTGA CCGGGTCAGT CTTGCAGGGA GCTATCTTTG GTTGCAGGCC GACGCGAGCG AGAACCGCAC GGAAGAAACC TCGGAACTCT GGTTCGATGG CACCTATGAC TTGAACCAGA CATGGCGCGC GGGCGCCAAC ATGCGCTACG ACATTACCGA TGGGCGCGCG ACCCGTGCAG GTCTTGGCCT CACCTACAGC AATGAATGCG TGACACTCGA CCTATCGCTC AGCCGTCGCT ATACCTCGAC GACAAGTGTT GAGCCATCAA CGGATTTCGG ATTTACACTG TCGCTCAACG GCTTTTCCGT CAAAAGCGGC AACACAACAA GCAGGCGATC ATGCAGCAAA ACCTAA
|
Protein sequence | MPKTVELKKR RRGALLRTVV PLWLALAAAP ALAQNATSAQ ETPAPAMLVA DRVFVSPDRK LVAEGNVEAF QGDIRLQARR ISYDRQTGIL QMEGPIRIDQ SGSITVLANA AELDSELRNG ILRGARMVFD QQVQLAALQM TRVEGRYSQL YKTAVTSCHV CEDGSPPLWQ IRAERITHDQ AERQLYLEGA QLMVKDVPVF YFPALRLPDP TLERADGFLV PSLSSSSTLG VGVKIPYFKT IGPHRDLTIT PYLSEKTRTL DLRYRQAFRN GQIEITGAFS RDDIQPDDGR GYLSLRGAFD IPRDFKLSFD LNTVSDDGYY ADYDISDTDR IRSEISLIRV RRDQLIEGKI SNYKTLRDAE NQDFIPSTIV TGTFEQRLFP KAVGGELRLR LNASQFRRES SLDATLTDAN GRDMSRVSAD ATWLRSWILP WGIESVWTAG IGIDSFALSD DAAFDNDATR VTPKAALTLR RPMTRQTASG AVQVLEPIVQ LGWTHVNGDD TPNEASNISE FDQGNLMALS RFPESDVRED GETFVYGVNF AHFDTSGWFA TGTIAQIHRD QAQSGFTSSS GLDGRNSNVL VAGQLGLRND LTLTARTLFD EEWSVTKAEF RGDLERDRVS LAGSYLWLQA DASENRTEET SELWFDGTYD LNQTWRAGAN MRYDITDGRA TRAGLGLTYS NECVTLDLSL SRRYTSTTSV EPSTDFGFTL SLNGFSVKSG NTTSRRSCSK T
|
| |