Gene Information |
Locus tag | Dshi_1188 |
Symbol | ostA |
ID | 5711956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1224020 |
End bp | 1226170 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267099 |
Product | organic solvent tolerance protein |
Protein accession | YP_001532531 |
Protein GI | 159043737 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0274723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.555926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACCGCG TCGCCCTGAC CCGCAGGCTC GCCGCGGCGC TGGCGGTCTG GCTCGCCCTG GCCTGCTGGA CCGGCCCCGT CCGGGCGCAG GAGGCGCAGG CGCCGCCTGC TCCGGCCACG CTGGTGGCCG ACCGGATCGT CGCGAACCCC GATGGGACGC TGATCGCCGA GGGGGCGGTC GAGATCTTCT ATGATGGGCG AAGCCTGAAG GCCGAGCGGC TGACCTATGA CCGGCTGGCC GACACGCTGG AGATCACGGG GCCGATCCAG CTCAGCTCCG GGCCGGGCTT CGTGCTGATC GCGTCCCAGG CGGAGCTGGG TACCGACCTG CAAGAGGGCA TCCTGCAAAG CGCGCGACTT GTGCTGGACC GGCAGGTGCA GATCGCGGCG GTGGAGATCC AGCGGGTCAA CGGGCGCTAC ACCCAGCTTT ACAACACCGT CGCGTCAAGC TGCGAGGTCT GCGCCGACCG GTCCGTGCCC CTCTGGCAGA TCCGCGCCCG GCGCATCGTC CATGACGCGC TGGAGCGGCA GATCTATTTC GAGCGCGCGG TGTTCGAGGT GGTGGGCATC CCCGTGCTTT ACCTGCCGCA GATGCGCGTA CCCGACCCGA CCCTGGAGCG CGCAACGGGG TTCCTGTTTC CGAATTTCCG AACCACCAGT GCGCTGGGGG TCGGCGTCGA GATCCCCTAT TTCATTGCGC TGGGACCGGA CCGCGACCTG ACGCTCAGCC CGCAGATCAC CACGAAGGAT TCGCGCACGC TTGGCCTGCG GTACAGGCAG GCGTTTTCGC GGGGAAACCT GACCTTCGAG GGGGCCTATA CCCGCGACGA TCTGGAAGAG GGCGATCGCG GGTTCGCCAC CCTGTTCGGC GCCTTCGACC TGGGCCGGGA TTTCGTGCTG TCGTTCGATC TCGAGACCGT GTCGGACGAC CAGTATTACC GCGATTACGG CTTCGATGAC GAGGACCGGA TCGACAGCGA GATCGAGATT TCGCGCACCC GCCGGGATGC GCTGGACCGG GCGACCGCAT CCTATTTCAC CACCTTCCGG GAGGACGAGG ATAACGACAC CATCCCGCGC TTCGTGCTGG ATGGCGAGAT CACGCGCCGC TTCGACACGC CGCTGATCGG GGGATATGCC GAGACCTCCC TGTCCTGGCT GGGCCTGGAA CGGCCTTCGG ATGCCAACGA GATCGGGCGC GACACGCGGC GGCTGAGCCT GCTGGGCACC TGGCAGCGCA GCTGGGCCAA CGACTGGGGC ATGGTGATGA CGGCCACGGG CGAGCTGGCG ATCGACCAGT TCCACGTTTC GCAGGATACC AGCTTTCCCG ACCAGCAGAC CCGGGTGACA CCGACACTGG CCGCCGAGCT GCGCTGGCCC TTCGAGAAGA GCAATGGCGC CACCCGCACG GTGATCGAGC CGGTGGCCCA GCTGGTCTGG TCGCGGGTGT CGGACGTGGA TGTGCCCAAC GAGGACAGCA CCGTGGCCGA GTTCGACGAG GCCAGCCTGT TCGACCTGAA CCGGTTCGCC GGGCGCGATC AGGTCGAAGG CGGCTGGCGC GCCAATTTCG GCGTCGGCTG GACCCGGTTC GACGCCAGCG GCTGGACCAC CGCCGTGACC GTGGGCCGGG TGCTGCGCGA GGATGACAAC ACCGCGCTGG CCCCGGGCGC GTCGGCGACC GAACGGGCGT CGGACTGGCT GACCACGGTG CAGCTGTCTT CGCCCACGGG CCTGGCGCTG CTGAACCGGG CGCAGTTCAG CTCGGATCTG TCGATCAACA AGAACGTGCT GCGGATGGGC 
TGGGAAGATC CGCTGAGCGC GCTGGCGCTG TCCTACACCT GGCTGCGGGC GTCGGAAGCC GAATCGCGGC CCGCCGATAC CAACGAGTTC AGCGTGACCG GGCGGCGCCG GTTCAACGAC ACCTGGGCCG GCGGGCTCGA ATTCAGCTAT GATTTCGACG CCTCCCAGGC CCGGGAGGCG GATCTGAGCC TTGAGTACCG CAATGAATGC GTGTTGGTGG AGCTCTCGGT ATCGCGGGAT TTCGACACCT CGCTTGATTT GCGCTCGACC ACGGATTTTG GCATCACGGT CTCGTTGCTC GGCTTCGGCC GTGGCGACGG TGCCGCGCGG ACATCGCGCT GCGGCGGATA A
Protein sequence | MNRVALTRRL AAALAVWLAL ACWTGPVRAQ EAQAPPAPAT LVADRIVANP DGTLIAEGAV EIFYDGRSLK AERLTYDRLA DTLEITGPIQ LSSGPGFVLI ASQAELGTDL QEGILQSARL VLDRQVQIAA VEIQRVNGRY TQLYNTVASS CEVCADRSVP LWQIRARRIV HDALERQIYF ERAVFEVVGI PVLYLPQMRV PDPTLERATG FLFPNFRTTS ALGVGVEIPY FIALGPDRDL TLSPQITTKD SRTLGLRYRQ AFSRGNLTFE GAYTRDDLEE GDRGFATLFG AFDLGRDFVL SFDLETVSDD QYYRDYGFDD EDRIDSEIEI SRTRRDALDR ATASYFTTFR EDEDNDTIPR FVLDGEITRR FDTPLIGGYA ETSLSWLGLE RPSDANEIGR DTRRLSLLGT WQRSWANDWG MVMTATGELA IDQFHVSQDT SFPDQQTRVT PTLAAELRWP FEKSNGATRT VIEPVAQLVW SRVSDVDVPN EDSTVAEFDE ASLFDLNRFA GRDQVEGGWR ANFGVGWTRF DASGWTTAVT VGRVLREDDN TALAPGASAT ERASDWLTTV QLSSPTGLAL LNRAQFSSDL SINKNVLRMG WEDPLSALAL SYTWLRASEA ESRPADTNEF SVTGRRRFND TWAGGLEFSY DFDASQAREA DLSLEYRNEC VLVELSVSRD FDTSLDLRST TDFGITVSLL GFGRGDGAAR TSRCGG
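The length fields in this record can be cross-checked against the coordinates with a few lines of Python. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record above.
length_bp = gene_length(1224020, 1226170)
print(length_bp)                  # compare with "Gene Length"
print(protein_length(length_bp))  # compare with "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` should give a value comparable to the reported GC content, provided the sequence is pasted as a single string (the space-separated 10-base blocks are handled by the whitespace stripping).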