Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33125 |
Symbol | |
ID | 5003446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 429510 |
End bp | 433574 |
Gene Length | 4065 bp |
Protein Length | 1354 aa |
Translation table | |
GC content | 54% |
IMG OID | 640418867 |
Product | predicted protein |
Protein accession | XP_001419161 |
Protein GI | 145349481 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.517999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0400566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCACA ACCGTCATCG CGCGCGATCC GGACGCTCGA CGCGCTCAAC GACGACGAAT CGCGCGCGAA CGCGCACGAC GGTCCGGTCG GCGGCTCGAG CGCATTATCA AGACGGCGAC GCGCACGCGG CGTCGCAGGA GGGTTACTAC AGGTTCCCGG TGATCCGTGG TAACGAGTTG TTCTTCGTGT GCGAAGACGA CGTGTACGCG ACCACGATCT CGGGGCTCGA TAAACGCGAG AGCGGCGCGG AGACGACGCC GCCGCGACGG CTGACGCAGG CGCACGGCGC GGTGCAGCGG TTAGTCGTCT CTCCCGATGG ATCACGCGTC GCGTTCGCGT GCGCAGAGGA TGGATACACC GAAATATACG TCGTGGACGC GCGTGGAGGT CCTATGAAAC AGTTGACACA CATGGGGGCT TCGTACGCGC GGGCGTGCTG CTTTTCAGAG GATGGTCGAC GGGTGTACTT TACGTCGAGC GGGGCGACGG CGGAACCGAA TGGAGACGAG CTTTGGGTGG TGGACTGCGA TGGTGGAGCG CCAATGAGAA TGAACCTCGG TCCAGTGCAT GATTTCGACG TGCGCAACGT GAACGGGAAA GAATTGGTTG TCCTCGGTCG TAATACTGAA GACACGGCGA CGAAACACTG GGACGGATAC GCGGGCGGCG CCGGTGGAGA GATTTGGTAT GGGACCTTAG ATAATTTGCT GCGACTTGAC TTGCGTCTTC CAAACGAGCG TTTGTTGCGA AATGTTGGCA ACGTGTCGTG GTTTGACGAC GAGCACGTCG CGTTCACGGA AGCGGGCGGG CGTTCGACAG CGTTCAAGGC TAAAATTGAT ACCAATGAAT TCAACACTCG CATCGTAGAT TGTCAAGGAT TCAGTGACTC CACAGCAGAC AACGAAGACA TTAAGTACTT TCCCGTGCGT CATCTGAGCA TTGATGTCAG TAGGCAAAGA ATCGTGTACA CCAGAGGCGG GGATGTATTC GTGGCAGAAG TTGTAGGTGG CGAACATAAG TCACCTATAA GGTTGCCTAT TGAGTGGCGT GGACCTCGTA CACAGCTTGC AAAACGATTT GTTCACGCGG ATGATTGGAT CGAAGACTGG GATTTACACC CAGAAGGCTT GACGATGATG GTTCTTGTTC GCGGCCAACC GTTCACGATG GGCATCTGGG ATGGACCGGT GTTGAGTTAT CCGCCAGCGA CGAAACTGCG ATCACCCCAA AGCTCAGTGA TAGCACCACT GGCGTCTCTT GCACAAAAGA GTCAAGCGCG TGTCCGGCAT GGGGCTTACT TGTACGACGG TGAACGTCTC GTTTTTGTAT CGGACGCCAG TGGTGAGGAG GACATTGAAG TGCATTGGGA AGAAGCTGAG CGACCGGCAA AACGATTAGG ATTGCATCAC GAGTTGCTTG GAAGAGTAGA ACGCCTGATA CCTAGTCCAG AGGCGCCATT GGTGGCGATC GTGAACCACA GAAACTCACT GTTGATAGTC AACGTTGAAA CTGGGGAAAT GCGAACGGCT GACACGTCGA GTGAAGTGGA CGGAATCGAC GATTTGACGT GGAGTCCTTG CGGGAATTGG CTCGCTTACA CGTACTACTT AAACAACGAA AGATCCTGTA TTCGCATTCT CGATGTTCGG AACGGAAAGG TGTTCGATGC CACAAACCCT GTGCTCGGAG ATCACTCACC TGCGTGGGAT CCCGACGGCA AATATCTGTA CTTTTTAGGA TCTCGAGAAC TCGAACCGGT GTACGATGCC GCCACGTTTG GCTTAAATTT TCCAACTGTT GAACGCCCTC ATTTGATCAT ACTGCAAAAG GATTTACGTA ATCCACTTCT CAAGGAGCTC AGACCGCCGT ACGATACGGA ATCGTCGTCT GGCTCTGACT ATGACTCTGA AACAGATAGT GATGTGGGAA GAAAGATGGA TAAACGAGGC AATGGAAGGG ACTACATGCC TCGCAAGAAG CCAGTTGTGG ATAAGGATGA TGATGGCAGC GACTGGTCAA CTGTGGATGG AGACAGTGAT GACCAAATCT TGTCGAGCGG CGACGAGGAT GACGATTACG AATCAGACGC GTCATATTAC CATGAAGACG CGCCCCCTGC GATTGAGATT CACATCGATG GCTTGACAGA AAGGGTAGTT GCGCTCCCAA TGCCGATATC ACGTTATGAT TGCCTTTGCG GTCTTGAAGA CGGACGATTT ATGGTTGTTG AATACCCTCC GAGTCGCGGC TCTCCCGGAA GCGTTGGTTT GGACTACTCG TCCGATGAAG ATGATGACGG TTTGGGTTCC CTTATTTCTT ACAGTATCCG AGATTTGCGT CGAAGTATCT TAATACACGG TGGCGTGAGC GAAGTGTCAC TGTCGATGGA TCGCAAATGC ATGGTTGTGG AAAAAGAGTC GGATGGATTC CTAGAGCTTC GCGTTTACAA AGCTGGCGTG CGGCCGGAGG AAGAAGGGAG CGACAGCGAA GAATTGGATC AAATGCGGTG CGATCGTAGG ACTGGCTTGG TGAATCTAGA TGGTCGCATT CGTGTACTAG TTGATCCTGC GAGAGAGTGG GCACAGATGC TCGGGGAAGT GTATCGTCGT CTTCGCGACG ACCTCTGGAC TGAGAAAATT TGGAATGAAA CGATTGGCGA TGACTGGGAA GTCATGTTTG AGGAGTATGT GAAAGTGCTT CCAAAAGTGA GTACGCGGAC CGAGTTTGGG GATTTATTGC GTGAAATTGC CGCTGGCGTT TGTTACTCGC ACGTGGCTAT CACGTCTGGT GATCCTGGAC GCTCCCATCG CAGACACTCT GCTGGGTATC TCGGCGCTGA CTTCACTTGG GACGGCAAAG TCGGCGGTTA TCGCATCTTG AACATCGTGA AGGGTGACAT ATGGGACGAC ATGCGAGGTG GTGTGCTCAG CAAACCGGGC GTCAATATCC ACGAGGGTGA CATACTTCTC TCGATCGATA GAGTACCGCT CACCGAAGAT GTTCCGCCGG CTGCGTTGTT GATTGAGAAG GGTGGCGTTG AAGTTCTGTT GACGGTCAAA ATTGATAGCG ACGGCAAAGG TGGTATTGAC GAAGCGCTCG ACAGACTCAT GCTGAAAAAA CAAAAGAACA AGAAAAAGGA CAAGAGAGAT GACAACGCAC CGAAGAAAGG TGATGTCATC CCGGTGCGTG TCCGAGCCAT GCACTCTGAA ATCGATGCGA GGTATCGCGA TATGATTCAG AAGCGCACGG AGCGAGTCCA CAGCCTGAGC GATGGCGTCG TCGGCTACTT GCACATTCCA GACATGGAAA GCACAGGGTA CTCCGAATTT TGGCGTCACT ATGCGTCTGA AGTTCGCAAG GGAAGCTTGA TTCTCGACTT GCGAGGTAAC ACAGGCGGAC ACATTAGTGA ATTGTTGCTC GCTAAGCTCT CGCAGCGCGC ATTGGCTTGG GACATCCCGC GACGCGGCGA GGTGCAAGTT TATCCATCCA ACACGCCTGG CCCGCTCGTG ATGCTGGTCG ATCAACGCAC AGGCTCTGAT GCTGAGCTCA TGGCGGAATC TTTCAGAAAA CTAGGTTTAG GACGAGTCGT TGGGATGCGC ACTTGGGGTG GTTTGCTCGC CATCAACGGC GTCGCCGAAC TCATCGATGG GTCCGAGTTG AGTTTGCCTT CACAAAATGT GCTCCTTGTC GACGAGGCGA AGGGCGTAGA CGCGAGATCC GACGCGACAC AAGCGTACAC GAACGCGGTG GAAAACCGCG GCGTCATTCC GGACGTCACA GTTGACATAT CTCCCGCTGA ATATTCTCGC CGCGAGGACC CGCAGCTCGA CACCGCCGTG CGCGAGGCGT TGCAGTTACT CAAAGACACC GGCGCCGCCG GCGTCGCGAC CTACCTTCGC AAGATCCGCG AGGACGAGAC AACGGCCGCT GAATTAGAAC GCAAACTGAC GCGCAAACCT TGGTCGTTTT CCACGTGGGC GCCGCTTCCG CCGACCAAGG AGGAAGAAGA GAAGCAACTT CGAGCCAAGC GCCGCGCCGG GCGAAATAAT ATCCCGCGCC CGTGA
|
Protein sequence | MHHNRHRARS GRSTRSTTTN RARTRTTVRS AARAHYQDGD AHAASQEGYY RFPVIRGNEL FFVCEDDVYA TTISGLDKRE SGAETTPPRR LTQAHGAVQR LVVSPDGSRV AFACAEDGYT EIYVVDARGG PMKQLTHMGA SYARACCFSE DGRRVYFTSS GATAEPNGDE LWVVDCDGGA PMRMNLGPVH DFDVRNVNGK ELVVLGRNTE DTATKHWDGY AGGAGGEIWY GTLDNLLRLD LRLPNERLLR NVGNVSWFDD EHVAFTEAGG RSTAFKAKID TNEFNTRIVD CQGFSDSTAD NEDIKYFPVR HLSIDVSRQR IVYTRGGDVF VAEVVGGEHK SPIRLPIEWR GPRTQLAKRF VHADDWIEDW DLHPEGLTMM VLVRGQPFTM GIWDGPVLSY PPATKLRSPQ SSVIAPLASL AQKSQARVRH GAYLYDGERL VFVSDASGEE DIEVHWEEAE RPAKRLGLHH ELLGRVERLI PSPEAPLVAI VNHRNSLLIV NVETGEMRTA DTSSEVDGID DLTWSPCGNW LAYTYYLNNE RSCIRILDVR NGKVFDATNP VLGDHSPAWD PDGKYLYFLG SRELEPVYDA ATFGLNFPTV ERPHLIILQK DLRNPLLKEL RPPYDTESSS GSDYDSETDS DVGRKMDKRG NGRDYMPRKK PVVDKDDDGS DWSTVDGDSD DQILSSGDED DDYESDASYY HEDAPPAIEI HIDGLTERVV ALPMPISRYD CLCGLEDGRF MVVEYPPSRG SPGSVGLDYS SDEDDDGLGS LISYSIRDLR RSILIHGGVS EVSLSMDRKC MVVEKESDGF LELRVYKAGV RPEEEGSDSE ELDQMRCDRR TGLVNLDGRI RVLVDPAREW AQMLGEVYRR LRDDLWTEKI WNETIGDDWE VMFEEYVKVL PKVSTRTEFG DLLREIAAGV CYSHVAITSG DPGRSHRRHS AGYLGADFTW DGKVGGYRIL NIVKGDIWDD MRGGVLSKPG VNIHEGDILL SIDRVPLTED VPPAALLIEK GGVEVLLTVK IDSDGKGGID EALDRLMLKK QKNKKKDKRD DNAPKKGDVI PVRVRAMHSE IDARYRDMIQ KRTERVHSLS DGVVGYLHIP DMESTGYSEF WRHYASEVRK GSLILDLRGN TGGHISELLL AKLSQRALAW DIPRRGEVQV YPSNTPGPLV MLVDQRTGSD AELMAESFRK LGLGRVVGMR TWGGLLAING VAELIDGSEL SLPSQNVLLV DEAKGVDARS DATQAYTNAV ENRGVIPDVT VDISPAEYSR REDPQLDTAV REALQLLKDT GAAGVATYLR KIREDETTAA ELERKLTRKP WSFSTWAPLP PTKEEEEKQL RAKRRAGRNN IPRP
|
| |