Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31833 |
Symbol | |
ID | 5001773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 608427 |
End bp | 611828 |
Gene Length | 3402 bp |
Protein Length | 1133 aa |
Translation table | |
GC content | 55% |
IMG OID | 640417194 |
Product | predicted protein |
Protein accession | XP_001417820 |
Protein GI | 145346696 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0358806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.410614 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATGGA GGACGAGCGG AAGGTTATGG ACGGCGGTGG TGTGGACGCT GGCGACGATT GCGCGAGGGA GGACGGCGAC GGCGCAGTAT CAGACGACGG CGACGACGCT GACGTTGGAG CACGTGTGGC CGGATACTGG AGCGGTGTAC GCGTCGACGG TGGTGTCGTT GTATGGGAGT GGATACGCGA ACGCGTTGCC GCCGATGGGG TGTCGGTTCG GGGAGATTGT CGCGAATAAG GACTCGGCCG CGAGCACGAC TTCGAAAGTC GTGTGCGCGA CGCCGACCAA CGTCTTCGCG GGTTTCGTCG CCGTTGGGTT GGCGCAGGCG ACGGGAAAGA GATACGTTCC CGGGTCAGAT GATTTAGTCG TGGACAACGG GCAGCATTCT TTCGAATTCG TCGTGCCTTG GAAACTATCC AAAGTGAATC CAGAGTACGC GTACAAGAGC GGCGGCGAAG TTCTGCGACT TTCAGGGACA CATTTTCGTC CAGGAATGCT GTGCACGTTC AAAGACTCTT CGCTGACATC CGAGTACAGA TTCATTTCTT CTGCGTTAGC CATGTGTGAG ACCCGTGCGT CGAGCGAAGC GGAAGGAACG GTGGATTTGA ATCTCACTCC AACTCACGCC GTCGGAGGCG GCGGCGCGAG CGTCGAGTAC CAAACCGCGC CCATCATAGA TGGGCCTTTA GTGTCCACCA CCGCAGTTGG TGGTGACGTA GTGATACAAG CATCAAGTAG TACTCCGCTG AGCGGCGCCA TAGCATCGTT CACGGCGAGT CCGATTCGTA TCGGATGCTG GTTTGACGGC ATTTGGGTGG CGGCGACTCT GCGCAGCGAA AGAGAGTTGG TGTGCAAAGC GCCTTTGCAG TCATTCGGCA CTCCATCGCT GAGTGTCGTG GATATGTATG CTCAAAGAAT GTTTCCGACG AATGCAACTC AGACTGGTTG GTTCACCTCA TTCACCGTGA GCAAAGACGA AGTTGTCGAC GTCGTGTTGC CTTCGGTTGG CAATGCGATG CGAAATACGG TGGCCGATTC TTCGACTCTG GTCGACTTTT TCGGCCGCAA CCTTATTGCT GGCTCGGGTT CTGTTGGTGC ACGCATTTGC CAAGCCTTGA ACCCGGTTAC ATCTCCCGTT TTAGTGACGA ATCAGGTCCA AAGCAAATGT GATTTTCCGG CGACACCATC CGAGACAGCG GCGCTGTCCA CGCTCAGATA CGGCTTTCAC GCTGTGAGTG CAGGAGCCGG TGCAAGTGCC TCAGCGCAAT TTCTGATTGT CTCGCCACCA CAAATTACGT CGGTGGTACC GGGATTTCTT CGTGCGGGTA CTGTCGCGAC ATTTTCGGGA CAAAATCTCA TGGACCCGTT TCGGCAAACA TGGTGTGGGC ACGACGGTAT AGCTCTCGTC GCGCACGCGG TGTCGAGCGC GCTCATCCGA TGCTCAGTCC CGTACCATCA TCAGCTGCCT TCGGGTTCCT CTAGTTCTGC GCACGATCTA ACCATAGACG TGCTTTCTGA TTTATCGTCG CCGGGGGCCG GAGGAACCAC CATGGGCTGG CTACCAGTAG CGAATGATTT AGAGAGCATC ACGCCAAACG TCGGAGCGAC GAGCGGTGGC ACTCGAACGG TGCTGAAACT CACGGGTGGA ACTATCCCAA CTTCATCTTA TTACACACCC ACGTGCAGAT TTGGCACGAT AGTAACGTCG GCGATACACG TCCCTGGAGG TGGAGTTGCG TGTAGCTCTC CTGCATACGC CGCGCGTAAC GTGACGGTGG GCATTGACGT CGAGTCGACG ATAGAATTTC AGTACGTCGC ACAGATTGGT GTGAGTGCCA TAGTACCAGG AGCTCTTCCA CAGAATGGGG GCTCGATATC TGTGTACTTG GATGCGGCAC TTTCGTCTTC GTATTCAGCT GACTGCGTTT TCGTCACCAG CGGCGGGGAT AAATTAAAGT CTTCTCTCGC CGATTCCTCG GGAACGCTCA AGTGCGCCTC GCCGCAGACT GGTATTGGCT TTGCAACGAT GGCGATCGTC GTATCTGCGG CTAGCACGAA CAACACTGCG TTTATAGACG AAGCAGCTGG GCAGTACTTC GACCTTGAAG TTCAAACCAG AACGCCTGGG GTGGAAGTTT TCCTCCCAAC GGGCGCGAAT TGGGTACAGG CGGAGGAAGT CATACACGCT GTGACGTCTG ATGGATCTCT TCTCGGCGAC AATTCGGGAG ACGACGATTT CTGGTGTGTG TTCGGAAAAG CTGGAATATA TGGAGGGTCA ATATATTTGA GTGCGGCGAG TAGAGTATCG GGAACTATCC TGAAGTGCAC CGTACCAGAC CTCGATACGC TAATAGTTTC TGGGCAGAGT GGATTTGAGA TTGTCATCGG TATTTGTTCT TCGACTGAAG TTTCGGCGAG CAGTTGCACC AGTTACGCGG GAACACGCGT GAGATACGAG AAAAAGCTGG GAGTTTCTTC AGTACTTCCA GTGAACGGTA CGCAAGCAGG AGGCGATGTT GCAGTTCTCG TTGACTCTGC ATCCGTGAAA GGTTTCGGCG CAAACGTGCC AAGCTGCAGA TTTGGCACAA TCTATCCCGT TGCGGGAACT TCGGTAACTG GAACGGGCGA GATAAATTGC GTGACACCGG CGCATGCTGC GGGTGTCGTT CCTGTGAGTG TTCCTCCGCT CGATTTGGGG TTGACGGCTT TGACATTTGA GTATATTTCC GTGTCCACTT CGCAATCGGT CTTGTCCACG ACGTACGGCG CCGATCCGTA CGTCGTTGCA TACCTGATTG AACCGACACC AATGATCACA GAGGTCGTGC CTTGGGTTGG TTGGAGTGGA AGTGTTGTCA CCTTGCTAGG CACAAACTTT CCAACTGGAT CCGCCGTCAA GTGCAGATTC GGATCTGTCT CCGTCGATGC TCAGGTTGTT TCTACGGCTG TGATACAGTG CGGCGATACT TCGCCGATTA CCTCTGACGA CGTCGAAGAG CAGCGCGTCG CTGTCACGAC GAGTTCGGGC GATTCAAATC CGAATGTGAC GACGTTAGCA CACTATGTCA TTACTCAAGG AGATATCAGC GCCATTGATG CCGCTGACGG TTGGCAACAG GGTGGAAACG TAGTCGGCGT CACCGTTGCC AAGTGGGTCC CTGAAGGCTA CACATCCTGT CGCTTTGGCA CGATAACCGT TCAGAGTAGA GGCGGAGACG GCTTCGGTGC GATTGGCAAG GCGTCGGTAT CGCAGTCATC GCAGTGGTGG TCAGATTCGA CTGATGGCAA GAAAATAGAG TGCGTATCTC CAGCAGGAGC GCAAGGAAAC GTAAATTTGG GAGTATCCAT CCTCGGAAGC ACTGCATCGT CTTTCATTGG TACTACGTTT ACCTACATTT AG
|
Protein sequence | MRWRTSGRLW TAVVWTLATI ARGRTATAQY QTTATTLTLE HVWPDTGAVY ASTVVSLYGS GYANALPPMG CRFGEIVANK DSAASTTSKV VCATPTNVFA GFVAVGLAQA TGKRYVPGSD DLVVDNGQHS FEFVVPWKLS KVNPEYAYKS GGEVLRLSGT HFRPGMLCTF KDSSLTSEYR FISSALAMCE TRASSEAEGT VDLNLTPTHA VGGGGASVEY QTAPIIDGPL VSTTAVGGDV VIQASSSTPL SGAIASFTAS PIRIGCWFDG IWVAATLRSE RELVCKAPLQ SFGTPSLSVV DMYAQRMFPT NATQTGWFTS FTVSKDEVVD VVLPSVGNAM RNTVADSSTL VDFFGRNLIA GSGSVGARIC QALNPVTSPV LVTNQVQSKC DFPATPSETA ALSTLRYGFH AVSAGAGASA SAQFLIVSPP QITSVVPGFL RAGTVATFSG QNLMDPFRQT WCGHDGIALV AHAVSSALIR CSVPYHHQLP SGSSSSAHDL TIDVLSDLSS PGAGGTTMGW LPVANDLESI TPNVGATSGG TRTVLKLTGG TIPTSSYYTP TCRFGTIVTS AIHVPGGGVA CSSPAYAARN VTVGIDVEST IEFQYVAQIG VSAIVPGALP QNGGSISVYL DAALSSSYSA DCVFVTSGGD KLKSSLADSS GTLKCASPQT GIGFATMAIV VSAASTNNTA FIDEAAGQYF DLEVQTRTPG VEVFLPTGAN WVQAEEVIHA VTSDGSLLGD NSGDDDFWCV FGKAGIYGGS IYLSAASRVS GTILKCTVPD LDTLIVSGQS GFEIVIGICS STEVSASSCT SYAGTRVRYE KKLGVSSVLP VNGTQAGGDV AVLVDSASVK GFGANVPSCR FGTIYPVAGT SVTGTGEINC VTPAHAAGVV PVSVPPLDLG LTALTFEYIS VSTSQSVLST TYGADPYVVA YLIEPTPMIT EVVPWVGWSG SVVTLLGTNF PTGSAVKCRF GSVSVDAQVV STAVIQCGDT SPITSDDVEE QRVAVTTSSG DSNPNVTTLA HYVITQGDIS AIDAADGWQQ GGNVVGVTVA KWVPEGYTSC RFGTITVQSR GGDGFGAIGK ASVSQSSQWW SDSTDGKKIE CVSPAGAQGN VNLGVSILGS TASSFIGTTF TYI
|
| |