Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25732 |
Symbol | |
ID | 5006293 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 224529 |
End bp | 226438 |
Gene Length | 1910 bp |
Protein Length | 625 aa |
Translation table | |
GC content | 65% |
IMG OID | 640421714 |
Product | predicted protein |
Protein accession | XP_001422132 |
Protein GI | 145355790 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.788569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.748667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTTGC GCCGCGACGA CGCGGCGACG ACGACGGACG CGCGCACGGC GCTCGCGCGC GATCTCGCGC GCGATGGGTG GTACGGCGAC GACGCGGACG ACGTCGAGCG CGGCGAGCGG CGGCGGACGC GCGGACGGGG TGGGTTTTGG ACGCGCGCGC GGTGCGCGGC GTTGGCGTTG GGTGTTACGG CGTGCGCGTG CGCGAGCGGC GCGGGCGTCG CGTCCACGGC GCGCGGACGC GCGGGCGCGA GCGCGCTCGG GGTGCAACCC GACGCGAGAG GGACGCCGCC GCCGGAGAGC GCGTTCATGC GAACGGCGAG GAAGGCGTTC GGGGCGGCGT TGATGCGAGC GAGTTATCAC CAGACGCGCG CGAGCGGCGA CGTGGACACC GCGGAGGAGG CGGATGCGCT CGCGGATCGA GAGGCGATGG CGGTCGTCGT GGATGAGGAT TTCGAGGACG ATTCCAGGGA GGGTGATTTT CTGCGCGACG CGGGGAGACG CGCGGGGCCG GTTTCGGAGC TCGAACGGAG GCGAGAGGCG CGAAGGCGAG CGCATCGAGC GGCGGCGCGG CGCGCGGTGG CGCGCGCGAG CGCCGCGACG TCCGAACCAC CGCCGGCGCC ACCAGCGCCG ACACCGCCGG CGCCGGCGCG CGCGACGACG CACGGCGCGG CGCATCAGCA CCACGACGGC GGCGAGGGCG GTTTCTTGTC CGCGGCGCAC GTGCGAGAGG CGAAGCGACA ACACGCTCGT CACGACCGTC GAGTCGAACG GCGAGTGGAA TCGCGCGAAG ACGAAAAGGA AGAGAAAACA AAGAGGAAGG ATGCGGCGGA GGGTGATGCC GTCGAGGATC AACCTGGTGC GACGATGGAC GCGTACGACG ACGACGAAGA CTTCGTAGAG GCGGCGCCGT CAAAGCCGGC GGATGAGAGC GCGAAACAAA AGTACTTTAA ATCCATCAAC GGCGGTTACT TTGCCGCGCC ACAGACGACG CGAGGCGACG ATATCGGACG AATGATGAGA CACGACAGAG CGAAGCACCA AAGTTTAGCG TCCTTGCACA ACCGACCCGG AGCGATTCCG GGCTTGATCG ACCATTTGCA AGCCAACGGC GCTCAAGTCG GCCAAATGCG TCTGATGACG TACACGAATT CCGCGTATTG GCCCATGGCG AAGCTTTTCA TCGAGTCTGC CCAGCGCATA CCGGGATTGG CCGATGCGTT GACGGTGATG GTGAGCGACC GCGCGACGCT CAAGCAGTGC GTGGCCACGG GGGTGATGTG CTTCTTAGAT ACGGATATGA TCGACGTCCT CGGCGAACAC ATGAACAAAG ACGGGAGCAT CCAAGCCGGA CAAGAAGATA TGAGTGGAGA TTTAGGCAAA GCCTTGCGCG TAGTGTGGAC GTGGCGAAAA GTGCACGTGG TGTACACTCT CGTGAACGCC GGCTACGGGT GCTTGTTCCT CGACGCGTCG ACGCTGCTGC TGCGCGATCC GCGCTTGCTG ATCAAGCAAA AGCTCGACGC TGGCGCACTC TTGGTGACGC TTTCGGACTT TGGCGGCGCA CTGGAGCAAA AGGCCATCAA CACGGGCTTA ATCGGGGCGC GGCCGAATGA GTACGTCGGG AAATTGCTGG AGGATTGGAT GGCGCTCGAA CCCGAGGCGA CGGATACCGA GCAAGCTTCG CTCACATGGA ACATTGCGCC CAATGCTCGC GCAGACGGTG TTATCATCAC CGCCCTCTCG CAGGAAGTCG CACCGTCGTA CCTGACGTTC GACGTCTCGC AGCACTTGGA TCTAGACGAA GACGGAAGCG GCGAGCATCG CGGGTACATC GTACACGCCG CGTATTGCGG CTCAATCTCC GGAAAGTCAG CGTTCTTGTC GCGCGTGTCT CAATTAGCCG AAAATCCTAA
|
Protein sequence | MVLRRDDAAT TTDARTALAR DLARDGWYGD DADDVERGER RRTRGRGGFW TRARCAALAL GVTACACASG AGVASTARGR AGASALGVQP DARGTPPPES AFMRTARKAF GAALMRASYH QTRASGDVDT AEEADALADR EAMAVVVDED FEDDSREGDF LRDAGRRAGP VSELERRREA RRRAHRAAAR RAVARASAAT SEPPPAPPAP TPPAPARATT HGAAHQHHDG GEGGFLSAAH VREAKRQHAR HDRRVERRVE SREDEKEEKT KRKDAAEGDA VEDQPGATMD AYDDDEDFVE AAPSKPADES AKQKYFKSIN GGYFAAPQTT RGDDIGRMMR HDRAKHQSLA SLHNRPGAIP GLIDHLQANG AQVGQMRLMT YTNSAYWPMA KLFIESAQRI PGLADALTVM VSDRATLKQC VATGVMCFLD TDMIDVLGEH MNKDGSIQAG QEDMSGDLGK ALRVVWTWRK VHVVYTLVNA GYGCLFLDAS TLLLRDPRLL IKQKLDAGAL LVTLSDFGGA LEQKAINTGL IGARPNEYVG KLLEDWMALE PEATDTEQAS LTWNIAPNAR ADGVIITALS QEVAPSYLTF DVSQHLDLDE DGSGEHRGYI VHAAYCGSIS GNRKS
|
| |