Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14782 |
Symbol | |
ID | 5000917 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 806803 |
End bp | 809566 |
Gene Length | 2764 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 61% |
IMG OID | 640416338 |
Product | predicted protein |
Protein accession | XP_001416769 |
Protein GI | 145344499 |
COG category | [R] General function prediction only |
COG ID | [COG2603] Predicted ATPase |
TIGRFAM ID | [TIGR03167] tRNA 2-selenouridine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.597893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGGT TCGCCGCGCT CGCGACGGCG CCGAGCGCGG TGCCGCGCGC CGCCGTCGAA GCGGTCGTCG ACGCGGGCGC GCGCGCGGCG CTCGTCGACG TGCGCTCGCC GGGAGAGCAC GCGCGAGGAC GCGCGCCGCG CGCGATAAAC GCGCCGCTGT TCGATGACGA CGAACGCGCG GCGGTGGGGA CGGCGTACAA GACGAGAGGA CGAGGCGAGG CGCTGGTGCT GGGGATGGCG GCGGCGGCGC CGCGGTTGGA GGCGATCGTG GAGCGCGCGA AGCGAGCGGT GGAGACGGTG GCGACGACGC GAGGGAACGG TGACGGCGAA GACGATGGTG AGATTGATGT GTACGTGATG TGCTTTCGAG GAGGGATGCG GAGCTCGTGC GTGGGGTGGT TGTTGAAGGA ACGGTTGCGC GGCGCGAGAG TCCACGTGGT CGAGGGCGGA TACAAGGCGT TTCGGAGGTG GGCGCTCGAG CGGTGCGGAC CGACGCGCGG ATTGCCCGCG CCGAGGGTGT GCGTCGTCGG TGGACGCACG GGGGTGGGGA AGACGCGCGC GCTGTTGGCG CTGCGCGCGA AAGGGGAACA GATTATCGAT TTAGAGGGGT TGGCGAACCA CGCCGGGAGC GCGTTCGGGT GGGTCGGGCG AGAGCCGCAG CCGACTTCGG AGCACTACTC TAATTTAGTC GCCTGCGAGT GGCACGCGTT GGATGCGAGC AGATGGGTGT TCATCGAGGA TGAGGGACCG CACGTCGGGC GGTGCTCGGT GGATCCGTTG TTGTTTGAGC GCATGCGAAA CGCTCCTCTG GTGCTTCGCA TGGTCGCCCC GCGCGAGGTG CGGTTGCACA CTTTAGTGGA AGATTACGCG ACGTCAGAGT TGCGTTCTCA TCCGGAATGG ATGCCCACAA TGCGTGAAAG CGTCGATAAG CTCGTCAAGC GGCTCGGTGG CGACCGCGTG CGCGACATTC GAGAAAAGCT CGAAGCCGGC GATTTCACCG CCGTCGCCGA AGGTTTGCTC GAGTATTACG ACGGTTTGTA CGACAAGCAC CTCATGAGCA AGCGCAAAGA CCGTCGCGCG GCGCGGGCCG CCAACGCCAA CGCCGCCAAC GCCAACGCCA ACGACGACGC GTGTTCGGTG CAATCCTTCA CGTCGTCCAC CACCGAAGAC GAGGCCCGAG CCGGCGTCGT CGTCGACGTC AACTGCGTCG CCAACGCCTC GGGTGGCCTC GACGAGGACC TTCTCGTCCG CGATGTTCTG TGCGCCGTCG CTCTGTTCGA ATGCGAAGGA CCGAACGACC CCGAACGCGC GTAGTCGATT CGATTCGCGT CTCGTCTCGA GCGCCGACTC GAGGCTGACT CAGCACGGTC GTGCCGACAC GCTTTGCAAT AATCCACGCG CGAGTTCGTC ACAAACGCGC GCGTCTCACG CACGTTCGAA CGCGCGAACG CTCAAATCCA CGAAAAAGAT GTTCTCTCTG TCCTCTACCG CCGCTCTCGC CCGCGCGCGC GTCTCGGCGC CGCGTAAAAC CGCGCGAAAG TTGACGATTC GCGCCTCCGT CAACTACATT CTCGATCGCG AGGGATGTTC GAGCGCGCGG TCCGACGACG ACGGCGCGCT GCTCCAGACG GCGGCGAAGA CGTGCCTGGC GGTTCCGATC GACGTGAACG CGCTCGTGGC GCAAAATAAG GGTGACTTGG AGTTTGAGAA CGAGCAGTTG AAGTTTGAGC TCGACGACTC GAGCGCGCTC GTCGTGCAAT CGCTCAAAGG CAAAACTCTG ATCGACGGGA AAGCGGCGCC CAAGGGACGA AAATTGAAAC TTCGCGCGGG ATCGAAGATT AAGATCGCCG ACGAGGAATT CACGGTGTAT CGAAACACGC ACGCGCACGC GTGAGACGCG AGTCGCCGAA CGCGAGTCGC CGAACGCGAA TCGAATCAAG ACCACACTCT TTGACTCCGC ACCTTTTATC GTTGTACCAT CGCTGTATAG CTACTAACTC TGTTGTAAAA CACGAGCGGA CCTTACTCGA TCACTCAAGT GCTCCACCGC GGATCCGGGG CGTGTGCGGC CGTCGGCGAA TGGCGAAAGA ATATAAATGA ATACGTATTA AAATCGATTC GAGTCAATTC ATTAATAGGC CAGTTCGGGC TGGTCCACGT ACTAATCAAA TAATAAGATT TTTACAAGGG GCGTTCAGAC GCGCAGGACT TTGACGACGC CGTTTGAGTT CGCGGCGACG AGCACGGATT CATCGCTTCG CCACGCGCAC GCGCTGATGA AACCGCGAGG TTCGTTCGCG TCCCGAGCGA ACCCGATGGA CGCGATGGGC GACGAAAGAT CCTTGCGGTA AATGAATACC TCGTTCGTTT CGCTCCCGCA CGCGACGTGG TCGCCGACGG CGGTGAGGCC GACAAAGTTT CGTTCGTTGA CGTGTCCTTT GAGCGCGCAC GTGAGTTCGC CTGTGTTGAC GTTCCACACG TTGATGGTGT TATCCGTCGA TGCGCTCGCG ATTTCGTTGG CGTTGAGGTA CTTGACGTAA CTGACCGCCT TGCGATGGCT GTCGAGGATT TGCACCGGCT CGGCCAAGCG TCGCAAATCA AAAATATACA CCTTTTGGTC GACGCAACCG ACCGCGATGC AATGCGCGTT CTCGGGTGAA TACTGCGCAC AGCAGACGTT CGCTTTCATG TCGATCTCGT GCACGCTGTT CGGTTGATCG GTGTTCCAAA TCTTCACCAA GTAATCATCC GAGCCACTCA CTAA
|
Protein sequence | MERFAALATA PSAVPRAAVE AVVDAGARAA LVDVRSPGEH ARGRAPRAIN APLFDDDERA AVGTAYKTRG RGEALVLGMA AAAPRLEAIV ERAKRAVETV ATTRGNGDGE DDGEIDVYVM CFRGGMRSSC VGWLLKERLR GARVHVVEGG YKAFRRWALE RCGPTRGLPA PRVCVVGGRT GVGKTRALLA LRAKGEQIID LEGLANHAGS AFGWVGREPQ PTSEHYSNLV ACEWHALDAS RWVFIEDEGP HVGRCSVDPL LFERMRNAPL VLRMVAPREL VKRLGGDRVR DIREKLEAGD FTAVAEGLLE YYDDVRFHVD LVHAVRLIGV PNLHQVIIRA TH
|
| |