Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32043 |
Symbol | |
ID | 5002356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 175037 |
End bp | 178202 |
Gene Length | 3166 bp |
Protein Length | 889 aa |
Translation table | |
GC content | 57% |
IMG OID | 640417777 |
Product | predicted protein |
Protein accession | XP_001418171 |
Protein GI | 145347434 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0485159 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0952764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCGCGCGAC GCGCGACGCG CGACGCATCG CGAACGAGCG CGCGACGCGA CGGTCGATCG ACGAAACGCG TCTCGACGCG CGCCGACGAC TCGCGACGGG CCCGGACGCG CGAGCCGAGC GCGAGGGCGC GGTGAAGACA CGAACGGGGG ATTGACCGCG CGGCGGCATG AGCGCGACGG CGAGCGCGCG CGGCGGCGCG GGATATGCGC GGGATGGACG CGCGAGCGCG CGAACGATGC GAGGGATGCG AGCGCGACGA ACGACGGGCG TCGGGACGCG CGGCGGTGGA CGCGGGACGA GGGAAGAGGG AGGAACGGCG GCGTCGCGCG CGCGAGGGTG CCCCGCGACG ACGACGCGGG CGATATCCGA GGACGCGGCG TTCGGCGCGA AGGAGGCGTC GAACGCGAGC GGGAAGATCG CGAAGGGCTA TGGACGGTGG TCGACGATGA CGCTGGGGGA GCTGATCGAT CGCGCGGGGG AGCTCGAGGC GAAAGTGGAC GGGACGCGCG CGAACGAGTC GGAGTACTTT TACATTTTTC GCGAGTTGGT GCGATGTAAA AGACTTCATG ATTCGGTGGA TTTATTGAAG CACATGAAGG AGCGAGGGGT GAAAGAGCTC GGGCGAAGAG TGTCGCACAG AGACTTCTTC AGCGCGTGTC GTTCGCTGCG CGTTGTGTCC GTCGGATTTG AGTTCGTGGA CGTCATCGAG AGCGGAGACA TTCGACCGTA CAACATGTTG GTGCACGCAT GCGCCACGGC GGGGGATTTA CAAGCGGCGA CGCTGGCGAT AGAGAAGATG AAAAATGCGG GATTTGAGCC TGATTTACAA GCGTACACCA CGCTTCTTGG AGCTTGCTCA AAGTGTGGGG ACGTCGAACG CGCGTTCGAG GTGTACGCCG AACTCAAGAG GGCGGGGTTT GAGCCAAACG AGAAGACGTA CGGATCGATG ATCGACGCCA TCTCGCGAGA CCTTGCGACG TCTCTTAAGG GTTCGAGGAA GAGACGAGTA GACTCCGAAC ACGTTCGTTC GACTTTACAA AGCTGTTTCA TGATTTTTGA GGAGATTAAA ACCACGAATA TGAAGCTGGA TAAGATAGTG ATGAATAGCT TGCTCACCGT GTGCGCTCGC GCGGCGGTGG TTCCGAGTGT GCGCAAAGAA GCTTGCGAGA AAGTGGCGAT GGTACACGAC GAAATGATTG AGCGTGGCTT TGAGCTCGAC TCTTACGCGT ACCAGGCTCT CATCTGCTGC GCCTTGGCTG AGAAGAATTA CACGAGGGCT TTTGAGTACT TTGACGAGAT GCACGACGCC GGTATCAATG GTACCACCGA AGTATACACA GTGATGATCA GAGCGTACGG GAAGCTTGGT AAAGCTGATA AAGCTAAGCT GATCTGGTAC GCCATGTTGG AGGATAACAT CATTCCGGAT CAAATGAGTT ACGCGACGAT GATGCGACTC GCCCTGTTGG ACGAGGACGA CGATTTCTGC GACGAGCTGA TGACGTCCAT GAGACGCAAT CGCGTCCGCC CAGGGCCTGA GTTGTACTCT ACGCTCACTG GTGTGGCCGC GCGACAAGGC GATGCTTCAC AGGTGGAAGA GATCATGCAA AACGCGAAGA AGCGAGGCGT GGTGGCTCCC ATCGAATGCT ACAACTCGCT CATCGCCGCG CACGCTCGCG CCGACCGGCC AGATCTCGCC GTCGAGGCTG CGGGCAAGCT CGAAGCGGCT GGGTACGAGC TTGACGCTAT TTCATACGAA GGACTTATTT TCGCGTACGC TTTCGCGAGA GATGTCGAAG AAGCAAGTAA CATGTTTGAG CGTCTCCTCG AGTCTGGTAT TCGCCCGACA TTCCCGACAT TCAACTGTCT CGTGGCCGCT CACGCTCGAA GTGGTGATAT GGACGAGGCA TGCCGCTTGG TAAGTGTTAT GAAACAGCAT GGATACGTGG AGGATTCGAT CACGTGGCGC GAGCTTCTTT TGGGCAGCGT TCAGTCGGGT GACATTGAAG CCGCATGGAA GATGTACAAA GAGTCCCGCG CGTCTGGAAA TGCCGATTCC GAGCGTGCAC TCAACACGAT TCTCGGTCAA ACTTTAGTGC ATATTAGAAG TCTCACGGAT ATGAAGAACC GATCGAACGG GAAACCGAAC GAGTTCGGCT CATTTGACGA CGAAGGGGAT TACATTGCCC AGGAATGGAC GGAACGTGCG GTCGCGGCTT TCCACGAAGC CACGCTCGCT GGAATCAAAC CTCGGGTTGA AACGTTGTCC ACAATGCTCG CTTGCTTACG TCCACCTTCG ACGGACGAGC AAAATGCAGC TGAGTACAGC GAAGTTGCCC GAGCCGTGAG TCACGAGACG AGCTCCCATG AAGACGCCGC CAGGTACTAC CCTTCGCAAG CCCTCATCAT GTACGAAGAA GCTCAAGGCT TGGGTATTGT ACCGAAGTTT AGTCGTGATG ATGAAGACTT TGTCTACGAT ATCCGAGAGT TCCCACCAGC GGCTGCCGAG GTCATGTTGT TGACGTGGTT GCGTGTCGTT CGCCGGCGCA CAGACGCGCA TGGATTAGAC GCTACGATAC CGACTATGAC TATTCGTGTG AGAGCTGACG AAGAAGTCGT TCGAATGATC AAGGAGCAAC ACATGGATCG AATTGATCAC TCGCTGGGTC GCTTGTGCAA GACTGGAGAA CGGTTGCTGA CATTATTGCG CCGCCTCCGT ATCAATTACG GTGGTGGCTT GCAGGAGGGC ACTATCGAAT TGAGTGGCCA CGCGCTCGGT CGATGGCTTC AAGGTTTTGT TCCGGGCGAC TTCGGCAATC ACACCGGTTC CGTGTTCAGT GAGCATTCAC TTTCGGGCGG GGTGAGAGAC CAAGCGATGC GCATCCGCGC CAATTCTTTC GGCAGTAAAG ACGACGACGT GTGGACTCCG TCAAAGATGC GTCAAGCCGC ATTCAATATC CACGATTATT ATGGCAATGA TGATGACGAC CCGTCAGATT TTGGTGCCAG GCCATTCTAT CCCAAGAACT GGGTATCACA GAGTTACGTG TCGAGCTATG ATGAGGACGA TGACGACGCC ACAGATCTCG AGCGCATCTT AGGAAGTCGC AAGTAACGTA CGCACGCGCG CGCAGAAGTA CGTAGATTAT TTTTTTAGGA ATTCAT
|
Protein sequence | MTLGELIDRA GELEAKVDGT RANESEYFYI FRELVRCKRL HDSVDLLKHM KERGVKELGR RVSHRDFFSA CRSLRVVSVG FEFVDVIESG DIRPYNMLVH ACATAGDLQA ATLAIEKMKN AGFEPDLQAY TTLLGACSKC GDVERAFEVY AELKRAGFEP NEKTYGSMID AISRDLATSL KGSRKRRVDS EHVRSTLQSC FMIFEEIKTT NMKLDKIVMN SLLTVCARAA VVPSVRKEAC EKVAMVHDEM IERGFELDSY AYQALICCAL AEKNYTRAFE YFDEMHDAGI NGTTEVYTVM IRAYGKLGKA DKAKLIWYAM LEDNIIPDQM SYATMMRLAL LDEDDDFCDE LMTSMRRNRV RPGPELYSTL TGVAARQGDA SQVEEIMQNA KKRGVVAPIE CYNSLIAAHA RADRPDLAVE AAGKLEAAGY ELDAISYEGL IFAYAFARDV EEASNMFERL LESGIRPTFP TFNCLVAAHA RSGDMDEACR LVSVMKQHGY VEDSITWREL LLGSVQSGDI EAAWKMYKES RASGNADSER ALNTILGQTL VHIRSLTDMK NRSNGKPNEF GSFDDEGDYI AQEWTERAVA AFHEATLAGI KPRVETLSTM LACLRPPSTD EQNAAEYSEV ARAVSHETSS HEDAARYYPS QALIMYEEAQ GLGIVPKFSR DDEDFVYDIR EFPPAAAEVM LLTWLRVVRR RTDAHGLDAT IPTMTIRVRA DEEVVRMIKE QHMDRIDHSL GRLCKTGERL LTLLRRLRIN YGGGLQEGTI ELSGHALGRW LQGFVPGDFG NHTGSVFSEH SLSGGVRDQA MRIRANSFGS KDDDVWTPSK MRQAAFNIHD YYGNDDDDPS DFGARPFYPK NWVSQSYVSS YDEDDDDATD LERILGSRK
|
| |