Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37792 |
Symbol | |
ID | 5005993 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 39719 |
End bp | 42787 |
Gene Length | 3069 bp |
Protein Length | 934 aa |
Translation table | |
GC content | 58% |
IMG OID | 640421414 |
Product | predicted protein |
Protein accession | XP_001421820 |
Protein GI | 145355127 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.772788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.533349 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTCT TCGCGCCGTA TCGCGCGCTC GGCCTCGTGA ACGCGCAGCG AGGATGCGCG CTCGTCGTCA AGCGGCGCGG GACGGAGACG TTCGTCACGG TGAGCGCGGA GGACGCGTTC GCGGTGTACG ACGCGCGAAA ACTCACATTG GTCTTTCGAA GCGCGCGGTT CGCGACGAGC GACGCGCGAG GGATCGGGGC GATGGCGGTG AGGAAGGATT ACACGTTTTG CGCGATCGGA CGGGAGATAA GGTGCGCGAA ACGATTGGCG GAGTGCTGCG AAGGACTGGG GCGGGACGAC GGCTCGGGAC ACGACGCGCG GGTGACGACG CTGCACGCGT TCGGACGACA CTTGGCGAGC GTGGACGAAA ATGGGGCGGT GAAGATGTGG GACATCGATG ATGACGCCAT GCGGGCGCGC GAGCGGGCGT GGGCGGTGGG ACGAGGAGAA CCGACGGGGC CGGAGGGCGA GAGCGACTCG GAGCTCGAGC TCGGCGCGCG AGCGATGGAG ATGCCTCGGA CGTTCGCGGC GACGACGGTG TGTCATCCCG ATGGATACGT GGATAAGTTA CTGTTTGGCT CGAGCGATGG GCGATTGGCG CTGATGAACG TGCGAGCGGG TAAATTGGTG CACGAATTTG CGGGCTGGGG GTCGGCGGTG ACGGCGCTGG AGAACTCGCC GGCGACGGAT GTCGTCGCCG TCGGTTTGGC GGATGGGCGC GTGCTTTTGG TGAACGTATT AGAAGATAAA GTTTTGTTTA CGCTCACGCC CGAGCGTGGG GTAAAGGTGA CGGCGCTGGC GTTTAGAACG GATGATCAGG ACGACGTGCT GTGCGTCGGC GACGAAACGG GACGCGTGAC GGTTTGGGAT TTGGAAAAGC GCTCGTTGCG CACGCTCATC GTGCAATGTC ACGAAGGTCC GGTGGTTTCG TTGAAATTTC TCGATGGACA ACCGGTGATG GTGTCGAGTG GGTTCGATAA CACGTTGAAG GAGTGGATTT TTGATAGAGA AGACGGCGAC GCTCGACTCT TGCGCTTTCG CGCCGGACAC TCCAAGCCGC CGACGAGCGT GTCGTTCTAC GGTGAAGGCA AGAAACTTTT AGCGGCTGGG AGCGATCGAA CGTTGCGATT TTTTCACGCG TTTCGTGATC AGCAAAACAT CGAGTTGAGC CAAAAGAATG TCAGCAAGCG CGCGAAGAAA ATCGGCGTCG CGGAGGAAGA GTTGAAACTT TCTCCGGTTA CGAAGATGGC GTGGGGCGAG CTTCGCGAGC GCGATTGGGC AAACGTCGTC ACCGCGCACG AAGGTTCAAA CAAGGCGTAC ACGTGGCGTA TTTCCAAAGG TGCGCTCGGT GAACACATTT TGCAATGTCC GAAGGACGAC GGCAAGTGCG AAGTCAAGTC GGTTGCGATT AGCGCGTGCG GCAACTTTGC CTTTCTCGGT GCCGCAAACG GGGCGATCCA TCGATTCAAC TTGCAATCGG GCGCGCATCG TGGCGCGTTC GAGCGCGTCC TCGACGCTGA TGAGGTGATG CGCACAAAGA AGAAGAAGAA CGGAAACGAA GGGTACAATT TCCCCGGCGG CAAGCGTTCG TTTTGGGCTC TCGCGAATCA AACCGGTGGA AACGCAGAGG GAAAATTGCG CGTTGCCGCG CACGACGGTG AAGTGACGTG CATACAAGCA CACTGCGCGA ATAGAAGCGT CGTCACCGCG GGTGTCGACG GGATGATTCG CGTGTGGAAA TTTAGCGAGT TGAAGATTGA CTTGGAAATC GACGTCGGGT GCGGCGTGCG GTGCGGTCAC TTGCACGAGG ACTCATTGCT CGTCGTTGGA TGCGCTGATA AGCACGTGCG CGTATACGAT ACCATGACGG GTAAACGTGT TCGCACGTTC AAGCCACGCG GTCAGGAAAA TGAGGCTGGA GATATCACGA GCGTGCAAAT CAGTGAAAAC GGTAAATGGA TCTTCGTCCT CGACACCACC GGAACGATAC GAGTGTACGA TATTCCCGCC GCGAGATTGA TTCAACACAT GATTCTCGGT GCGGATAAAG TCACCGCCAT GAGCTTTTCA CCGCGAATGG ACTTTTTAGC CACCGTGCAC GAGAACAGGG TCGGGTTGTA TTTGTGGGTG AACATGCCGA TGTTTGATTA CGACACCAAA TTGGCGTACG GGCGCAAAGT TTCCATCGCA CTTCCGAGAA AACACGCGGA ATCCGACGCA GACGGCGGCG TGCGCGATGC GTCTGAGGAG ACGAACAAGG AGTCCAACGA TACCGACGTG TACATTCACC CGTTCGAAGA AGACGAAGAA GAAGAGCATC GCAATTTGCA AGAGTTGGAA GAATATTTCA AAGAAATGGC CACCGGCCCG AAACAAATAG CCCCTGGCAT GATTACTCTG GCGATGATGC CTCAGACGCA AATCGAAATG CTACTGAATC TGGAGACGGT GCAAGCGAAG AGTAAGGTGA AGACCGATGA CAAAGAACCG GAATTGGCGC CGTTCTTTCT GCCCACAGCC GCCGCGAGCG ACGACGTGCG TCGATCTGTC TTCGATCCCG CGCGAGAATC CGAGCTCAAC GCCAAAGACG ACACCGGCGA TGATGATTCC AAAGCACCGA AGAGTCGAAT TTTGCGTCAA GGCGCGGATC TCGCCGGCGC GTCCGCGACG CCGCTCTTGA CGTTGATTCT TCGAGGAGAA CGATCGGAGG ACTACACCGA AGCGCTCGAA TTCTTGAAGA ATGCGTCGAT ACACGTCGTC GACGCCGAGC TGCGATCGCT CGGACCTTGG GATCACAAGT TCATGTCCGA GGACGACGTG AAAACTTTGC GGAGCGCCAT CAAATTTTTC ACCGCCGCAA TCGCGAGCGG CATGTACTAC GAGATGGTCA ACGCCCAACT CAACGTGTTT CTCAACGTGC ACACCACCGC GATCATGCAA TCCAGCGCGC TCGTCGACGA GTGTCACGCC CTGCGAGAGG CCATGCACAA ATCCTACAGC AGGATCGACG ATTTGTTCAA CGAAATTCGG TGCACGCTCA GTTTCCACAT CGGCGACGCC GGCGTCTAG
|
Protein sequence | MALFAPYRAL GLVNAQRGCA LVVKRRGTET FVTVSAEDAF AVYDARKLTL VFRSARFATS DARGIGAMAV RKDYTFCAIG REIRCAKRLA ECCEGLGRDD GSGHDARVTT LHAFGRHLAS VDENGAVKMW DIDDDAMRAR ERAWALGARA MEMPRTFAAT TVCHPDGYVD KLLFGSSDGR LALMNVRAGK LVHEFAGWGS AVTALENSPA TDVVAVGLAD GRVLLVNVLE DKVLFTLTPE RGVKVTALAF RTDDQDDVLC VGDETGRVTV WDLEKRSLRT LIVQCHEGPV VSLKFLDGQP VMVSSGFDNT LKEWIFDRED GDARLLRFRA GHSKPPTSVS FYGEGKKLLA AGSDRTLRFF HAFRDQQNIE LSQKNVSKRA KKIGVAEEEL KLSPVTKMAW GELRERDWAN VVTAHEGSNK AYTWRISKGA LGEHILQCPK DDGKCEVKSV AISACGNFAF LGAANGAIHR FNLQSGAHRG AFERGKLRVA AHDGEVTCIQ AHCANRSVVT AGVDGMIRVW KFSELKIDLE IDVGCGVRCG HLHEDSLLVV GCADKHVRVY DTMTGKRVRT FKPRGQENEA GDITSVQISE NGKWIFVLDT TGTIRVYDIP AARLIQHMIL GADKVTAMSF SPRMDFLATV HENRVGLYLW VNMPMFDYDT KLAYGRKVSI ALPRKHAESD ADGGHRNLQE LEEYFKEMAT GPKQIAPGMI TLAMMPQTQI EMLLNLETVQ AKSKVKTDDK EPELAPFFLP TAAASDDVRR SVFDPARESE LNAKDDTGDD DSKAPKSRIL RQGADLAGAS ATPLLTLILR GERSEDYTEA LEFLKNASIH VVDAELRSLG PWDHKFMSED DVKTLRSAIK FFTAAIASGM YYEMVNAQLN VFLNVHTTAI MQSSALVDEC HALREAMHKS YSRIDDLFNE IRCTLSFHIG DAGV
|
| |