Gene Information |
Locus tag | OSTLU_37252 |
Symbol | |
ID | 5001346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 601402 |
End bp | 603471 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | |
GC content | 52% |
IMG OID | 640416767 |
Product | predicted protein |
Protein accession | XP_001417548 |
Protein GI | 145346134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
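The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based, inclusive coordinates, and that the reported gene length includes the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for OSTLU_37252.
length = gene_length_bp(601402, 603471)
print(length)                     # 2070
print(protein_length_aa(length))  # 689
```

Under these assumptions the computed values agree with the Gene Length (2070 bp) and Protein Length (689 aa) fields listed in the record.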
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000000758174 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATG CGACCATATT GTACGCGCGG GGCGAATACG CCGAGGCGGT GAGTAAGTTG CACGCGGCGA TTGTGAAAAT CCCGCATTCG AGCGAACCGT ACGAGCAATT GGCGCTGGTG TACGAGGAAA CGAACGATTT AGAAAAGGCG TTGGATAGTT ACTCTTTGGC GACGGCGGTG AAGCGTGGGG TGGACCCGTC GATGTGGTAT CGCATGGCGT CGCTGGCGGT AAACGTGAAC AACAAAGATT ACGCCATACA TTGCTTAGCA AAGGCGGCGA GGTCTGACCC GCACAATTAC GAGAATAAAA TGGATCAAGC AACGTTGTAT TCGGAGCTTG GAGATGCGAA AAAGGCGATT GAGCAACTTG AGTGGGTGCT GAAGGACGAT TTGCCGCCGC TCGACGGCGC TATTTTGCGT GACGCCGTGG TCTTGCTCGC CAAGTTGTAT TATAGTGCTG ATATGCGAGA TAAGGCTGAG CATGCTCTAG AGCACATGCT GAAAGCGTAT CCACAGCACA TCGACGCCAC AGTGGTGAAC ATTTTGATTG AGTTGAAGAT TGAGTTCCGC AAGTATTCCG AGGTACTAGA AATTGTCGAG CGTTCGCGCG CGAATATTTT AGAGCACGTC GACTCTGGAC AGCTGCCTTT GGACATTTCC GTAAAACAAG GTCAATGTTT GCTTTACGAA GGACAAACGG AAGAAGGCAT GGAGCGCATT GAAGAGCTCT TGCGGCACAA GGTGACTGAG TTTGATGACT TGTACTTTGA CTGCGGAAAA ACTCTGATGG AGGTTGGGTT GGCGTCGAAG GCGGAAGAGG TGTTCATGCA CTTAATCGCG CTCGACGAGT ACGACAATGT CGATATGTGG CAGCGCGTAG AGCGATGCGT TCAGCAATCG AGGGGCTTGC GCGCGGTAGT CGAATTTTAC GACATGCTCC ATGACAAGCA CCCGAGCGAC GTATTCATCG CCGTATCTCT AGCCGATTCG CTTTCGCGTT TTCAAGACGA CGAAGAATCG TTGAGGCGGG CGAGGTCACT TGTGTCAAAT TTAGATGACG TCGAGGTACA GAAATATGGC ATTTTATTGC GCGTGACTGC TTTACAAAGA AAGCTCTTGA GCGAAGCCGA ACTGACTGTC ATCATTCCAG CTGCGTTGAA GCTCTTGGAA GACTTGAGCG AAAAAAGATC GCAACGCAAG CTCCAACGCG CGGGACAGGG GAGCGATGAT TTCGTGGACG ATAACGTTCG GATATCAGAT GATGATGTGT TTGCTAATAT TATAAGCGGT GCCGAGGTCG CTATTCGGCT CGGTCGACAC GAAGAAGCGG GAACCATCGT GAATCATGCG CTTTCATTCT CCGCGGGGAG CGTGCTGACG CGAGAGCAGA CGGCTTCGCT ACGATACCTA AAGTCGCTCG TGGCGTACAT GATGGGCGAT TTGCAAGAGT CTGCGGCAAG CTGTCGTAGT GTTCTTGAGG TATTCCCGAA TTCGGTAACG GTTTGGAACA TGCTGATGCA CATGGCTATC GATTATCCTC GAGCGCTCAG CGTTGGAACA TCAAAGCTCG CCAAGCGTCT CGTCGCGAGT AGCCTCGACG ATGCCTCACG CGAACGTCTG CTACCTTTGA TGGCATCTGG GTACGTTCAC ACGTGGAACA AAAAGTGGTC TATAGCAATG CACGATTTCC TCACGGCTCT CACGATCGCG CCGAACGATC ACGAGGTGAA TTTATGTGCG GCGATTTCTC TTTTGCACAT GGCAACGAGA AATTCAAACG AGCAGCAGCG TCACGCACTG 
GCGCTTCGAG CTGTCGTTCT GCTCGAACGT ACCGCGGAGT TGAACACGAC AAGCCCACAA GAAGGGATGT ACAACTTGGC CCGAGGCCTG CAACATCTCG GCTTTCCGCA CCTCGCTCGT CCGATTTATG AGCGTTGTCT GGAGATGCCG GTGAGCTCCG AGGCGGACGA TTTGCGTCGC GAAGCGGCGT ACAACTTATC CCTCATCTAT CGTTCTTCGA ACGCAAACGG CTTAGCGCGC GCGATTCTAC GAAAGTATAT GACGGTTTAG
|
Protein sequence | MSDATILYAR GEYAEAVSKL HAAIVKIPHS SEPYEQLALV YEETNDLEKA LDSYSLATAV KRGVDPSMWY RMASLAVNVN NKDYAIHCLA KAARSDPHNY ENKMDQATLY SELGDAKKAI EQLEWVLKDD LPPLDGAILR DAVVLLAKLY YSADMRDKAE HALEHMLKAY PQHIDATVVN ILIELKIEFR KYSEVLEIVE RSRANILEHV DSGQLPLDIS VKQGQCLLYE GQTEEGMERI EELLRHKVTE FDDLYFDCGK TLMEVGLASK AEEVFMHLIA LDEYDNVDMW QRVERCVQQS RGLRAVVEFY DMLHDKHPSD VFIAVSLADS LSRFQDDEES LRRARSLVSN LDDVEVQKYG ILLRVTALQR KLLSEAELTV IIPAALKLLE DLSEKRSQRK LQRAGQGSDD FVDDNVRISD DDVFANIISG AEVAIRLGRH EEAGTIVNHA LSFSAGSVLT REQTASLRYL KSLVAYMMGD LQESAASCRS VLEVFPNSVT VWNMLMHMAI DYPRALSVGT SKLAKRLVAS SLDDASRERL LPLMASGYVH TWNKKWSIAM HDFLTALTIA PNDHEVNLCA AISLLHMATR NSNEQQRHAL ALRAVVLLER TAELNTTSPQ EGMYNLARGL QHLGFPHLAR PIYERCLEMP VSSEADDLRR EAAYNLSLIY RSSNANGLAR AILRKYMTV
|
| |