Gene Information |
Locus tag | OSTLU_18874 |
Symbol | |
ID | 5006440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | - |
Start bp | 76254 |
End bp | 78515 |
Gene Length | 2262 bp |
Protein Length | 754 aa |
Translation table | |
GC content | 66% |
IMG OID | 640421861 |
Product | predicted protein |
Protein accession | XP_001422431 |
Protein GI | 145356423 |
COG category | |
COG ID | |
TIGRFAM ID | |
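The coordinate fields in this record agree with one another if Start bp and End bp are read as 1-based, inclusive positions: End bp minus Start bp plus one gives the Gene Length, and dividing that by three gives the Protein Length. The sketch below checks this arithmetic. The function names are hypothetical and not part of the source database, and the inclusive-coordinate convention is an assumption that the listed values happen to satisfy.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


# Values copied from the record: Start bp 76254, End bp 78515.
length = gene_length(76254, 78515)
print(length)               # 2262, matches "Gene Length"
print(codon_count(length))  # 754, matches "Protein Length"
```

Here the codon count equals the listed protein length exactly, which suggests that the recorded span does not include a separately counted stop codon.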
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00000500904 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00000019569 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCGGG ACGATGACGC GGCGCGGGGG GCGTTCGCGG CGATGCTGGT GGACGGGGAG GGAGACGCGG AGGAGGACGC GCGCGCGGTG GCGAGACGGG TGGGCGCGCG CGGGAGACGC GCGGCGGCGT CGGTGCGCGC GATCGAACGC GCGATCGAAC GCGCGGTGGG CGCGGAGGCG TCGATGGGGA CGATGGGACG GTCGGGGTCG GGGTCGGGGG GGCGTCGAGG CGCGCAGACG ACGGCGACGG GGCGACGCGG GACGGACGCT TCGGACGCGC TCGCGGCGGC GATGGAGACG TTGCGCGCGC TCGCGGTCGA TGAAAGCGGC GGTGCGGTCG TCGACGCGGT CGGCGGTGGC GACGACGACG ACGACGACGC GCGACGTCCG CGTCGCGCGA ACGAGGATTT GGTCAAAGAC GAAGACGCGC GCGATTACGC GCGTCGCGGC GAGGATTCCA TCGGTTTACG CGACGACGTC GACGTCGAGC GTTGGGTGCG CCGAGGATTC GAGTGTTTAC ATCGATTCGC GACGATTCGA AGCAAACGTC GCGATTCGCG GGCGACGCTC GGTGATCGCG AGTTCGACAC GAGCGTCGAC GCGCCCGCGC CGCCGCTCGT CGACGCCTCG GGCGGATTGG ATGAAGCTAA ACAAGCGCAC GATCTCGACG TCGTCAGGCG CATGGACGAC GAAACTTCTG AAAGCGTTCA TAACCTCGTC AAGTCTTGGC TCGCCGAGCA GCGCGACGCC GAACGAGCGA TGCTTGAGCT AGCCGAGCTC GTGCGCGACG ACAACGCGGA TTTGCTCGCG ATTCAGCGAC GTGAGCTCTT GACGCGCGCG CTCGGCGCCT GGTGCGAATT TCGCGACGTC ACCGCGGCGC TCGAGCTGGG TGTGGAAGAA TTCCGTCTGG CCGAGAAGCG CACGCGGTTT TATCTTCGCG AGAACGAAAT GTGTTACAAG CACGTGACGC CGCCCGGGGA TTCCAAGGCG CTGGAAATCG TGCTGAATAA GCATTTGGAT CACGCGAGGA ATTCTTTCGC CGTCGCGGCA CCGAGCGCGA AAGAGATCGA GCGCCACGCC AAAGAATGCG TGCTGGCGAC GCGAGCGTTG AACTCGTGTG ACGGTGATTC GTCGCGCGTT GTCGACAAGG CGTACGCCGA TCGTGCGCTC GACGTCAAAG TCATCCTGCG AGACGTCATG CGGGATTTAA AGCGCTCGTT TCGAGCCATC ACTTCCGCCG CGAGCGGCGG CGCTTCCTCC ACGAACGCCG GCGCGACGTC GACAAATTAC GATGTCGAAG GTTTGCATCG AAGAATCCAA CCCGTGATGG ACAGGTACAA ATCCACCGCC CTCGACGGCG TCGTCGACCG CGTCGTTCGA GCGCTCAATC TTGAGGGCGC GCACTTGGCG TACGGCATCG AACTCGCAAA GTGCGCGACG TTCGTGAAAG TCGTGAGCGG TGCGGTCAAA GAGCACGTTA AAGCACTGGA CGACGCGCGT TTGGAAGCTG CGCTCAAGGC GTTCGAAGAT GACGACGATG TTCTCCGCGG CGGCGACGCC AACTCTAAGA GATCCGCCGG CGGCGCGTCT TCGAAGAAGA AACGGAAATC TTCGTCGTCC AAGGCGAAGC GAGCGCTGAC GAAAGTCGCG CGCGACGTCG AGCGCGATGA AATCACCGTC GCTGACGACG CCGAGGACGA ACCCGAAGCG CCGACGACGC CGCGATCGAA GACCCCCGAA CCGACGCTCG AAGACGCGGC GAATTCGGAT TCGGGCGGCG AGTGGACGGC GGCGCGATCG 
CGAAGGAAGC CCAACACGCC GCCGCGACCG ACGTTCGCGA AGGAAATGGA TGCGCCTCGA AGACCAACCG TCGCGACGAT GCCGAATCCA CCGCCGCTGC CACCGAATCC GCCGCCGTTG CCGAGTGGAA AGCCGCCGGC GCGCATCCCG GCGCCGAGAT CGACCGCCGT CGCCGCCGCC GCGCCGCCTC CGCCTCCGCC TCCGCCTCCG CCGCCGACGG GGCTGTCCGC GCTACCGGAT TTTCCACCCA AGCCGAAAGT CGCATCGCAA TCGACGTCCG AGTCCGCGAC CCCCTCGAGC GACGCCGAAA GCGCCGTCGA TCGAGAGTCG CGATTGAAGG AATTCCCGGC GTTGAAGACG AACGATTCGC CGGCGCTCGC GGCGCCGCCG TCGAGCAAGA CGCACGAAGA AAACCAGCCA TCGTCCAGCG TGCCCGCGGC GCCGCCGAAG AAGTCGACGA TG
|
Protein sequence | MLRDDDAARG AFAAMLVDGE GDAEEDARAV ARRVGARGRR AAASVRAIER AIERAVGAEA SMGTMGRSGS GSGGRRGAQT TATGRRGTDA SDALAAAMET LRALAVDESG GAVVDAVGGG DDDDDDARRP RRANEDLVKD EDARDYARRG EDSIGLRDDV DVERWVRRGF ECLHRFATIR SKRRDSRATL GDREFDTSVD APAPPLVDAS GGLDEAKQAH DLDVVRRMDD ETSESVHNLV KSWLAEQRDA ERAMLELAEL VRDDNADLLA IQRRELLTRA LGAWCEFRDV TAALELGVEE FRLAEKRTRF YLRENEMCYK HVTPPGDSKA LEIVLNKHLD HARNSFAVAA PSAKEIERHA KECVLATRAL NSCDGDSSRV VDKAYADRAL DVKVILRDVM RDLKRSFRAI TSAASGGASS TNAGATSTNY DVEGLHRRIQ PVMDRYKSTA LDGVVDRVVR ALNLEGAHLA YGIELAKCAT FVKVVSGAVK EHVKALDDAR LEAALKAFED DDDVLRGGDA NSKRSAGGAS SKKKRKSSSS KAKRALTKVA RDVERDEITV ADDAEDEPEA PTTPRSKTPE PTLEDAANSD SGGEWTAARS RRKPNTPPRP TFAKEMDAPR RPTVATMPNP PPLPPNPPPL PSGKPPARIP APRSTAVAAA APPPPPPPPP PPTGLSALPD FPPKPKVASQ STSESATPSS DAESAVDRES RLKEFPALKT NDSPALAAPP SSKTHEENQP SSSVPAAPPK KSTM
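Both sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any calculation such as the GC content field. A minimal Python sketch follows. The helper names are hypothetical, and the demonstration uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, used as a small demonstration.
fragment = clean_sequence("ATGCTGCGGG ACGATGACGC GGCGCGGGGG")
print(len(fragment))                    # 30
print(round(gc_fraction(fragment), 2))  # 0.77
```

Applying the same two helpers to the complete gene sequence would give a value that can be compared with the GC content listed in the Gene Information table.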