Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33188 |
Symbol | |
ID | 5003191 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 527893 |
End bp | 529526 |
Gene Length | 1634 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418612 |
Product | predicted protein |
Protein accession | XP_001419191 |
Protein GI | 145349544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.860325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.52884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCC CCGCGCGATC GTCCAACAGT CGAGTGAAGC GCGCGAAAAC GGCTCAAAGT TCCGCCGACG TGGTCGCTTC CGCCCCGGAA GTCGGCGCTT TGTACACGGA AGACTATCGC GTGAAAGGTT TGCACGTCCG CGATCACTTC ATCGCGGTGC GTCCCGACGC GTGTCGATGC GTTTTCGACT CGACGCGGCG CGCGCGGCGC GCGAGGGCGA AAAACTGCTG ACTGACGGCG ACGGCGACGC GACGCAGGTG CCCGTGTGTC ACGCTCGAGG CGATTCGAAC GCGATGCGCG TGTTTTTTCG AGAAGTGGTG ACGGCGGCTC GAGGGAAACT CACGAGCGAG GAGCGGAAGT CGCTGCCGGC GGTACTGTTT TTGCAAGGCG GACCCGGATT CGAGTGCGCG GGACCGCTCG AGGCGAGCGG CTGGTTGGGG GAAATGGTCA AGGAACATCG AGTGTTTTTG ATGGATCAAC GAGGCACGGG ACGTAGCGAC AGCGAGATTG TCCATCCAAC GCTCAACCGG GATGCGTCTG GACATCCCTT GTCGTACCCC AGACATTGGA CCGACAAAAA CACGTCGCCG GCGAAGGCGT GGGCCGTTCA CTTGAAGAAT TTCCGAGCAG ACAGCATCGT GAAAGACGCC GAGTTGTTCC GTAAGACGGT GCTCGGTGAA GATGTGAAAT GGACGCTGCT TGGGCAATCC TTCGGCGGCT TTTGCATCAC GACGTATTTG TCTTTCGCTC CCGAAGGCGT GAAAGAAGCG CTGCTTACTG GCGGTTTGCC TCCGCTCATC GACGAACCAG CGAGTGCGTT AAACGCGTAT CGAAAATTGT TCGAGCGCGT TCAGACGCAA AACAGAAAAT ATTTCGAAAG ATTTCCGTAC GATGTCGACC GCCTCTATGC GCTGTATGTC CAACTTCAGA ACGAAGGTCC ACGGATCTTG CCCGGCGGTG GGTTGCTCAC CGTTCCGCTG GTACGAGCGC TCGGTTTTTC TAATTTAGGC ACAGCGCAGG GGATGGAGAG GTTGCATTAC ATCATGCAGT ACGTAGAGAT TCATTATGCC GACGAAGAAA TTGTTGGAGC TCACTTGCCG CACAAGTTTT TGATTGAAGT GGAAAACTCG TTTAGGCACT TCGAGACGAA CCCGTTGTAC GCGGTCCTGC ACGAGGCGAT TTATTGCAAC GGTGCGTGCG CCATCGGTGC GGCCGACCAA GTGTGGCTCG AGCGAGTTGG CGAAGATCTT TACAGCGCGT TTGGTGAGCC TGAGTCTGAT GACTCATTGC GCCGTGCGTT TACGGGGGAG TGTGTTTACT CTTCATTCTT CGAAGACATC CCATCGCTTC GCCCGTTCAA AGCGATCGTT CAAGAGCTCA AAAAGGACAA AGATTGGCCA AAGGTGTTGT ACGATACGGA GCAGCTGGCC AAGAATACCG TCCCAGTGGC GTGCGCGAGT TACGTCGAGG ACATGTTCGT CGATTTCGAT CTCGCCTCTG AAACGGCTGC GAAAATTCGG GGCGCCCGCG TCTGGAGCAC CAGCGAATAC ATGCACTCCG GCATTCGCGA AGACGGCGCC CGCATCGTCC AAAAGCTGTT GTCCTTCGTT CGCGACGAGG ATCCAATTCG TTAG
|
Protein sequence | MTSPARSSNS RVKRAKTAQS SADVVASAPE VGALYTEDYR VKGLHVRDHF IAVPVCHARG DSNAMRVFFR EVVTAARGKL TSEERKSLPA VLFLQGGPGF ECAGPLEASG WLGEMVKEHR VFLMDQRGTG RSDSEIVHPT LNRDASGHPL SYPRHWTDKN TSPAKAWAVH LKNFRADSIV KDAELFRKTV LGEDVKWTLL GQSFGGFCIT TYLSFAPEGV KEALLTGGLP PLIDEPASAL NAYRKLFERV QTQNRKYFER FPYDVDRLYA LYVQLQNEGP RILPGGGLLT VPLVRALGFS NLGTAQGMER LHYIMQYVEI HYADEEIVGA HLPHKFLIEV ENSFRHFETN PLYAVLHEAI YCNGACAIGA ADQVWLERVG EDLYSAFGEP ESDDSLRRAF TGECVYSSFF EDIPSLRPFK AIVQELKKDK DWPKVLYDTE QLAKNTVPVA CASYVEDMFV DFDLASETAA KIRGARVWST SEYMHSGIRE DGARIVQKLL SFVRDEDPIR
|
| |