Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27017 |
Symbol | |
ID | 5005103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 73356 |
End bp | 74555 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | |
GC content | 62% |
IMG OID | 640420524 |
Product | predicted protein |
Protein accession | XP_001420897 |
Protein GI | 145353173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 0.233001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0292982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCA TCGTCATCGA CGACAAGTAC GCGTTGAGTG ACTCGGACTC TGTCGACGCC GACGCCCCGC GATGCGCGAA CTGCGGCGTC GCGAGCGACC GACTCAAAAA GTGCGCCAAG TGCCGGCGCG CGCACTTTTG CAACGCCGCG TGCCAGCGCG CGGCGTGGGA CGCGCACGCG CGCGAGTGCG TCGCGGATGC GAACGCGAAA CCGGCGTACA AACCGCCCGA ACCGCCGCGA ATGCCGACGA AGGCGGAAAA GGAAGAGGCG AAGGAGAGCG AGACGCGGCG AATTCGCGAG ACGACGTTGC CGCGAGCGCG CGCGGCGTTG CGAAGAGATG GCGTCTCGAC AACGGTAGAT TTAGACGAGT TGATCGAGGG ATTAGAAGAC GCGATCGTGT TCGCGATCGG GGAGGAGGAT CAGGGGTTGA CTCGCGAGGT GCGGTTGGTG CTGGCGAGGG CGTATTTAGA GGCGAAGCGC GCGGATGAAT GTTTGCACTA TTTGGCGCCG GCGCTGGAGG AGGCGCGAAA GGAGGGCGGG GCGGCGAGCG CGGACGCGCA CACGCTCGCG GCGAAGGCGC ATTGCGCGAA GGGCGAGAAA GAACAGTGTC GCAAGGAATT GACGGCGGCG TTGGATTGCG CGAGCGAATC GACGAGCGAC GAAGCGCAGT GCGATACGTT GCTCGACGCG GGGATTATTT TACACGATCT CGGTGACTGG GAGCGATGTG CGCCGTTGCT CAGCACCGCG GGCGAGGCGG CAGAAAAACT CGGTCGCTTG CGCGAGGCGG CGCGCGCGTA TAATCGCGCG GGTTCGGCAC TTTTGCGTTC GGGTCGGCCC GACTACGCCG GGCGATGCTG GACTCGAGAG CTGCGAGTGC TAGAGGCGGA CGATTCCACC GATCCAGGGA CGTTGGCGCA GGCTTTCGCG AACTGTGCGA GCGCTTTTTT ACTCACTCGC GGCGAAGACG ATGATGCGTT CAACTTACAC AAGAAGTCCG CGCTCACGAA GGCCCGCGAG TCTGGAAATG ACGCTGAGGC TCGCGTTTAC TTGCAATTAG GCAACGCTTA CAAACTCGCC GGAGACGCGA TAGATGATAG TTTAGCGCGC GCGAAAGATT GTTTCGAGAA AGCAAAGTCG TTATCGGCGA CCGATGCTGG CGAGATCGCT TCGCGGGCTT TAGAAATGCT TAGTCTGTAA
|
Protein sequence | MSTIVIDDKY ALSDSDSVDA DAPRCANCGV ASDRLKKCAK CRRAHFCNAA CQRAAWDAHA RECVADANAK PAYKPPEPPR MPTKAEKEEA KESETRRIRE TTLPRARAAL RRDGVSTTVD LDELIEGLED AIVFAIGEED QGLTREVRLV LARAYLEAKR ADECLHYLAP ALEEARKEGG AASADAHTLA AKAHCAKGEK EQCRKELTAA LDCASESTSD EAQCDTLLDA GIILHDLGDW ERCAPLLSTA GEAAEKLGRL REAARAYNRA GSALLRSGRP DYAGRCWTRE LRVLEADDST DPGTLAQAFA NCASAFLLTR GEDDDAFNLH KKSALTKARE SGNDAEARVY LQLGNAYKLA GDAIDDSLAR AKDCFEKAKS LSATDAGEIA SRALEMLSL
|
| |