Gene Information |
Locus tag | OSTLU_18717 |
Symbol | |
ID | 5006194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | - |
Start bp | 237704 |
End bp | 239437 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 66% |
IMG OID | 640421615 |
Product | predicted protein |
Protein accession | XP_001422241 |
Protein GI | 145356022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
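The coordinate and length fields in the Gene Information table are mutually consistent, which the following minimal Python sketch illustrates. It assumes that the start and end positions are 1-based and inclusive, and that the 1734 bp CDS includes an untranslated stop codon. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 237704
end_bp = 239437
reported_gene_length = 1734     # bp
reported_protein_length = 577   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # coordinate span vs. reported length
print(remainder == 0)                             # length is a whole number of codons
print(protein_length == reported_protein_length)  # codons minus stop vs. reported length
```

Under these assumptions the span of the coordinates, the reported gene length, and the reported protein length all agree.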
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.160009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.144375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTG GGGATCCGGT GAGACGCGCG GCGGCGACGC GGGCGCGGGC GAAGTGCGGG ATGGGAGGGA GAGACGACGA GGCACGGGGG GCGGGCGGGA CGCGCGAGGC GACGGCGGCG ACGACGGGGC GGGGGATCAG GACGCGGTAT CAAGGGGATT GGTTGGACGT GGGAGGGAAA GATGGGACGT GCGCGGTGGA GGAGTGCGTG GAGCGGTGGC GCGCGGCGCC GTTTGAGCCG TACGCGGGGA GAGGGAAGGA TTGCGCGTAC GCGGTGTGCG CGGGGGAGAC GACGAGCGAA CGCGCGAGAC GCGGCGCGGC GACGGTGATG CGAGAGATTT CTCGCGAGTA CGCGGCGCTG GGGCTGGGGA CGCACGCATC GATGGTCGAA GGCGACGACG CGGGCTTGTT CGACGGCGCC GAGGACGGTG CGAGCGCGTC GACGCTTTCG GCGTTGGAGC GGTATTCGAG AACGTTGAGC GCGGCGGCGC AGCGCGCGGC TGAGCTCGGC GCGCTCGGTC CGCGAATGTG CGTGATTTAC GTCGTCATTC CCGACGAAGT CGAAGATATC GACGCCTTGA CCGTGATCGC GCTCGCATCG CACGTCATGA GCGCCGCGAC GGCGTCGGTG GCGCGACGCT TCTTGTCCGT CTCCGTGCAA GCGATTCCGT CGTCGTGGTG CGAAGACGCG TACTCTTGGT CATCCACGAG CGTGCGAGCG ATGGCGTTCA ACGTCTTCAC TAAACTCACC AGACCTACGG TGACGCGGGG ATTGTCGCTC CCGAACGATC ATCTCAACGA TGCCCCGCGA AGTTTTCGCA CCTCCGAGAC GACGGACGGG CGCGAGGGCG TCTCGGGCGC CGCCGTCGTC GGCAGGCGAA CGCCGCACCC GTTGTTTAGG ATTCCAGAGA CGTCGAGCGA GCGCAATCTC CCACCGTTTC CCGCGTACGA GCCGCTATAC ACGTTGACGC CCGAAATCGA CGACGACGAC TCGCGTGTTC GTGGCTTGCA CTGCGCGTAC GTCGTGGCGG CGTCGCGATG GATCGTCGCC TCGTGGTCCG ACTCACACGG CGAATTCCTC ACGCTCGAGG CCGAACCGTT CGCGGACGAG GCCGACTGCC TCGCGACTGG CTTGCGATGG CTCATAGATC GAACGAGCGC ACTCGCCGAG CAGTTGGCGT TCGCGTACGG CGCGAAAGCG AATGAGCGAT TAAAGTTTCA GCGCGCGGCG ATTTGTCGCC TGGGTGCGCC GTCATCGGCG GAGCGCGCGG CGCTGGAAAC AGCGTGCAAA GCTGCGCCCG CACCGCTCGA TCGCGATTTC TTGACGCGTC TGGAGATGAC ATGCTTTGAG CCCGACTCCG TGCCCGCGCG CATCGCCGCC CTGGTGCCGA CCACGGCGCG CGATGTGTCA TTCGTCGCGG AGAGCGTCGA GGAGACCAAG ACCGTCAAGA CGTACGCGGC GCCGCCGTGT GCGCAGAAAT CATTCCGCGT CGCATTCAAC ACCAACGTAT CGAGCGCGCG CGTGCGAGCG CTCGACGCCA CGACAGATAT GAACACGCTG AAGCATTTGG CCGCGCTATA CGCCACGCGT CTGTCGCAGC TCGGAATGAT GTGCTTGAGC GAAAACATCG CGGATGAATT TGGAAGCATT CGCGCGCCGC TCCCGCTGCA CGCCGAGGTG TGCGTGCGGT TCGCCAGCAC GCTCCAGACG CTCGAGGCGA ACGGGGAGCA GTAG
|
Protein sequence | MSAGDPVRRA AATRARAKCG MGGRDDEARG AGGTREATAA TTGRGIRTRY QGDWLDVGGK DGTCAVEECV ERWRAAPFEP YAGRGKDCAY AVCAGETTSE RARRGAATVM REISREYAAL GLGTHASMVE GDDAGLFDGA EDGASASTLS ALERYSRTLS AAAQRAAELG ALGPRMCVIY VVIPDEVEDI DALTVIALAS HVMSAATASV ARRFLSVSVQ AIPSSWCEDA YSWSSTSVRA MAFNVFTKLT RPTVTRGLSL PNDHLNDAPR SFRTSETTDG REGVSGAAVV GRRTPHPLFR IPETSSERNL PPFPAYEPLY TLTPEIDDDD SRVRGLHCAY VVAASRWIVA SWSDSHGEFL TLEAEPFADE ADCLATGLRW LIDRTSALAE QLAFAYGAKA NERLKFQRAA ICRLGAPSSA ERAALETACK AAPAPLDRDF LTRLEMTCFE PDSVPARIAA LVPTTARDVS FVAESVEETK TVKTYAAPPC AQKSFRVAFN TNVSSARVRA LDATTDMNTL KHLAALYATR LSQLGMMCLS ENIADEFGSI RAPLPLHAEV CVRFASTLQT LEANGEQ
|
| |
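The gene sequence is printed in blocks of 10 bases, so it needs light cleaning before any computation. The sketch below shows one possible way to strip the block spacing, compute a GC fraction comparable to the "GC content" field, and run a basic CDS sanity check. The helper names (`clean`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the stop codon set assumes the standard genetic code, which is an assumption because the Translation table field in this record is empty. Although the gene is annotated on the minus strand, the sequence as shown starts with ATG and ends with TAG, so it appears to be given in coding orientation already.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(seq: str) -> bool:
    """Check length divisible by 3, ATG start, and a terminal stop codon.

    Assumption: standard genetic code stop codons (TAA, TAG, TGA).
    """
    s = clean(seq)
    return (
        len(s) % 3 == 0
        and s.startswith("ATG")
        and s[-3:] in {"TAA", "TAG", "TGA"}
    )


# First 10-base block of the gene sequence in this record.
first_block = "ATGAGCGCTG"
print(gc_fraction(first_block))

# Toy input (not from the record) showing the block spacing being ignored.
print(looks_like_cds("ATG GCG TAG"))
```

To compare against the reported 66%, pass the full "Gene sequence" value to `gc_fraction` and multiply the result by 100.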