Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25201 |
Symbol | |
ID | 5004447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 107892 |
End bp | 109954 |
Gene Length | 2063 bp |
Protein Length | 682 aa |
Translation table | |
GC content | 66% |
IMG OID | 640419868 |
Product | predicted protein |
Protein accession | XP_001420417 |
Protein GI | 145352145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGATC GCACGCGACA GATCGCGCGC GAGTGGGTGA TGAAGTGCGC GGACGTCGTG CTGGACGCGC GCGCGCGCGC GGCGACGCCG CGCGACGCGA GGGATGGGGG GATGAAGGCG AACAGGTGGT TCAACTGCGC GTGCGCGAGA CGACGCGAGA CGACGAGGGC GATGGAACGC GACGAAGCGG CGCGCGACGG CGGGACGCGC GCGGTGAAAG AACTGGAGGG CGATTCTGTG GACGCGTTCG TCGAGGGCGC GGTGACGGTG GATGTGTATT TCGAACGGGG GGGGGACGCG TCGACGAGCG GCGAACGCGC GCTCGTCGAG CGATGGGCGT TTCACGCGGG CGCGGATGGA GAACGAGTCG GTGGGAACAA TCATTTTAAC AAGGATTTGG ACACGGCGGT GGTGTATAAG CGAGCGGTGA TCATGGTGCG CACGGTGGTG GCGATGTTGC GGACGCTTCC CGCGCACGGC GCGCGCTTGC GAGCGATGCG TCGACGAGGC GCGCGAGACG GCGGCGGAGG AGGCGGGTTT AGTTTTGAGA TCAAGAACGT CGATCGCGGA GGGACGATCG CGAGCGCGCA CCCGGGGAAA GCGGAGGGGT ACCGCACGTA CGCGTTCACG GACGTGCCGA CGTCCATCGG GAAACTGAGC GCGTGCGTGT ACTTTTTAGA CCAAGTAGCG ATGAGCGCGT TGGAGGACGA GTGCGTCGCC GTCGTCGAAA CGGCGCCGAC GCCGTGCGAA CGCCTGGACG CGAGCGTCGG TCCCGACGTC GTCGCGCTCG CGGGCGGCGA AATGTCGCTC TCGCCATCGC CGACTGGGAA ATCAGACGCA CCCGAGTCGC TTCAACGCGC GCGTCGCGAG GAGGATTCGA GTCCGATGTC GCGCGGGATA CTCTTCGACG AAGCCTCGCC ATCGCCGTCG CCGAACGCGC GGCCACCGAT TCGCGGTGGA ATGACGGCGT CGTTATCGTC GGGGTCGCTT CAGCAACAAG CGAAACCTCC CGTGAGCCCA TCCGTCGTGC CGTTGGCGGA TTACACGAGC CCGAATTATC AACGATTCGA CAACGCTCCG AGTTCTGCGC CGAATAACGG GCCGGCACCG ATATACGGAT TCAACGCCGC GCCTGTCGGC GCGAAACATG CGCTGCAAAG CGCGATCGAG TCGCCGTTGG TGGAACCTCC GCGTCCGCCG CCGAGACGCT CGACGATGGC GTCGTGTTTG AAAATCGACG TCGGCACATC CATCGACAAA GACGATTTCG TCGCCAAACC GCCAGGAGCG AACGACATCG GGGCGCCGAG CCCCTCGATC GCGGTGGCGC AACCACCGCC ACCGCCAGCG CCGGCTTCGC CGATGACGAC GATCGGTTTC TCGCCCACGA CGGCGGCGCC CATCGCGTGC TCGCCGCGAG TCCGAGCGCC GAAAGACGGC CGCTCGGGCC CCGCGCCGCA CGGTTCGCCT TCGCCTTGGG GTGGTTTCGT GTGTCCTGAA TCTCCAATTC CGAGCCCGGG AGGCGCGGCG TTTGCGCACC CTCGGAGCGT TCCACGTTCC TCTTGGTCGC CGAGCTCAAG CCTCGGCACG TCGATTCGCG ACGTCATCGG CGTATACCCG CACTCGCCCG GTGGCGTCTC CAACTTCGCG CGACGCATGA GCGGCGCGAG CGATGACGCC GCACCAGGGT CGCCTTTGAC GCCGCACGCC GCGGACGAGT CCATCGACAA CCAAGACGAG TTCCCGTTTG CCATCGACGA CGCCGACGCG ACGGCGACGG CGACGTACGT CGAACACACC CCTGCGGACC TGCTGAGCTT GCTCGAAAAC CCGTGCACGC TCCGTCGACG AAGTCTCGAC GGACCGCTGC CCCTTGACGC CGCGTTGGAA GATTCTACGG ATTCCATCGT GAACCAGAGC GCCGAGCTCG ATAAATCCGT CGGCGGCGAC GACAAATCCC GCGCCGCTTC GAACGGCATG ACCCTCGGTT CCGCCCTTCG CGCGTTAGCC GAGCTCGACG TCGCGGCTAA AACGCAGTTT ACCGATTAGT CTAGCGTCGT CGC
|
Protein sequence | MGDRTRQIAR EWVMKCADVV LDARARAATP RDARDGGMKA NRWFNCACAR RRETTRAMER DEAARDGGTR AVKELEGDSV DAFVEGAVTV DVYFERGGDA STSGERALVE RWAFHAGADG ERVGGNNHFN KDLDTAVVYK RAVIMVRTVV AMLRTLPAHG ARLRAMRRRG ARDGGGGGGF SFEIKNVDRG GTIASAHPGK AEGYRTYAFT DVPTSIGKLS ACVYFLDQVA MSALEDECVA VVETAPTPCE RLDASVGPDV VALAGGEMSL SPSPTGKSDA PESLQRARRE EDSSPMSRGI LFDEASPSPS PNARPPIRGG MTASLSSGSL QQQAKPPVSP SVVPLADYTS PNYQRFDNAP SSAPNNGPAP IYGFNAAPVG AKHALQSAIE SPLVEPPRPP PRRSTMASCL KIDVGTSIDK DDFVAKPPGA NDIGAPSPSI AVAQPPPPPA PASPMTTIGF SPTTAAPIAC SPRVRAPKDG RSGPAPHGSP SPWGGFVCPE SPIPSPGGAA FAHPRSVPRS SWSPSSSLGT SIRDVIGVYP HSPGGVSNFA RRMSGASDDA APGSPLTPHA ADESIDNQDE FPFAIDDADA TATATYVEHT PADLLSLLEN PCTLRRRSLD GPLPLDAALE DSTDSIVNQS AELDKSVGGD DKSRAASNGM TLGSALRALA ELDVAAKTQF TD
|
| |