Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17719 |
Symbol | |
ID | 5005056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 7725 |
End bp | 9371 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420477 |
Product | predicted protein |
Protein accession | XP_001420880 |
Protein GI | 145353134 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 0.414961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGC GCGCACCCGC CGCGAGCGCG ATCGGGGCGA TGCATCACCG AGCGTCGACG CGCGCGGCGT CAACGTCTTC GACGCGCGGC GGGAAGCCGA CGCGATGTTC GAGCGCGACG TCGACGTCTC GAACGGCGTC GTGGGTCGAC AGATCGGCGC GGCGCGCGAG GACGACGAGG CGAAAAACCA TCGTTCGCGA TGGAGCAGAG TTCGCGAACG ATTCGGGTGG GTACGAGCGG TACCTGAGCA GCGACGCCGC GGAAGGGGGC GAAGGGGACG GCGAGGACGG CGCGGCGGCG CGCGCGAAAG AAGGCGCGCA AATGTCCACG TCGTCGGCTC AGGCTCGGGA ACGTGAACGT GAAGTTGGTG CCAGTTACGA CGACGAGTTC GTCGCGCTCG TGCGCGCGCG CAAGGAGGCT ACGGAGGAGG AGACGAGGAA AAGGTGGAAC GAAGGGGCGG CGACGCCGCG AGTGGCTTAC GATCTCGGGC AGGATTACAT ACGCAGGGTC TCGGTGGCGT ATCCGTACGC CGTAGTCGGC TCCGCGCGCG GTGATGTGGC GGTGTGCGAC GTCGTCGACG CTTCCGCGCT CGCAGTGTCG CCCGCGGCGC ACAAGAAGGA TTGGTCCGAC GTCGACGCCA GGCCGTTGGG AGAACGCGTG CTCTTGGGGG CGCACGATGG CGGTGCGGTC ACGGCGGTGG CGATCACGAA AGATGCGCCG AAGCAAGAGA AGTATGATTT CGCGCACGTC GCATCCGGAG GCCGAGACGG CGTCGTACGA CTGTACAAGG CGTGCAAGAA TAGCGCCGCG CTGCACGAGC TCGGCTGCGC CAATCACGAC GACGTCGTTA CGGGCGTTAC TTTCGCGCGA GGCTCACTTT GGAGCGTCGC TCTCGATGGT CGTTTGTGTC GCTGGTCTCT TGCGACTTCG CACGCCAAGG TCAAGTCTGC CAAGCACGCC AAGGAGTCGC TCATCGACGC CGACGAGTAC GTCCGCGAAC AGGCGCCGGC GCTCACCAAG GAAGGTGAGT GGCGTAGCGG ACAACCCACG CTTTGTTTGA ACGCGTGTGA AAGCTCTGGG TTGATTTCAA TCGGAAACGC CGATGGATCC GCTGCGTTGC TGAGCGCGGA AGTGAGTCAT ACGACTATCG GCGAAACAGC CGTCGTAAAC GCTTGGATTG CTCACGAAGG CTCCACCGTC CGCGCTATCG CGCACGGCCC GAACGGGAAC ACCGTAGTCA CTGGTGGTGG CGACGGCGTG ATCCGCGTGT GGTACTTGGT CGAGGAACAA ACGGTCGGTG GTTTTAGGAG CATGAACAGA CAGAAACTAG GCCTCCCGCA AGTCACGCCG ATTCTCGTCG CCGAACTTCG AGGACACACC TCCGCCGTCG TGTCTCTGTC CACGGGATGC CCTGGTCGAT TAGTCAGCGG CGCTCACGAC GGAACGATTC GAGTGTGGGA CTTGGATGTG AAGCCCCCGC CCAAAGTGGC GAACATTAAA GCCGTTCGCA GAGACGCGAG ATACGCGGTT CTTGGTCACA CCATTTGGCT CGGCGGTACG TACGCCGACG CCGAACGCAT CATATGCGAC GGCGCGAACA ATGTGTTGTT GGAGTACGAC TTTTCCAACG CACCCCAGGA CGTCTGA
|
Protein sequence | MTMRAPAASA IGAMHHRAST RAASTSSTRG GKPTRCSSAT STSRTASWVD RSARRARTTR RKTIVRDGAE FANDSGGYER YLSSDAAEGG EGDGEDGAAA RAKEGAQMST SSAQARERER EVGASYDDEF VALVRARKEA TEEETRKRWN EGAATPRVAY DLGQDYIRRV SVAYPYAVVG SARGDVAVCD VVDASALAVS PAAHKKDWSD VDARPLGERV LLGAHDGGAV TAVAITKDAP KQEKYDFAHV ASGGRDGVVR LYKACKNSAA LHELGCANHD DVVTGVTFAR GSLWSVALDG RLCRWSLATS HAKVKSAKHA KESLIDADEY VREQAPALTK EGEWRSGQPT LCLNACESSG LISIGNADGS AALLSAEVSH TTIGETAVVN AWIAHEGSTV RAIAHGPNGN TVVTGGGDGV IRVWYLVEEQ TVGGFRSMNR QKLGLPQVTP ILVAELRGHT SAVVSLSTGC PGRLVSGAHD GTIRVWDLDV KPPPKVANIK AVRRDARYAV LGHTIWLGGT YADAERIICD GANNVLLEYD FSNAPQDV
|
| |