Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28967 |
Symbol | |
ID | 4999965 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 608352 |
End bp | 610643 |
Gene Length | 2292 bp |
Protein Length | 688 aa |
Translation table | |
GC content | 60% |
IMG OID | 640415386 |
Product | predicted protein |
Protein accession | XP_001415547 |
Protein GI | 145340883 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0441407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGCGT CGGCGCGGAC GCCCGCGACG CCGGCGTCGA CGTCGCGCGC GCGCGGACGC GCGTCGAGCG CCGCGGTGCG GTGCGCGCGG ATGGGCGCTG GGCGCGACGA CGCGGGAGGC GGACGCGTCG CGCGCGACGT CGCGTCGCGC GCGCGCGCGA CGCGGACGCG CGAGCGAGGG TCGACGCGTC GACGCGGGCG ATGGACGCGC GCGCGCGCGT CGTGGGGAAG CGACGCGCGG GCGGGGTACG AAATTTCGAC GAGTTGGGAC GATTACGACG ACGAGGCGGA CGACGACGAG AGAGGCTCGA ACGAGGCGGC GGACGACGAG GAGACGGAGG AAGAGGAGAG CGCGGCGGCG ATCGTCGCGG TGGACGTCGT CGCGTCGCCG GAGTCGGTGT GGGCGTGTTT GGCGAGAGGG GAGAATTTGT CGGAGTTCGC GCCGACGCTG CTCGTGGCCA AGCGCGGGCC GAAGCAATAC GGATCGACGG AGACGGAGGT GCGATTGCTC GCGATGGATT ACACGCTGTT CACGCCGAGA ATGCTGAATA CGGTGAGCGT TCGGATCGTG GATAAGTCGA AGGGGCCACC GAACTGGGAA AACGGGCCGG CGAAGCTTGG ATTTTTGGCG AGAAACGTCG ACGAGCGAAG CGATTTCATG CTCGTGGGAT CGTTTTGTAT TCGGCCTATT TTTGGCGAAT CGTACAAGTG CAGATTGGAG TACAGGGCTG AGTTGAGGCC GACAAATTAC GAAGTTCCAG CTGATGTCCT GCGCAGGGCG GTGGAGGAGA GCTTCCCGTT CGTCATCGCG AGCGTGACGA AGCAGGCGGT GAAGAGAGAC CAACGACGAC TGCGGTCGCC GGGATTCCTG CCTCAGCTCG AGACCCCTTT CGGGGCATCC AGGGAACTGG CAAAGAAAGC GGACGATTCT TTAGTACCGA GTGGATATCT CGGATTGAGT GAGGTCAACG TCCCGACGCA AAACTCGGAA TCGAGTCAAG ACGAGGAGAC GCCGGATGCC TCCAAACCCG CCGCGGTGCG GATGCGTCCG CGCGGCGAGA CAGCCAAAGC CTCAGAGGCA CCGAAGAGCT GGAGAGCTAT CGGTGGTGGT GCGCAAGGAG AGAAGTGGTT TTCAAGCGAC ATTACCCCGT TTGAAAATCC AGGCTCGCTC GAGGTGCACA TGCGACGGTA CGATACGGAC AGTTTGTTGC ACAGACGCGC GCTGGCCGCG GTTCGCATCG AGGCGCCGCC GGCGTTGGTG TGGGATTTGT TGACGAATTA CGAAAACATG CCAAAGTTCA TGCCGCATCT CATGCACACT GAGTACATTC AAAGATATAA TGCAGTCGAG CGTGAGGCGT CCGAAAAGAT CAAACGATTG CGTCTCAGGC AAGTCTTCGT AAAGTGCGAC CTCTTTCACG CCATCGAAGA GTCGACGGCG CTGGACGTCG TGCAGAAGGA CGACCGCACC GAGTTACAGT TTCGCGTGTT ACAGAACCCG AAATTCGGCG CGCTTCAAGG TAAATGGCTC GTCGTTCCGA CCGAAGACTC CGCAGCCACG GTGTTGAAAT TCGCCATCGA GGGCGTTGTG AGCAATGCTG GAATAGATGG GACGGCGAAA AAGGTGGACC CGCTCAACGA GCGCATCGTG TTTGAAGAGA TTTCAACCAT GCTCAAACAA GCGAGAGATT TCATGGAAGG AATTGCGAGT AAAGAGGTGC AGTCATACGG GAACGTCAAC ATCAAAGTCG CCGATCTTGT GCTGAAAGGC GCTGGAATGT CCGTCGACGA GGAAGACGCC GTCGACGAGC AAATCGTCGG CGCGCTCAAC TCTGAAGCCA CTGAAAACCA AGAGAAACAG ATACAAGCGT TGAAGCGCGA ACTTATCACT CTCGGCTTCG GCGAGAACAA GTGCATGCCG ACGCGAGAAC AGTTACGCGG CGGGCGTCAC TGGGACGCCA TCCAGCAAAT CGAAAGCCTC GGCGGTTTCG TAAAGGTGGC GCAACTTCTA GATTGGTCGG GCGCGAAGAC GCGACCACGT GGGTATTGGA CACTGCGAAC GTTGGAGTTG GAGATCAAAG ACTTCATCGC CAACACAGAG GATCCGAATG TTCAACGGAA CCCCCGAAGG ATGCCGTCTC AGAAATCGTT GCGCGACGCC GGGCGAGCGG ATATCGTCAA TGCGCTCAAG CGTTTCGGCG GCGCCGAGAA AGTGGCGGCC AGCATGGGTC TCGAATTCGG TAGCGGCAAC AAACGTTCGT CCGCGAGCGC GCGCGGCGGC GGCGACGATT AG
|
Protein sequence | MRASARTPAT PASTSRARGR ASSAARGSNE AADDEETEEE ESAAAIVAVD VVASPESVWA CLARGENLSE FAPTLLVAKR GPKQYGSTET EVRLLAMDYT LFTPRMLNTV SVRIVDKSKG PPNWENGPAK LGFLARNVDE RSDFMLVGSF CIRPIFGESY KCRLEYRAEL RPTNYEVPAD VLRRAVEESF PFVIASVTKQ AVKRDQRRLR SPGFLPQLET PFGASRELAK KADDSLVPSG YLGLSEVNVP TQNSESSQDE ETPDASKPAA VRMRPRGETA KASEAPKSWR AIGGGAQGEK WFSSDITPFE NPGSLEVHMR RYDTDSLLHR RALAAVRIEA PPALVWDLLT NYENMPKFMP HLMHTEYIQR YNAVEREASE KIKRLRLRQV FVKCDLFHAI EESTALDVVQ KDDRTELQFR VLQNPKFGAL QGKWLVVPTE DSAATVLKFA IEGVVSNAGI DGTAKKVDPL NERIVFEEIS TMLKQARDFM EGIASKEVQS YGNVNIKVAD LVLKGAGMSV DEEDAVDEQI VGALNSEATE NQEKQIQALK RELITLGFGE NKCMPTREQL RGGRHWDAIQ QIESLGGFVK VAQLLDWSGA KTRPRGYWTL RTLELEIKDF IANTEDPNVQ RNPRRMPSQK SLRDAGRADI VNALKRFGGA EKVAASMGLE FGSGNKRSSA SARGGGDD
|
| |