Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_2239 |
Symbol | |
ID | 5003390 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 51627 |
End bp | 53219 |
Gene Length | 1593 bp |
Protein Length | 514 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418811 |
Product | predicted protein |
Protein accession | XP_001419267 |
Protein GI | 145349702 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0895904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCGAGT ACCGAAAACT TCCAATCAAA CGCTACGCCG CGCGAGCGAA GCGAGAGACG GGCGAGGGAA GGTACTGGCG AGAATATAAA TCCACCGCGC TGAGCGAACA GGTGAACGCG GTGACGAGCG TGTCGTACGG AGGCGCGGGG TCGTCGGGGG GCGAACGCGG CGCGTTGGCG GCGACGAGCG GGGCGAGGGT GACGCTGTAC GCGCCGAGCG GAGCGAGAAA ATTGAGGACG TTCGCGCGAT TTAAGGACGT GGCGTACAGT GGTGTGCTGA GAGACGATGG GAAGGCGCTG GCGGTCGGAG GACAGGCTGG GGTGGTGCAG TTGTTTGATT GCGGGTCGCG AGCGGTTTTG AGAAAGTTTA CGACGCACTC CGCGGCGGTT CGCGCGGTGC GATGGAGCGC GGATAAGCTG CACTTAGGGT CGGCGAGCGA CGACGCGACG GTGCGAATAT GGGATATTTC CACTGGGAAT TGCGTGCGAA GGCACGATGG GCACACGGAT TACGTTCGAG CGCTCGAGCG GAGTACGGTT TCTCAAGAGA TGTGGGCGAG CGGGTCGTAC GACCACACGG TGAAAATTTG GGACGCTAGA CAAGGACGCG AGGCGGTGAT GACGCTCGAT CATGGTTCGC CCGTGGAAGA TGTCGCGTGG TATCCCAACG GAAACTTGCT CGTCTCCGTC GGTGGCGAGG ACGTGTGCGT GTGGGACGCC ATCGGCGGCG GCCGGTTGCT TCGTCGGTTG CGCAGTCACC AGAAGACCAT CACCACCGTG CACGTGCACC CGGACGCGGG CCCGCCATCG TTCGCATCTG GATATGAGAT CGGTAGCGAA AGCGCGCTCG AATCAAACGC GCCTCGCATG ATCACCGGCT CCTTGGACGG CTTCGTCAAG ATTCACGAAC TCGACACTTT CACCGTGACG CACTCGATCA AGTACCCTGG ACCCGTGCTG ACGTGCTCGC TCTCGCCAGA CGCGAACTGC CTTGCCACTG GACTAGCCAA TAAAGTATTG AGCGTTCGCA GACGAACGAA ACCTCGTAAC AGTGACGACC CTTCGGGGTA TCAAGGCGTT CGAAGTAAAA AGAAGGGCTT CACGGTGAAG AAACCTCGAC GACTGGATGC GAGTCATTGG CGGTACTTTA TTCGAGGTCA AAACTCTAAA GCGGCGGCAG ACGCCACGCG AGTGCTTCGT CGACGACGTG TGCACTTGGC CGCGCACGAT CGAATGTTGA AACAATTTAG ATACGGAGAC GCCCTGGACG CGGCGTTGCA CGTGGGTAGG GCGGAAGTTG TCGCCGCAGT CATAGAAGAA GTCGGTCGAA GAGGGGGGCT TCAGAAGGCA CTCGCCAACC GCGACGACCA GTCGCTGCTT CCGATTTTGG AGTATATAGA GAAAAATATC TCCAAGCCGC GCCACACGGC GCAGATGGTG AACATTGCGA ACCGAATCGT CGACTTGTAC GGTGGCGACG TCGGCGCGAG TTCGGCTGTG GACAACGCAT TGCGTAGAAT CCAGTTGAAA ATCAAAGCGC AACTGCGACT ACACGAAGCA TTGACGCAGT TACAAGGGAT GGCTTTGACG ATA
|
Protein sequence | LGEYRKLPIK RYAARAKRET GEGRYWREYK STALSEQVNA VTSVSYGGAG SSGGERGALA ATSGARVTLY APSGARKLRT FARFKDVAYS GVLRDDGKAL AVGGQAGVVQ LFDCGSRAVL RKFTTHSAAV RAVRWSADKL HLGSASDDAT VRIWDISTGN CVRRHDGHTD YVRALERSTV SQEMWASGSY DHTVKIWDAR QGREAVMTLD HGSPVEDVAW YPNGNLLVSV GGEDVCVWDA IGGGRLLRRL RSHQKTITTI GSESALESNA PRMITGSLDG FVKIHELDTF TVTHSIKYPG PVLTCSLSPD ANCLATGLAN KVLSVRRRTK PRNSDDPSGY QGVRSKKKGF TVKKPRRLDA SHWRYFIRGQ NSKAAADATR VLRRRRVHLA AHDRMLKQFR YGDALDAALH VGRAEVVAAV IEEVGRRGGL QKALANRDDQ SLLPILEYIE KNISKPRHTA QMVNIANRIV DLYGGDVGAS SAVDNALRRI QLKIKAQLRL HEALTQLQGM ALTI
|
| |