Gene Information |
Locus tag | OSTLU_18863 |
Symbol | |
ID | 5006431 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | - |
Start bp | 49834 |
End bp | 50920 |
Gene Length | 1087 bp |
Protein Length | 303 aa |
Translation table | |
GC content | 64% |
IMG OID | 640421852 |
Product | predicted protein |
Protein accession | XP_001422423 |
Protein GI | 145356407 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
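The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names `span_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(49834, 50920)   # Start bp / End bp from the record
cds_len = coding_length(303)           # Protein Length plus an assumed stop codon
non_coding = gene_len - cds_len        # bases in the span not accounted for by codons

print(gene_len, cds_len, non_coding)   # prints: 1087 912 175
```

Under these assumptions the span matches the reported gene length of 1087 bp, and the 175 bases left over after the coding nucleotides are consistent with the gene being flagged as spliced.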
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 0.00172466 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000000976033 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCGACG ACGATCTCGC GCGCGGCGAC GCGCTCTGCG CCGCGGCGGC CAAGAAACTC AGGGTGCGAC GCGCGCGCGC GAGCGCGCGA TCGGACGCGC TCGAGCGGCG CGAACGCGCG CGCTCGACGC GAACGATCGA CGCGAATGAA TGAATGAATG AATGCGAACG AACGAACGAA CGAACGAAGG AACGAAGGAC GACTGACGAC GGCGCGACCG ACTGAACGAC GACGACAGAG CGTTGGATTC TTCAGCGCGA TCACGGGCGC GAACAGGCAC GAGGAGGCGG CGGAGCTGTA CGAGCGCGCG GCGACGTCGT TTAAGCTGGC GAAGAGCTGG CGAAGAGCGG CGGACGCGTA CGAGGCGCTG GCGACGTGCA GAGCGACGAC GAAGGAGACG CACGACGCGG CGTCGGCGCA CGTGGACTGC GCGCAGATGC TGAAGAAGTG CGGGGCGAAC GAGGAGGCGA TCGGACACTA CAGGGAAGCG TCGAACGCGT ACGCGCGATT GGGACGACTG GCGCAGGCGG CGAAACATTT GAAGGAGATC GGCGAGACGT ACGAGAGCCT GGGGACGGCG GAGGGGGATG AACGCGCGGT GGAGGCGTTC TCGAGCGCGG CGGATCTGTA CGACGGCGAG GGCGACTCGG GACGGACGAC GGGGAATAAT TGTAAACTGA AGGCGGCGAC GCTGCTGGCG AGCAAGCTCG ATCGATTCGA AGAGGCGACG GAGATTTTTG AGGACGTCGG ACGCGCGTCG TTGAATAACA ATTTACTGCG GTTCTCGGTG AAGGGGTACT TTTTACAGGC GGGGATCTGT CGATTGTGCT GGAACGACGC CGTCGGGGTG CTGAACGCGT GCGAGCGATA CGAGGAGAGC GACCCGGCGT TCGCGTCGTC GCGCGAGCGC GATTTGTTGG TGAATTGCGC CAAGGCGTTC GAGGCGGGCG ATCAAGACGC GTTTTCGAGC GCGGTGGCGG AATTCGACTC CATGTCCAGG CTCGACGGTT GGAAGACGAC GATGCTATTA AAGGCTAAGA AGCGCATCGT CGCCGCCGTG GAAGCCGAGG AGGACGATCT CACGTGA
|
Protein sequence | MGDDDLARGD ALCAAAAKKL RSVGFFSAIT GANRHEEAAE LYERAATSFK LAKSWRRAAD AYEALATCRA TTKETHDAAS AHVDCAQMLK KCGANEEAIG HYREASNAYA RLGRLAQAAK HLKEIGETYE SLGTAEGDER AVEAFSSAAD LYDGEGDSGR TTGNNCKLKA ATLLASKLDR FEEATEIFED VGRASLNNNL LRFSVKGYFL QAGICRLCWN DAVGVLNACE RYEESDPAFA SSRERDLLVN CAKAFEAGDQ DAFSSAVAEF DSMSRLDGWK TTMLLKAKKR IVAAVEAEED DLT
|
| |
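The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; `normalize_sequence` and `gc_percent` are hypothetical helper names, and the demo uses only the first two blocks of the gene sequence rather than the full record.

```python
def normalize_sequence(grouped: str) -> str:
    """Remove whitespace from a sequence printed in space-separated blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence; 0.0 for an empty string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above, used only as a short demo.
demo = normalize_sequence("ATGGGCGACG ACGATCTCGC")
print(len(demo), gc_percent(demo))   # prints: 20 65.0
```

Applying the same two functions to the complete gene sequence would give a value that can be compared with the GC content reported in the record.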