Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19171 |
Symbol | |
ID | 5006847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 34270 |
End bp | 35307 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422268 |
Product | predicted protein |
Protein accession | XP_001422789 |
Protein GI | 145357160 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTG CCACCCAAGC GGCCGCGTCC GACGGCGCTA AGAAGCCGCA AGAGCCCAAG GCGCCGGTAA AAGCCGTCGA ACTCGACGGC GGCGCCGCGC TCAAGATCAT GAAACATTGC GCCGACGCCG CGCCCGGGAG CGCGACCGGA CAGCTGTTGG GGCTGGATAT CGGCGCGAGC CTCGACGTGA CGGCGTGCTT TCCGTTTCCC AAGGTGACCG CGGACGATCA GTACGATCCG GACGGCTCGT TCGCCGCGGA GGAGGGCGCG GCGTATCAAC TCGACATGAT GCGGTGCTTG CGAGAGATTA ATGTGGATTC GAACATCGTT GGATGGTATC AGAGCACGTA TCTCGGGACG TTTTATAACG AGGAACTCAT CGCGACGTTT TTGTCCTACT CCGAGAGCTT GCAGCGCTGC GTGTGCGTGG CGTACGATCC GACGTACGCG GAGCAAGGCG TGTGCGCGTT GAAGGCGTTG AAACTGAGTG AAAAGTTTAT CGAGGCGTAT AAGAATGGCA ACGGAGAGTT GACCGTGGAA CAGATTCATG AGAAAGGGTT GAAGTGGAAT GATGTCGTGG TGGAGGTGCC GCTGACGATT AGGAATAGCG CGCTCGCCAC GGCGGTGATG GGCGAGTTGA TGCGAGACAA GGATGTGACG CTCAACGACA CCGATTACGA GAGATTGGAT TTGAGCACGG CGCCGTTCGT GGAGGAGAAC ATGAAGTTGC TCGGCGAGTG CATGGAGGAT TTCGCGAACG AGCAGCAGCG TGTGGCGTAT TACCAGCGCA ACATGCAGCG GTACCAGTCG CAACACGCCC ACTGGTTGCA CAAGCGTCGT CAAGAAAACG CGCGTCGACG CGCCGCAGGG GAGGACTTGC TTCCGGAAGA AGATCCCAAC TACAAAGCTC CGCAACCGCC GAGTCGCTTG GAGAACTTTT TGATCACCAA CCAAGTCGCT GAGCACGTGA CTCACCTCGA AAATTTCACC AAGAAGACGT CGGCCAAGCT CGATTTGGTC ACCGCGCTGG GGAAGTAA
|
Protein sequence | MSFATQAAAS DGAKKPQEPK APVKAVELDG GAALKIMKHC ADAAPGSATG QLLGLDIGAS LDVTACFPFP KVTADDQYDP DGSFAAEEGA AYQLDMMRCL REINVDSNIV GWYQSTYLGT FYNEELIATF LSYSESLQRC VCVAYDPTYA EQGVCALKAL KLSEKFIEAY KNGNGELTVE QIHEKGLKWN DVVVEVPLTI RNSALATAVM GELMRDKDVT LNDTDYERLD LSTAPFVEEN MKLLGECMED FANEQQRVAY YQRNMQRYQS QHAHWLHKRR QENARRRAAG EDLLPEEDPN YKAPQPPSRL ENFLITNQVA EHVTHLENFT KKTSAKLDLV TALGK
|
| |