Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42459 |
Symbol | |
ID | 5003399 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 79670 |
End bp | 80849 |
Gene Length | 1180 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 64% |
IMG OID | 640418820 |
Product | predicted protein |
Protein accession | XP_001419063 |
Protein GI | 145349277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGG AGACGCGCGA GGACGCGACG GATGGATACC GCGCGCGCGT GGACGTCGTG TACGCGCGGC TGAGCGGCGC GGACGCGAGG GAGGACGGCG ATAAATTGCG CGTGCTGCTG GTGCGTCGCG CGAGGGCGAG ATCGAGCGCG CGGAGGGGAA GGGGAGAGGC GCGCGCGAGC GGGCGGGACT GACGAGACGA CGACGCGCGC GGCGACGACT CGAACGAAGA ATTACGGAGA ACACGGACGG GAATTGGTGC CCGTGGACGC GGCGGTGCGA TTGCTGGAGA CGATGGCGGA GGGTGAGGAC GCGGTGGTGG CGCTGGCGCG CGGGAGGGGT GTGGACGAGA CGCGGTTGCG AGCGATGCTT CGAGCGACGA CGTTCGCGGT GGTGCCGATG GAGAACGAGC GAGGACGGGA GCTGGTGGAG AGGGGGGATC CGTGCGAGCG GAAGAATGGG CGAGGGGTGG ACCCGAATAG GAATTGGGGG GTGAATTGGG GGGTGAAGGC GCCGGATTAC GATCCCAAGG AGGAGTTTCC GGGGACGGCG CCGTTTAGCG AGCCGGAGAG TCGGATATTT CGGGATCTCG TCGCGTCGTT CGAGCCGCAC GCGGTGGTGA ATTGGCACAG TGGGATGTCG GCGATATTCA CGCCGTATGA TCACGTCGCG CGCGAGCCCA CTGGGGCGGG GGCGGAGGCG ATGATGCGTT TCGCGCGCGT CATCGACGCC GAGCACTGCG CGAAAAAGTG CACGCTCGGT TCGGGCGGGA AGGGCGTGGG GTACCTCGCG CACGGTACGG CGACTGATTA CATATACGAA AAGATGAAGG TGCCCGTGGT GTACACGTGG GAAATATACG GCGATCTCGA TGCGCCTTTC GAGGATTGTC ATCGCGCGTT CAACCCGACG ACGAAGGAGA CGCGCGACGC CGTCGTCGAA GCGTGGTTCG GAGCGCCCAT CACGCTCGTA TCCATGCTCG ACCAGCACCC CGACATAAAT TTCAAACATC AAAGCGTCGT GCCGGCTGTC GCGTCATCTT CATCGTTCGC AGTTGGTGAT GAACAGCGCC GTTTCCCTTG GACAATTTCG TTGGCCTTCG CGTTCTTCAC CTTGGTGGCG CTGCGTCGAC TTCGGCGATC CAAGCGCGGC GGTGGTGCGG CGATCGGTAC ATCGCTTTGA
|
Protein sequence | MTTETREDAT DGYRARVDVV YARLSGADAR EDGDKLRVLL NYGEHGRELV PVDAAVRLLE TMAEGEDAVV ALARGRGVDE TRLRAMLRAT TFAVVPMENE RGRELVERGD PCERKNGRGV DPNRNWGVNW GVKAPDYDPK EEFPGTAPFS EPESRIFRDL VASFEPHAVV NWHSGMSAIF TPYDHVAREP TGAGAEAMMR FARVIDAEHC AKKCTLGSGG KGVGYLAHGT ATDYIYEKMK VPVVYTWEIY GDLDAPFEDC HRAFNPTTKE TRDAVVEAWF GAPITLVSML DQHPDINFKH QSVVPAVASS SSFAVGDEQR RFPWTISLAF AFFTLVALRR LRRSKRGGGA AIGTSL
|
| |