Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40166 |
Symbol | |
ID | 5000047 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 529963 |
End bp | 531216 |
Gene Length | 1254 bp |
Protein Length | 401 aa |
Translation table | |
GC content | 52% |
IMG OID | 640415468 |
Product | predicted protein |
Protein accession | XP_001415524 |
Protein GI | 145340837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.37612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTCG TGTGCGCCGT CATCCTTCGC CTCGTCCTCA TCGCTTGGAG CGCGTATCAA GACGCAAACT TTGACGTCAA GTATACCGAC ATTGACTACT TCGTCTACAC CGACGCCGCG CGTCATGTCG TCCGCGGCGG ATCGCCGTAT GAACGAGCAA CGTATCGATA TCCACCGCTA TTGGCCGTCT TACTCGCGCC GAACGTGTTG GTGCACGAAA TGTGGGGGAA AGTGTTCTTC AGCACGTTGG ACATCGCGGT GGGCGGTTTG ATTTTGAAAA TCGGTCGGCG ACGCGGTATG AACGCGCGAG AGCTCAAATA TGCTTTGTGG TGTTGGTTAT TTAATCCGTT CACGTGCGCG ATAAGCACGA GGGGAAGCTG CGAGGCATTG ACGGGAGTGT TGATGCTGTT GACGGTTGAG GCTCTCACCG CGGGCGCAAC GACAAGGGCC GCAATCGCGT ACGGATTCGT CGTTCACATG AGGCTGTATC CAATCATACA CGCATTGATG TTCGTTGCGT TTCTTAATAA GGATTACATG GGCAATCGCG CTTTGTTCGG TAAGCGAGGA TCCAAAGCGC TTTCGTGGGT GACGGTAGAA AACGTCAAGT TTGCCGTGGT TTCGTCGGCG ACATTTTTCG CGCTAACGGC TGGTTCGTAC GCCGTGTATG GCATGGATTA CATCGATGAG GCAATTCTGT ATCACGCGCA AAGAAAAGAC CATCGTCACA ACTTCTCACC GGCGTTTTAC GGGATATATC TGAGCATTCA TCCGACGACG GACGCTCCAG ACTTGAACGG TTCAGCAATC GTTCAAACCG CCGATCGGTT GGCTATGAGT CCGTTGCCCA TGCTTACAGT CGTTCTATCA CTTGGGTTTG CGTTTGCTAG CGACATGCCT TTCGCACTTT TTGTGCAGAC ACTCGCGTTT GTAGCTTTCA ACAAGGTGTG CACGGCGCAA TACTTTGTTT GGTGGTTCAT GCTCTTGCCA CTCGTTTTAC CATCGCTGAT GCGAAGTGCG AATCGAAAAC GTGTGGTGTT CGCCACGCTG ATTTGGCTCA TCGCCCAGTT ACACTGGCTG GCTTGGGCGT ACGCCCTCGA ATTCAAAGGG GCGCAAGTAT TTGAGAGCGT GTGGTTGGCG TCCATCGCGT TCTTCGGCGC AAACATTTGG CTCTTGTTGA ACATCATCGC AGCGTATGCG CACGCACCGA TATTTTCCAG AGGTCGCTTG CAGAAGTTTT CAAAAGTAGA ATAG
|
Protein sequence | MAFVCAVILR LVLIAWSAYQ DANFDVKYTD IDYFVYTDAA RHVVRGGSPY ERATYRYPPL LAVLLAPNVL VHEMWGKVFF STLDIAVGGL ILKIGRRRGM NARELKYALW CWLFNPFTCA ISTRGSCEAL TGVAAIAYGF VVHMRLYPII HALMFVAFLN KDYMGNRALF GKRGSKALSW VTVENVKFAV VSSATFFALT AGSYAVYGMD YIDEAILYHA QRKDHRHNFS PAFYGIYLSI HPTTDAPDLN GSAIVQTADR LAMSPLPMLT VVLSLGFAFA SDMPFALFVQ TLAFVAFNKV CTAQYFVWWF MLLPLVLPSL MRSANRKRVV FATLIWLIAQ LHWLAWAYAL EFKGAQVFES VWLASIAFFG ANIWLLLNII AAYAHAPIFS RGRLQKFSKV E
|
| |