Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_200 |
Symbol | |
ID | 5004268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 204086 |
End bp | 206854 |
Gene Length | 2769 bp |
Protein Length | 923 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419689 |
Product | predicted protein |
Protein accession | XP_001420111 |
Protein GI | 145351494 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0639] Diadenosine tetraphosphatase and related serine/threonine protein phosphatases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0238096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGGGCGCGC GCTGTGGACA CACGATGACG GCGCTGCGAT GGAATCAAAA GACGAAGATA GTCACGTTCG GGGGTGCGAC CGAGCTCGAG GGCGGAAGCG GGGCGAACGC GAGCGCGAAC GTGGGGTTGA GCCCGCAGGC GGGTCGAGAC GCGGGCGCGT GGGTCAAGCT GAGCGGCGCG ACGAATGAAT TGCACGTCTT GGACCCGTTT AACGGGGAGT GGGGGAAGTT GGAGTGCGGC GGTGACGTGC CGAGCCCGCG GGCGGCGCAC GGGGCGGCCA CGGTGGGAGG GATGCTCGTC GTGCTCGGCG GCATCGGGCC GGACGGGTTG GCGGATGAAG ATTTGTACGT GTTGGATTTA GCGACGAGGG ACCCGAAGTG GCACCGCGTG CACGTGCGGG GGCAAGGTCC GGGGCAACGA TACGCGCACG TGTTGAGTTT CGTGGCGCAG AGATTTTTGG TGGTGATCGG GGGGACGGAC GGTTCGAAGT GCTTGGACGA TACGTGGGTG TTGGACACGA CGACGAAACC GTACGAATGG ACCAAGTGCG CGCCGAGTGG GCCCGTGCCG AGCGCGAGAA CGTACGCGAG CGCGAGTACT CGAAGCGATG GTCTGTTGCT GCTCTGCGGT GGCCGCGGTG CGGACGGTTT CGCGCTGAAT GACGCGTACG GTTTGGCTAG GCACCGCGAC GGGCGTTGGG AGTGGGCCGA AGCGCCGGGC AAGGCACCAA CGCGTCGATA CCAGCACGCG ACCGCATTCG TGGACACGAG ATTGCACATC ACCGGTGGGG CGTCGGGTGG GGGACAGCTC GTCACGAACG AAGCGACGAT GTCGATGCTG GACACGTCGT CGGGTGGAAG CACGGGATGG CGGGACGCGA AGAGTTCCGA GGGCAAGGCA CAGCTTTCCT ACGACGCGAG CTCACTCGTC GGACGACGGT GCCGCCATGC AAGTGTGGCG TATGGTCCAT TCATTTTCGT GCACGGTGGT TTGAGAAACG GAACGTTGTT GGATGATTTG GTCGTGCTCG AAGAGCCGCC GCAAGAGAGC GGTTCGACGC GCGCCGAACG CACGCGAGAG CTCGCGACTC TCATCGATCC GAATTCCTTG GCTTGGCGTA AATGGCTCGG TGAAACCGGG CTGTCGGCTG ATATATTAGG AATGCACTCG CCGAGAAACG TTCGAGACTC GTTCGCAATC AACACGCAAC GAGACGCAAC GTATTCAAAC GGATCCTTCA GTTCGCCCTC GTCTCCCGAG GGCATCGCCT TGCGACCGCT CGGAGGTTCG CCGGGGTCCC CGGATTCGCC CGACAACATG AACGCCGCTG CGGAGAAAGA GCTGCGTGCG GCGAGCGCGC AAGAAGCGGC AGCCGCGCTC GAGTTGGTCG CTCGAAGAAA GTTTTCGTTG GGCGAATCTG ACTCTGGCTC TCCGGGATCG TCCGTGCACA CGCCATCGCC TGGATTCGCT CGTTTCGGCT CCCCAGAATC GGCTCTACGC ACGCCCGCGT CGGAAGTTAG GTTACATCAT CGTGCCGTAG TCGTTGCCGC CGCGCCGCCA GATTCTGGCG CGAAGTCAAC GCCGCGAGGC GTCGCGAGCA TGGTGCGGCA GCTGTCCATC GATCAGTTTG AAAACGAGGC GCGACGCATC GGTACTCCCG GTGTGGATAT GCTCACCCCC GGAGACACTC CAGCGAAGAT GGCGCGCGCG AGACGCGCCG CTGAGCTCGG GGCGCAACCC GTGCACAAGG CCGTGCTCAC GCACTTGCTT CATCCGCACA CCTGGGAGCC GAGCACGGAT CGTCGATTCT TTCTTAGCGC GAGCGCCATC AACGAGTTAT GCGACGCCGC AGAGCATTGT TTCAAGAACG AAGAGACCGT ACTTACGGTG AAAGGTCCGG CGAAAATCTT TGGCGACCTG CACGGTCAGT TTGGAGATTT GATGCGGTTA TTCGCCGAAT ACGGCGCGCC ATCGACGGCG GGCGACATCG CATACATCGA TTACGTTTTC TTGGGAGACT ACGTCGATCG CGGTGCGTAC TCCTTGGAGA CCATTTCTCT GCTGCTCGCG CTGAAAATTG AGCACCCGCA AGCTGTGCAT TTACTTCGCG GTAACCACGA AGAGTCGGAC ATCAATGCGT TGTTTGGATT CCGCATCGAG TGCGTCGAGC GTCTGGGCGA ATCCGCGGGC GACGCAGTCT GGCGAAGATT CAACGAGTTG TTCGAATGGC TCCCGCTTGC GGCCGTCATC GAGGATCGTA TTTGCTGCAT GCACGGTGGT ATCGGCCGTA GCTTGACGCA CATCTCGCAA ATCAATGAGC TGAAGCGACC GTTGAACATG GAAAACGGCG GCGTCGAGCT CATGGACATT CTCTGGAGCG ACCCGACAGA GAACGATGGT ATCGAAGGTT TACGCCCGAA CGCGCGGGGT CCAGGTCTCG TGACTTTCGG GCCAGATCGT GTGAAGGCGT TCTGCGAAAC GAACGGGATC CAGATGATCA TCCGCGCGCA CGAGTGCGTC ATGGACGGTT TCGAGCGTTT CGCCCAAGGA CAACTGCTCA CCGTCTTTAG CGCAACAAAC TATTGCGGGA CCGCGAATAA CGCCGGAGCG ATTCTGGTGT TAGGGCGCGA TCTGACGTTA TACCCCAAGC TCATCCACCC GTTACCACCG ATCGCCATGG AGTCTTTATC GCCCTCCGAC CGCATCGACG ACAACTTGTG GCTGCAAGAC GTCAACCGCG ATCGTCCGCC GACGCCTCCG CGCGGCCGC
|
Protein sequence | PGARCGHTMT ALRWNQKTKI VTFGGATELE GGSGANASAN VGLSPQAGRD AGAWVKLSGA TNELHVLDPF NGEWGKLECG GDVPSPRAAH GAATVGGMLV VLGGIGPDGL ADEDLYVLDL ATRDPKWHRV HVRGQGPGQR YAHVLSFVAQ RFLVVIGGTD GSKCLDDTWV LDTTTKPYEW TKCAPSGPVP SARTYASAST RSDGLLLLCG GRGADGFALN DAYGLARHRD GRWEWAEAPG KAPTRRYQHA TAFVDTRLHI TGGASGGGQL VTNEATMSML DTSSGGSTGW RDAKSSEGKA QLSYDASSLV GRRCRHASVA YGPFIFVHGG LRNGTLLDDL VVLEEPPQES GSTRAERTRE LATLIDPNSL AWRKWLGETG LSADILGMHS PRNVRDSFAI NTQRDATYSN GSFSSPSSPE GIALRPLGGS PGSPDSPDNM NAAAEKELRA ASAQEAAAAL ELVARRKFSL GESDSGSPGS SVHTPSPGFA RFGSPESALR TPASEVRLHH RAVVVAAAPP DSGAKSTPRG VASMVRQLSI DQFENEARRI GTPGVDMLTP GDTPAKMARA RRAAELGAQP VHKAVLTHLL HPHTWEPSTD RRFFLSASAI NELCDAAEHC FKNEETVLTV KGPAKIFGDL HGQFGDLMRL FAEYGAPSTA GDIAYIDYVF LGDYVDRGAY SLETISLLLA LKIEHPQAVH LLRGNHEESD INALFGFRIE CVERLGESAG DAVWRRFNEL FEWLPLAAVI EDRICCMHGG IGRSLTHISQ INELKRPLNM ENGGVELMDI LWSDPTENDG IEGLRPNARG PGLVTFGPDR VKAFCETNGI QMIIRAHECV MDGFERFAQG QLLTVFSATN YCGTANNAGA ILVLGRDLTL YPKLIHPLPP IAMESLSPSD RIDDNLWLQD VNRDRPPTPP RGR
|
| |