Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33639 |
Symbol | |
ID | 5003504 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 556668 |
End bp | 558188 |
Gene Length | 1521 bp |
Protein Length | 392 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418925 |
Product | predicted protein |
Protein accession | XP_001419634 |
Protein GI | 145350483 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.214309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCCTCGCAC AACGGCGCGC TTTGACGCGT CGCGCGCGAT GGGGGCGTAC CTCAGCCAAC CGGTCACGCG CAAGGACTCC ACCGACGGCG CCGACGCGCG CTTCGCGTAC GGCACGACCG CGATGCAGGG GTGGCGAACG AACATGGAGG TGCGACGACG ACGAGATGAC GCGGACGCGA CGACGCGACG ACGACGGCGC GCGCGTCGTC GCGACGAGCG GCGAACGCGA CGACGACGGA CGCGCGATGG GAAAGAAACG CGACGCGACG ACGCGCGCGC GCGAGCGATG GAGACGACGA CGCGCGAGGG ACGGCGAAAG AGCGCGACGG CGGGGACGAC GACGACCTGG CGAAGACGCG AGGCGGAGGC GCGAAAGCGG AGGGACTGAC GAGCGCGCGA CGCGCGATGA AACAGGACGC GCACGCGACG ATATTGGATT TGGACGCGGA TACGGCGTTT TTTGCGGTGT TCGATGGACA CGGGGGGAAG GAGGTGGCGA TGTACGCGGC GAAGCGCCTG CACGAGACGC TGAAGGAGAC GGAGAGCTAC GTCGCGGGGG ACGTCGCGAG AGGATTGGAG GAGAGTTTTC TCGCGCTCGA CCGGAAAATG CTGGCGAAGG AGGCGGCGGG AGAGTTGAAG GCGTTCAGGG CGGGCGGAGA AAAGGACGAT TCGAGCGGGT TCGGGGGATT GCTGGGGGAC GGCGCGAGCG CCGAGGAGCA AAAGAATCGG CGAGCGGAGA TCAACGCCAA GCTTCGCGCG GCGTTGATCG AACAGGTCAA GGAGTCCAAC CCGGACATCG ATGAAAATGA TATTAAATTT GATTTTGAAT TGGAGGATGG AGATTTTAAC GAAATTGCGA GCTCGAGCGG CGGCGACGGC GCCGACGACG CCTCGCACGA AAATTGGACG GGCCCGCAAG CGGGCGCGAC CTCGGTCGTG GTGTGCGTTC GCGGTGACAA GGTGTATTGC GCCAACGCCG GGGACTCTCG AGCGGTATTC TCGCGGAAAG GCGGCGAGGC TGTCGAGATG AGTGAGGACC ACAAACCGAT GAATGACGGC GAGCGCAAGC GCATCATAAA CGCGGGCGGT TTCGTCAGCG AAGGACGAGT CAACGGCTCG TTGGCACTGT CTCGCGCGTT GGGAGACTTT GAGTACAAGA TGAACAAAGA GCTCGACGAA AAGCAGCAAG CGGTGACGGC GTTTCCGGAG ATTAGAGAAT TCCAACTGCA AGAGGGCGAT GAGTTCATGA TTCTCGCGTG CGACGGCATT TGGGATGTCA TGTCGTCGCA AGAGTGCGTG AATTTCGTGA GAGAGCGACT CGTTGCGAAG CTCAAGTCGG GCGAGAGCGA CTTGAAACTG AGTCAAATAT GCGAGGAGCT TTGCGACAGG TGTCTGGCGC CCGACACCAG AGGCTCGGGC CTGGGATGCG ATAACATGAG CGTCGTCGTC GTCCTACTGA AGAAATTTTG TTCGATTGCG TGAGCGAGCA TCGAGCGGAA GGCGTAATCA CGCGTAGACT T
|
Protein sequence | MGAYLSQPVT RKDSTDGADA RFAYGTTAMQ GWRTNMEDAH ATILDLDADT AFFAVFDGHG GKEVAMYAAK RLHETLKETE SYVAGDVARG LEESFLALDR KMLAKEAAGE LKAFRAGGEK DDSSGFGGLL GDGASAEEQK NRRAEINAKL RAALIEQVKE SNPDIDENDI KFDFELEDGD FNEIASSSGG DGADDASHEN WTGPQAGATS VVVCVRGDKV YCANAGDSRA VFSRKGGEAV EMSEDHKPMN DGERKRIINA GGFVSEGRVN GSLALSRALG DFEYKMNKEL DEKQQAVTAF PEIREFQLQE GDEFMILACD GIWDVMSSQE CVNFVRERLV AKLKSGESDL KLSQICEELC DRCLAPDTRG SGLGCDNMSV VVVLLKKFCS IA
|
| |