Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38832 |
Symbol | |
ID | 5001897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 529879 |
End bp | 530997 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417318 |
Product | predicted protein |
Protein accession | XP_001418039 |
Protein GI | 145347149 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00223865 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00603177 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTACCGG GTCAATGCTT CAGAACAAAA GATTACGATT TGGAGGGCGT GGTCGCTCGC GCTGGTGATG ATACGGCTCG TATAAACACT GGCTCACATC CATCGCGTCG AGGCGCAAAG AAAACTACCA ACGGTGATTT ATTCGAAGCG TCGCTCGCGA CGTACGCGGA GAGCGCAAAG GATGCGAATA CGGTTTCCGA CGACGACGAC GACGTCGTTG AAATTCATCT CAGCAACATC AAACTCGGAA GACTAATTGG TCAGGGCGCT TACGGTGCGG TGTACGTGGG AAAGTGGAAC AAACGCGTCG TCGCGGTGAA AAAGTTACAC GGTGTCCCAC CCGGTGCTGG AAAGAGCGAT TTGAAGACGT TCGTGAGAGA AGTCGCCGTG TTGAGCGCGG TGCGACATCC AAAAATCGTG CGCATGTTTG GAGCGTGTCT CAAGCTGCCG CATCTTTGTA TAGTCGAGGA AATGATGGAC GGAGGAAGCT TGCACGCGCT TCTTCACCAA GACAAGCAGT ACGACGTGAG TCTAGACGAC GTCGTGAGGA TAGCGCTCGA CGTAGCGCAG GCGATGGCGT ATTTGCACTC CAGATGCATC GTTCACCGAG ACTTGAAATC GCACAACGTG CTTCTCAACG GCCGTGGCGC CAAAGTTGCG GATTTCGGCA TCGCGCGCGC GCTTGAGCGG ACGTTGCTCA GCGCCGGCGG TTCGGCGACG AGCGCCGGCG GCGCGTCCGC CGGAACCGCC GGAACGCCCG CGTACATGGC TCCAGAGCTT TTTCACGGCG ACGCCGACGC GGTGACGACC AAATGCGACG TCTTCTCCTT CTCCGTCCTT CTGTGGGAAT CACTCGCGCG GTCGATCCCC TGGGAGTGGT TCGCGAACCA CATGCAAATC ATCTTCGCCG TCGCCATCCA ATCGCAGCGT CTACCGCTGG ACGCGCCGCC GTTCGCGCGC GACGACGTCG TCGTCCGCGC GTTGGTCGAC GACGTCATCG TGCCCGCTTG GCAAACCGCC CCCGACGCTC GACCGGATTT CACCGAGCTC ATCGTCGTCT TAAACCGAGC CCTGCGCGAG CTGCTAGCCG AAAGTCGCGT CGCCGTAGAT TCGCGCTAG
|
Protein sequence | MLPGQCFRTK DYDLEGVVAR AGDDTARINT GSHPSRRGAK KTTNGDLFEA SLATYAESAK DANTVSDDDD DVVEIHLSNI KLGRLIGQGA YGAVYVGKWN KRVVAVKKLH GVPPGAGKSD LKTFVREVAV LSAVRHPKIV RMFGACLKLP HLCIVEEMMD GGSLHALLHQ DKQYDVSLDD VVRIALDVAQ AMAYLHSRCI VHRDLKSHNV LLNGRGAKVA DFGIARALER TLLSAGGSAT SAGGASAGTA GTPAYMAPEL FHGDADAVTT KCDVFSFSVL LWESLARSIP WEWFANHMQI IFAVAIQSQR LPLDAPPFAR DDVVVRALVD DVIVPAWQTA PDARPDFTEL IVVLNRALRE LLAESRVAVD SR
|
| |