Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37994 |
Symbol | |
ID | 5003977 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 482518 |
End bp | 484149 |
Gene Length | 1632 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419398 |
Product | predicted protein |
Protein accession | XP_001420012 |
Protein GI | 145351283 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1112] Superfamily I DNA and RNA helicases and helicase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.273766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGA GCACTCTCAA AGCCGTGAAG TCTAAAGTTT TGAACGACGA ACAGTTGCGT GTGGTCGAAG ACGTCGTGTC AGGTGCGGGG AAGCAGTATC CATATATTGT TTGGGGGCCG CCCGGTACCG GAAAGACACT CACGATCGTG GAATGCGTCG CGCACGTGCT CGAGATGTTC CCTCACGCGA GAGTACTGCT CGCGGCGCCC TCGGCGTTCG CCGCGGATAT TCTTTGCTCG CGCTTGGCGA AGCGACTCAC CCCTTTCAAA AAGAAAATGA TCGTACGCGT GAACGACGTT CGTCGCACGC CTGAATCCGT GAAAGCCGAC GTGCGATTTC ATTCGCTCGA AATTTGGCGA GACGACCCGG AGGAAGCGAA ACAGTACGCG AGCGTGCCAT TTCACTTCTT CAAGCGACCG GATCCTCTGA AACATTTGAA ACATGCGCGC GTCGTCGTGT GCACGTGCAC GAGCGCTGCT TTGTTGCGCA AGCTGCCGAT GCCTGTCGAT AGTGTCGTCG AGAACTGGAC GCCGACGCAT ATTTTTGTCG ACGAGGCGGC GCAGGCTTTG GTTCCGGAGA CACTCATTCC TTTGTCGCTC GCCAGTTCGG AAACTAGCAT CGTTCTCGCC GGCGATTCCA AGCAGCTCGG TCCCAACGTG CACTCGAAAG AGGCTGCGCA AGCTGGTTTG CGAAAGTCTC TGCTCGAAAT GTGGATGGAT CACTCAAAGG AAGAAGTCGC TCGAGGCGTC TGGAACGGCA CGCAACTCCG AGCGTGCTAC CGCTCGCATC CCGACATCGT CGCGCTGCCA TCGAGAATGT TTTACGACGG TACCGTGGAG AGTTGCGCGC CGACGGCAAA CACGGATTTG CCAGCAAATT GGGAGAACTT TTCTCGAGGC GCGGGCAACG GACGCGCGAG TCGTTTCCTC TTCTACGGCG TCAAGGGACG ACAGCGCAGA GAAGGCAACA CGAGCAGCTG GACCAATCCG ATCGAATGCG CCGAACTGGT CGACTTACTC GAAGCCTTAC TGGATAGCAC GAACCTCACA CCCGCCGACG TCGCCGTGAT GGCGACGTAT CGTCGACAAG TCGTGCTCAT TCGCATCGCG CTTCGCGCGC GCTCGCTCGG CGCCATTCGC GTCGGTACCG TCGACGATTT CCAAGGGCAA GAGGAGAAAA TCATCTTCAT CTCCACCGTC GTCACGCGTC CAACGACCCT CGACGCGTTG GATTCCGAGA TTGGCTTCCT GAACAACCCC AAACGTTTCA ACGTCGCAAT CTCCAGAGCG ATGGCGTTAA ACGTCATCGT CGGACATCCC CTCGTGCTTC TTCAGAATCC CCTATGGGCC GAGCTCGTGC GCGAATGCGT TCGCCGCGAC GCCTTTCGCG GCGCCGGCGC CGAGTACCTT CCTCGTTTCG CCGGTGGCGG CCACGATTTC GCCCTCCCGT CGTCTCTCGA CGACGACGAC GTCCGTCCCT CTCGCGGCGC CTCCGACGCC GTCGCCGACG CCGTCGCCGC CGTCGCCGAG CTCGCGCTCC TCGGCGGCGG CGCCTCGGAC GCGTTGTCGT CCCAGGACGG TCACGCGTGG GACGATTGGG GCGACGAGCC CTCCTGGCGC GTCGCCGTGT GA
|
Protein sequence | MTPSTLKAVK SKVLNDEQLR VVEDVVSGAG KQYPYIVWGP PGTGKTLTIV ECVAHVLEMF PHARVLLAAP SAFAADILCS RLAKRLTPFK KKMIVRVNDV RRTPESYASV PFHFFKRPDP LKHLKHARVV VCTCTSAALL RKLPMPVDSV VENWTPTHIF VDEAAQALVP ETLIPLSLAS SETSIVLAGD SKQLGPNVHS KEAAQAGLRK SLLEMWMDHS KEEVARGVWN GTQLRACYRS HPDIVALPSR MFYDGTVESC APTANTDLPA NWENFSRGAG NGRASRFLFY GVKGRQRREG NTSSWTNPIE CAELVDLLEA LLDSTNLTPA DVAVMATYRR QVVLIRIALR ARSLGAIRVG TVDDFQGQEE KIIFISTVVT RPTTLDALDS EIGFLNNPKR FNVAISRAMA LNVIVGHPLV LLQNPLWAEL VRECVRRDAF RGAGAEYLPR FAGGGHDFAL PSSLDDDDVR PSRGASDAVA DAVAAVAELA LLGGGASDAL SSQDGHAWDD WGDEPSWRVA V
|
| |