Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50379 |
Symbol | |
ID | 5003755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 421298 |
End bp | 424369 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | |
GC content | 57% |
IMG OID | 640419176 |
Product | predicted protein |
Protein accession | XP_001419803 |
Protein GI | 145350838 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0819342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0284916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGA CGGCCGAGCT CGAGTGCGCG AACGCGCGCG CGTCCGAACA CTCGGCGCGC GCGCGCGACG CGGCGAAGCG ACGCAAGGCG ACGGAGGCGG CGGCGGATGG AATCGCGGAC GGCGGCGGCG ACGACGACGG CGACGCGCGC GCGCGCGCGA GGACGAGCTG CGTGCACGAA GTCGCGGTGC CGCGAGACTG GGTCGGCGAC GTGAAAGCGC TGCGGGACCC GCGGTACGAC GGCGCGAGGG CGAAGGAGTA CCCGTTCGAG CTGGACGCGT TTCAGCGCGC GGCGACGGCG GTGCTGGAAC GAAACGAAAG CGTGCTCGTC GCCGCGCACA CGTCGGCGGG GAAGACGGTG GTGGCGGAAT ACGCGATCGC GATGGCGTTT CGGGATAAAC AGCGGGTGAT ATATACGTCG CCGTTGAAGG CGCTGAGTAA TCAAAAGTAT CGGGAGTTGA GCGAGGAATT CGGCGACGTC GGGTTGATGA CGGGGGACGC GTCGATTAAT CCGAATAGTA CGTGCATCGT GATGACGACG GAGGTGCTGC GGTCGATGTT ATATCGAGGC GGGGACGTAA TTCGCGAGGT GAAGTGGATC GTGTTCGACG AGGTGCATTA CATGCGGGAC AGAGAACGCG GGGTGGTGTG GGAAGAGTCG ATCATCTTTG CGCCGAAGGA CGCGCGGTTG GTGTTTTTGA GCGCCACGCT GCCGAATGCG CTCGAGTTCG CGCAGTGGGT GACGAGCTTA CATAATCATC CGTGTCACGT GGTGTACACG GATCATCGAC CGACGCCGCT GCAGCATTAC GCGTTTCCCA AGGGCGGGAG CGGTTTACAT TTAGTCGTCA ACGAGCAGAG TCAGTTTCGT TCGGACAATT TTGCGAGGTT GCAGCAGGCG ATCGCGGATG GGGCTGAGAA GAGCGGTGGT TCAGGAGGCG GCGGTCGCGG TCGCGGTCGT GGCGGTGGAC GCGCACGCGG CGGCGGCGGC GGTCGTGGCG GTGGCGGTGG CGGTCGCGGC GGCGGGTCGA TGGCGGACGC CGATATTTTG CGCATCGTGC GTATGGTGAA GGAGAAAACC TTCTTCCCCG TCATCGTGTT TAGTTTTAGC CGACGCGAGT GCGAAGAGTA CGCCAAATTC GTGTCAAAAT TGAATTTCAA CACTCCCGAG GAGGCTGAGC AGGTTCGTGA GGTGTACAAC GCCGCACTGC TGAATTTGTC CGAAGAAGAC CGTCAGTTGA CGGCGGTGCA AGCGATTTTG CCATTGCTCG AGGCGGGCAT CGGCATTCAT CACAGTGGTT TACTTCCGGT TTTGAAGGAG CTCATAGAAA TTCTGTTCGG CGAGTCGCTG ATTAAGTGCT TATTTGCAAC TGAGACGTTC GCCATGGGAC TGAATATGCC GGCGAGAACC GTCATTTTCA CCGCTGTTAA AAAGTTTGAT GGCACCGATA TGCGCGTTCT CGCGCCCGGA GAGTATACGC AAATGTCCGG CCGAGCTGGT CGACGTGGTA AAGACGACCG TGGTATCTGC ATCGTCATGT GCGATGAGCG CATGGAAGAA CACGCGATGA AGGAGATGAT TCTTGGGAAG CCGCAGCCGT TGAACTCAGA GTTTAAGCTG AGTTATTACA GTATTTTGAA CCTCTTAAAA CGCGCGACGG GGACGATTGA CGCAGAGTAC GTCATCGCTC GCTCGTTTCA TCAGTTTCAG CACGCCAAAC AGTTACCAGA ATTAAAGGCT CGGCTCACTG AAGTACAACA GGAGGCGGCG AAGATAAAGT CGGTGGGTAG CGAAGAGATT CAAGAGTATA TCAAACTCAG ACGCGATTAT CGCGAAGCCG AGAAAGTGGT CTTGCGCACG ATGCTCCAAC CGGCAAACTG CTTGCGATTC TTCACTTCGG GTAGACTAGT TCGCATAAGA GATGGCGACA CGAATTGGGG GTGGGGTGTT GTCATCCAAG TTTCCACAGT TAAAGATGCG AAGGGTGGCG ACGTACACGT GCTCGACTGT TTGCTTCGTT GCGGTCCAGG CGCGGCAGAG GGTAGACTTG CGCCTGCGGA CGCAAAGAAT CTGAAGATGA ACACAACGGA AATCGTACCT GTGGGCACAC ATCTCGTTGA TGCTATTAGC GCGATGCGCT TCACGCTTCC AGGTGATTTG CGCACGAAAG AAGCGCGCGA AAGCGTTTGG ATTGCCGTCG AAACTGTTAC GAAGAAACTC ACCGAAAAAG GCCAGGTGAT TCCGCAAATA CATCCTGTCG ATGATATGGG GATTAATGAC GTCGCATTTG TGCGCACATA CCGTTCACTT GGCGCGTTAC GCGACAAGTT CCACTCGCAC GCGTTGTACA GCGAAGCGGA TGCGCTCGAG CGCAGCGAAA TGACGGCAAA AATCGACGTC ATCGAGCAGA AATCAGAGCT CCTCGCTGAA GCGTCGAGAC TGGAGACACA GATTCAATCG AGCGAGTTGA CAAAGTTCCG CGACGATTTG AGCGCGCGAA GTCGAGTTTT GAAGAAACTC GGGCACATCG ACAATGATGG CGTCGTTTTG ACGAAAGGTC GCGCCGCGTG CGAAATCGAC ACCGCTGACG AACTCCTAGT CACCGAGCTC ATGTTCAATG GCGTATTCGC CGGTCTAAGC CCTCACGAGC TCGTTGCCTT GGCGTCGTGC TTCATGCCCG TAGAGAAAAG CAACACATCG AACATGGATA AATCCGCAAA GGCGCTCGCG AAGCCGCTCA AAGCCCTTCA GGACGCCGCT CGAGAAATTG GCAACGTACA AAAAGAGTGT AAAATCGACA TCGAAGTCGA CGACTTCGTC GAATCATTCA AGCCAACCAT GGTCGAAATC GTGTACTGCT GGGCCAAAGG CGAACCGTTT TCCGAAATCG TCAAAAAGAC AGATCTATTC GAGGGCACCA TCATTCGCGC CATGCGTCGC CTGGACGAAC TCATGATGGA ATTACATCGC TCGTGCGTCG CCGTCGGCGA CGACGGCTTG GCGAAAAAGT TCGAGCAAGG CGCGGAGAGT CTGCGCCACG GCATCGTCTT CGCCGATTCA CTGTACACCT AG
|
Protein sequence | MSATAELECA NARASEHSAR ARDAAKRRKA TEAAADGIAD GGGDDDGDAR ARARTSCVHE VAVPRDWVGD VKALRDPRYD GARAKEYPFE LDAFQRAATA VLERNESVLV AAHTSAGKTV VAEYAIAMAF RDKQRVIYTS PLKALSNQKY RELSEEFGDV GLMTGDASIN PNSTCIVMTT EVLRSMLYRG GDVIREVKWI VFDEVHYMRD RERGVVWEES IIFAPKDARL VFLSATLPNA LEFAQWVTSL HNHPCHVVYT DHRPTPLQHY AFPKGGSGLH LVVNEQSQFR SDNFARLQQA IADGAEKSGG SGGGGRGRGR GGGRARGGGG GRGGGGGGRG GGSMADADIL RIVRMVKEKT FFPVIVFSFS RRECEEYAKF VSKLNFNTPE EAEQVREVYN AALLNLSEED RQLTAVQAIL PLLEAGIGIH HSGLLPVLKE LIEILFGESL IKCLFATETF AMGLNMPART VIFTAVKKFD GTDMRVLAPG EYTQMSGRAG RRGKDDRGIC IVMCDERMEE HAMKEMILGK PQPLNSEFKL SYYSILNLLK RATGTIDAEY VIARSFHQFQ HAKQLPELKA RLTEVQQEAA KIKSVGSEEI QEYIKLRRDY REAEKVVLRT MLQPANCLRF FTSGRLVRIR DGDTNWGWGV VIQVSTVKDA KGGDVHVLDC LLRCGPGAAE GRLAPADAKN LKMNTTEIVP VGTHLVDAIS AMRFTLPGDL RTKEARESVW IAVETVTKKL TEKGQVIPQI HPVDDMGIND VAFVRTYRSL GALRDKFHSH ALYSEADALE RSEMTAKIDV IEQKSELLAE ASRLETQIQS SELTKFRDDL SARSRVLKKL GHIDNDGVVL TKGRAACEID TADELLVTEL MFNGVFAGLS PHELVALASC FMPVEKSNTS NMDKSAKALA KPLKALQDAA REIGNVQKEC KIDIEVDDFV ESFKPTMVEI VYCWAKGEPF SEIVKKTDLF EGTIIRAMRR LDELMMELHR SCVAVGDDGL AKKFEQGAES LRHGIVFADS LYT
|
| |