Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40550 |
Symbol | |
ID | 5005665 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 1418 |
End bp | 4405 |
Gene Length | 2988 bp |
Protein Length | 950 aa |
Translation table | |
GC content | 54% |
IMG OID | 640421086 |
Product | predicted protein |
Protein accession | XP_001421676 |
Protein GI | 145354827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCAAA ACCGACGACT GACTGGGCAT ATATTATTTC ATCGACGGGC GCAGAGAGGA CGCGTACACC CGGGAGATAT CGCCGAAGCC GCGTCGCTGG CGAGCATCGA TTTACCACCG GCCAAGTATC CTTTATTTGA CGCCATCGGC GACGATTTCG TGGAGACTGG AAAACTGAGC AAGCTCCAAC TGGAGGGTGT CATGTTCGCG TGTCAAAAGC ACTGCGAATT CGCGCCGGAT GGCAAGCGTT CGGGTTTCAT GATTGGCGAC GGTGCGGGGG TGGGCAAGGG ACGACAAATT AGTGGAATTA TTATCGATAA TTTCGTGCGA GGCCGTCGGA AAGCGGTTTG GATATCGAGC TCGGGCGATT TACACCGAGA CGCCGAGCGC GACTTGAGCG ATTTGGGAAG CACTATTAAG GTTATTAATT CGTGCCCAGA GCTCGATCGC GAAACTCGCG CGTTCGGGTT ATCGAAAGAG TATCAAGAGG GCGTGTTGTT TACCACCTAC AGCACGTTAG TGAGCAAGAC GCAGAAGAAG AGTCGTCTCG ATCAAATCGT AAATTGGTTC GGCGGCGAAG ACGCCGAGGG CGTCGTGATT TTTGACGAGT GCCACAAGGC AAAGAACTTT AGCGCGTCTT CTGATTCGGG TAGTAAAGTT GCGGCGGCTG TGATTGCGTT TCAGGAACGC TGTCCTCGCG CGCGCGTTGT GTACGCGAGC GCCACAGGTA TCTCTGAAGT TGGGCACATG ACCTACTTGG CCAGACTCGG TTTCTGGGGC AAAGGGACGC CATTTAAATC CGCGGATACG TTCATTGAGA GCATGAAGTC TCGCGGTGTA GGATTTTTAG AAATGCTCGC CATGGAAATG AAGGCAAGTG GGAAGTATGT TAGTCGAGGT TTATCTTTTA GGCAGGCAGA GTTTGAGCAG TACACGATCG CGCTCAGCAA AGAGCAGCGT CAAATGTACG ATGACGCGTG CGACTTGATG TCGCTCATCC GTAATGCGTG TATCGAAGCT GTGCGTCGAA CTGGTTCCGA TGGAAAAGGC GTTTGGAGTG CGTATTGGAG CGTCCATCAA AGATTCTTCA AGCTCTTGTG CATCAGCATG AAAGTTCCGG CGGTCATCGC CAAGGCGAAC GAGGCGCTCG CGCGAGAGCA ATGCGTCGTC ATCGGGTTGC AAACGACAGG TGAGTCTCAA GATGCGTCAG CCAACTTGAC TTTTGGTGAA GACGTCACGG AGTTCGGGTT CCTGAGTACG ACGCGCGAAA TGCTCGTCAA TTTCTTGAAC GTGCACTTTC CGACTGAAAT CCACGGCGCT TCAGACGGAG CGGCGGCGGA AAAGTCGGCG CCTTCCGATT GGTCGAGCGA TTGGAGGCGT GCAGGCGGCG CGGAGGTAGA GAGCGAAGGC TACACTGGTC GTTTTCACAC GAATGCCCCA GCCAAGGTCG ACGAAGAGCT CGTGGACGCT AAGGAAGAGT TGATCAATAA AGTCATCAAG TTGAGGTTAC CGCCGAACTT TCTCGATGCT CTGATTGATG GGCTCGGAGG GCCGACAGAA GTTGCCGAAC TCACGGGAAG AAGCGGTCGC ATCGTGCGTC GAGGCGATAG GCTCATGTAC GAATCGCGTG GCACGACGGC GTCTAGAAAA GGAACGAGCA TCAGCGGCGA CGGTGATGTG CTCGGAGTGA ACATCGCCGA GAAGAACGCG TTTATGAATG GTGACAAACG TGTGGCCATA ATTTCGGACG CGGCTAGTAC CGGGATTTCA TTGCACGCGT CGAAAGGCGC GAAAAATGTC CGCCGCCGCG TGCACGTCAC GATAGAACTT CCATGGAGTG CGGACAAGGC GATTCAGCAG CTCGGTCGCA CGCATCGATC CAATCAGATT ACGGGTCCAG TGTACGTGAT GTGCTCGACG AATATCGGAG GCGAAAGACG CTTCGTCGCC GCCGTCGCAC GACGCTTGCA AAGTCTCGGG GCGTTGACGA GAGGTGATCG ACGTGCGGCG ACTGGCATTG ATTTATCCGA TGGCAATCTC GACTCGTCAC TCGGGCGACA CGCGTTGAAG AAGCTCTACG AGGCGCTCGT CGCGCTCGAT GCGAATCTCC CGTCGGGCGT CACGCTCGAG GGCATCATGT CGCATGTGAA CGACGACGAT CTCGAGGGGA AACCCATCTC GAGCTGGCAG GATTTGAAGC TTGAACTTCG TTCAGCGTTA TTGGAGCTCG GAGTGCAGGT TGGACTGCGA AATGACGGCT TTTCGGTCGA AGAAGCGACG CTGCTGCAGG AAATTGGATA CGGCAACAAA GATCTCGGCG ACGTGCGCAA GTTTCTCAAT CGCCTACTTG GACTTCCGGT GCGAACGCAA AATATCATGT TTGCATACTT TGCTGAAACC TTGGATTGCG AAATCAAATT AGCAAAGGCT GAGGGGAAGT ACACGGAGGG CGTGAGCGAT CTCGGTGGTT CTTCCATTCG CATCGAACCA GAGTCGAAGA CGATTCTTAA GGATCCGTAC CACGGACAAA AACTCGTGTC GACCAAGGTT TTCATAGATC GCGGTATTTC GTTTGAGCGT GCGCTGGATA TTTTGGAAAA GAGCAAGAAT CACGATCGAG ACGGCTTCTA CAAGATGAAG CGCGAGATGT ATGGGCGCGC GCAAGTCGTT CTGGCCATCA CCAAGCCAGG TTCGCGAAAC TTGTTCATGC TGTGGCGTCC GAACACCGGC GCAAGTCTTT TCGAGATGGA GTACGACGAA TTGCGCGGCA AATACATGAT GTGCACGCCC GACGTGGCGA AACAACCGTG GGATCTTGTG CACGAATTGA CCGAGAAGCA TTGCATGCAC GGCGTGAATT GTCACGTTCG ACAAACCATG GGCGCTTGTA CCGCGGGCAA ACGCACGGTT GACTGCACCG TGATCAGCGG CGCCGTCGTG CCTTGCTGGG GAGAGCTCGA ACGAACTTTG GATAGAAACG CGCACAAG
|
Protein sequence | MIQNRRLTGH ILFHRRAQRG RVHPGDIAEA ASLASIDLPP AKYPLFDAIG DDFVETGKLS KLQLEGVMFA CQKHCEFAPD GKRSGFMIGD GAGVGKGRQI SGIIIDNFVR GRRKAVWISS SGDLHRDAER DLSDLGSTIK VINSCPELDR ETRAFGLSKE YQEGVLFTTY STLVSKTQKK SRLDQIVNWF GGEDAEGVVI FDECHKAKNF SASSDSGSKV AAAVIAFQER CPRARVVYAS ATGISEVGHM TYLARLGFWG KGTPFKSADT FIESMKSRGV GFLEMLAMEM KASGKYVSRG LSFRQAEFEQ YTIALSKEQR QMYDDACDLM SLIRNACIEA VRRTGSDGKG VWSAYWSVHQ RFFKLLCISM KVPAVIAKAN EALAREQCVV IGLQTTGESQ DASANLTFGE DVTEFGFLST TREMLVNFLN VHFPTEIHGA SDGAAAEKSA PSDWSSDWRR AGGAEVESEG YTGRFHTNAP AKVDEELVDA KEELINKVIK LRLPPNFLDA LIDGLGGPTE VAELTGRSGR IVRRGDRLMY ESRGTTASRK GTSISGDGDV LGVNIAEKNA FMNGDKRVAI ISDAASTGIS LHASKGAKNV RRRVHVTIEL PWSADKAIQQ LGRTHRSNQI TGPVYVMCST NIGGERRFVA AVARRLQSLG ALTRGDRRAA TGIDLSDGNL DSSLGRHALK KLYEALLGVQ VGLRNDGFSV EEATLLQEIG YGNKDLGDVR KFLNRLLGLP VRTQNIMFAY FAETLDCEIK LAKAEGKYTE GVSDLGGSSI RIEPESKTIL KDPYHGQKLV STKVFIDRGI SFERALDILE KSKNHDRDGF YKMKREMYGR AQVVLAITKP GSRNLFMLWR PNTGASLFEM EYDELRGKYM MCTPDVAKQP WDLVHELTEK HCMHGVNCHV RQTMGACTAG KRTVDCTVIS GAVVPCWGEL ERTLDRNAHK
|
| |