Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26389 |
Symbol | |
ID | 5004323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 137037 |
End bp | 140204 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419744 |
Product | predicted protein |
Protein accession | XP_001420425 |
Protein GI | 145352162 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0807646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.155521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGCGC CGTCGCCGGG AGTGAGCGTG GGCGCGCTCG CGAGGTGCGG GAACCCGTTC GCGCACACGC CGGCGTGGCC GAAGACGCAG CGGACGACGC CGGAGACGGC GGCGGCGGCG CGGCGCGCGA GAGGGAAGGG GTTCATGTAT TTGAGAGACG AGGACGACGA CGCGGACGCG GTGACGCCGG TGAGTCGGCG ACGGGCGCCG GCGTCGGCGC CGGGACCGAG CGCGCGAGCG ACGGCGATGG CGCAGAGCCC GTTCGTGATC ATGCAACGAC ACATGGCGGG ATCGGCGACG CCGTACGCGA ATTTAGACGC GCCGGCGCCG GATTCGGTGA CGCTCACGAC GGCGAGTGGG AAGAGCGTCA TCGCGCGCGC GAACGGCGCG ATTCGAACCG ACGAGCAACC TAGATTTGAT GACGACGACG AGGCGTACGA AGACGCGTTC ACGTTTCGAG CCGCGATCGA GACGCCGGCG ACGACGAGAG CGGAGAGCGG GGGCGTGTTT ACGTCGGCGG CGCCCAAGGG CTTGGTCTTT CAAACCGGCG GTGGTGCGAG CATCGCGGTG TCTGAAGACG TAAAACGACG CGCGCGCGCG ATGTTTGCGG ACGTTGATGA AAACGACCCG TCGTCGTCTG GTCGAGAGCG ACCGAGCGCC GCCGCCGCAA CGCCGCCCGC GGCGAGCGGC GGCGGAGGTT TGGTGTTCCA GACCGCCGCG GGCGCGACTT TAGAGCTGAG CGAGGCGGCG TTGAAGAAGT CGCGCGCTTT TCTCGCGGAA GTCGGGAATG AAAACACGCC AGTGCGTGCG GCGTTCCCGC AGCCGTCGTT CAAGACGCCG AAGTCGCTCC CGGCGGTTCG CAAGGCGCCA TCGAAGGCGC CCGGGTTCAC GCCGCCGATG AAGAGCACGA CGGCTTTCAC GCCGCCGATG TCAATACTCG GCGCGCGAAC GGCGGCGAGT GCACCGCGAC AAGCGAAGAG ATCGCGCCAC GATGGCACTG GTGCGTCCGT GGGTGTTGCC GTGCACGACT TATTCGCCGC GCGCGTTCGC ATGGGCATGA GAGCGCCGCT CTGCACATTC TTCAATAATT TGCTCCCGTT TCAAGTTCGC CCAGCGTTCG TCGACACGTG CGTCGCGACG CTCAACGCCG ATACCGCGAA GTCATTGCGC TTGCCAAGCG TCGAACGCGG CCTGGTTGGT TGGCGAGAGA TGCGCGAGTT GATGATTAAA GCCGGCGCGA GCGACGCGTC TCTGACGAAC GAGTGGGTGG CGAATGCGTA CAAGTGGATC GTGTGGACAC AAGCATGCAT GGCGCGAGCG TTTCCAGAAA AGTATGCTTT CGGCGTCTTG AGCGAATCCG CCGTGCTGCA GCGCATGTTG TACAAGTATG AACGCGAAAT TAACCGCGCG GAGCGACCGC ACGTGAGGAG GATTTTAGAA AAGGATGAAA ACCCAGGCGC GCCGGCAGTG TACGTCGTGA GCGCGATTCG ATCGATGACG ACCGCTTCCG GTATCGGTAA TGTACCTACG ATGTCAGAAA TCGAAATTTC GGATGGATGG TATAGCGTTC GCGCGCGGCT GGACGCGAAG CTTACGCGAG CGGTGCGCGA AGGACGCTTG CGCGTCGGGT ACAAAATATT TGTTGTCGGT GCCGAGCTAC GAGGTGTCAC GGATGCGGTG TCACCACTGT CGGACGATGC AGAGATGGCG TACGTATGCC TGCACGTGAA CGGCGCGCGG CTCGCGCCGT GGGATGCGGC TTTGGGACGC GTCACGTACA ATCTCACGAT ACCCCTGCGT TCAGTGGTGC CCGACGGCGG CGTCGTGCCG CGTATGTTGA TTCATGTCCG ACACGCGTAT CCCATGATGC ATCAAGAACG TAGGGATGCA GACAAGAACG TATTGCGTTG CGAAATCGCT GAGCGTCGAG CGCATGCTGA ATGGCAGCGC GCTCGTGACA GCGTCTTGCA CGAGTTGCAA GACGCAATGC ACAATCGCAT CGGAGGTTGG GGCAGCGAAG TCGATCAAGA GCGAGCGATT CGCGAGGCGC TTCAAGAGAA GAATCTATAC GATCGTCGTA CGAGCGCGGT TCTTCGTCTG AACGTTGTCG GCTTCATGCC GTCGTCAAAG CACGAATCAT ATCGAGGGCC GATCACCCGC GGATCGACGA GTGCGATTCT TACGATTTGG GACGCGGACG AAGCACTCGT CGATGCGGCG CAGCCAGGAC AGTCGTTTGC GGTGACGGCA ATCAAACCAC GCGCGAACGC GTTTATCGAG GGCGAGTTAA GTTTATCCAC CACGCGCTAC ACGCGTTGGA CCCCGATCCC GCAGGAAGAT ATTGCCGCGC AAAAACTCGA GCACAGCGAG CATGAATGGC GCTGCTTGAG CGTGCGCGCA GCCGTGGATT TGGCTGACTT GTCGCGATCG GGTCTAGTGC GCAAAGAATT CGACGCCGTG GCGTGTACTT TACACTGCGG GCCTCCACGA TCGACACAAC GCGGTCGACT GTCGCAATGG ATATTTTGCT TTGATTCATC CGTCCTCGAT TCGACGACTC CGAACGCGGC GCGTCCGCAC CTTTTGGCGG TAGAGATTAC TGGATACGAC GATGACAGCT TTGTCAAGGC GGATGATTGG ATGCCATCCA CGAAGTTATT CACCGAAGCG CGGCACGAGT GTGGACCGCC GCTCGTTTTG CGAAATTTGG AGTTCCAACA TTACGACGCG GACAACGGCG TCTACGTCGC GCGCGTGCCG ATCGAAAATG TTTCGGTGAC GTTAGCGGCG CAAACGCCAG GCATCCATCG CATTTCGGTT GCGTCGCGCG ATTTGGAGCG ATTTTGTGGC TCGCGAGACG ATCGATCGCG CTCGATCATC GCGTCGTTGA AGTCGCGCGC GAGAGCACTC ACGGGCGTGC CCGAAAACGC CGAGTCTTTG GTGGACGAGA TGCCATGGGA CGAGGTTCAC GCTTCGCAAG AACCCGCGTC GCAATACGAT CCGCCCAGAT TGAGCATGGA CGACGATTGG AACGATGAGA ACGCGCAATC GCTCGCCGAC GAGGCCGTTC GTCTCACGCG AAGCGCGAGC AAAGCGCCCG TCGTCGTGCC AACCCCGACG CCCCGCCGAA CGCCGCGGCG TTCTAGCCGA GGAAGCGCGA GTCGTTGA
|
Protein sequence | MPAPSPGVSV GALARCGNPF AHTPAWPKTQ RTTPETAAAA RRARGKGFMY LRDEDDDADA VTPVSRRRAP ASAPGPSARA TAMAQSPFVI MQRHMAGSAT PYANLDAPAP DSVTLTTASG KSVIARANGA IRTDEQPRFD DDDEAYEDAF TFRAAIETPA TTRAESGGVF TSAAPKGLVF QTGGGASIAV SEDVKRRARA MFADVDENDP SSSGRERPSA AAATPPAASG GGGLVFQTAA GATLELSEAA LKKSRAFLAE VGNENTPVRA AFPQPSFKTP KSLPAVRKAP SKAPGFTPPM KSTTAFTPPM SILGARTAAS APRQAKRSRH DGTGASVGVA VHDLFAARVR MGMRAPLCTF FNNLLPFQVR PAFVDTCVAT LNADTAKSLR LPSVERGLVG WREMRELMIK AGASDASLTN EWVANAYKWI VWTQACMARA FPEKYAFGVL SESAVLQRML YKYEREINRA ERPHVRRILE KDENPGAPAV YVVSAIRSMT TASGIGNVPT MSEIEISDGW YSVRARLDAK LTRAVREGRL RVGYKIFVVG AELRGVTDAV SPLSDDAEMA YVCLHVNGAR LAPWDAALGR VTYNLTIPLR SVVPDGGVVP RMLIHVRHAY PMMHQERRDA DKNVLRCEIA ERRAHAEWQR ARDSVLHELQ DAMHNRIGGW GSEVDQERAI REALQEKNLY DRRTSAVLRL NVVGFMPSSK HESYRGPITR GSTSAILTIW DADEALVDAA QPGQSFAVTA IKPRANAFIE GELSLSTTRY TRWTPIPQED IAAQKLEHSE HEWRCLSVRA AVDLADLSRS GLVRKEFDAV ACTLHCGPPR STQRGRLSQW IFCFDSSVLD STTPNAARPH LLAVEITGYD DDSFVKADDW MPSTKLFTEA RHECGPPLVL RNLEFQHYDA DNGVYVARVP IENVSVTLAA QTPGIHRISV ASRDLERFCG SRDDRSRSII ASLKSRARAL TGVPENAESL VDEMPWDEVH ASQEPASQYD PPRLSMDDDW NDENAQSLAD EAVRLTRSAS KAPVVVPTPT PRRTPRRSSR GSASR
|
| |