Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14566 |
Symbol | |
ID | 5001134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 343104 |
End bp | 346766 |
Gene Length | 3663 bp |
Protein Length | 1220 aa |
Translation table | |
GC content | 52% |
IMG OID | 640416555 |
Product | predicted protein |
Protein accession | XP_001416917 |
Protein GI | 145344809 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.122628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.061037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCACA TCGGGGCGAA ACCGAGCGAG GACGGGGGTG GGAAGTGGGA GGCGAAGGCG AGGGCGATCG AAGGCGACGC GACGACGGCG GGGGAGGACG AGAACGAGGC GACGTCGGAG ACGAGCGGTG GGTCGGTGAC GCTCGATGAC GTCGCCACGG GGGGGGGGAG GACGAAAGTG CGGTGCGCGT CGCTGACGGC GATGGACATC ATCGCCGCCG TGCCGGTGGA GAAGTGCGTG GAGACGATGG ATTGGGTGGA GACGGGTCGC GAGACGAGAG ACGAGAGAGC GATGGCGTTC GGCGCGGTGG CTTCGCGTCC GGTGAAGATA TCGGTCGATC ATGGGAAGAA GCGCGGTGGC ATCGATAAGT CATTGGTAGT AGCGATGGAT TCGCGGTGGA TGGAGAGGAT GGAGACGCCC GTGATCGTCG ATGAAGACGA GACGCGAAGA GCGCTGCGAA GAGCGATTCG CGAGGCGAGT TTCTCGACAA ATTCGGAGAG CTGTTTGTTT CGATTGAACG AAGACGCCCT GGAGGCGCTG GATTGCGTTG AGAATGACTT GCAGAAGAGA TACGACGTTG AGCGAGAGAG CGAAGGTTCG AAAGAGTTTG AGTCGACGAC GAAGGTTGGT AAGTTGAAGG ATTTCATGAG TTTGTTTCCG CGCCCACTCG AGACTTCGTT TGCGATGGAC GAGCCTTCTC TTCTCGCGTC TCCGATCACG TTCTTGCTCT CGATGGCGAA AAAGATCAAA GTGCTGCCGT GGGAAAATGA GATAAAGATT GATAGTTCAT GCAAGGAGTA CTTAAAGAAT CTCCTGTTTC AAGGAAGGAT CCCCGAGCCG TCGACGAAAG ACCCTTTCCA AGCCGTGCCG ACGAAGATCG CTCCGTTGAT TTGTGAAGAC ACCGCGCCAA GGTTCATGGA GAACGTGGAT CCAGATTCGA TGGATATTGC TCTGGACTCA AAGTTGAAGA CTTTAACGAC GCCTGTGGAA GAGCACACCC TCGCGGTGTC GAATGATTTC TCGTCCAAAG CCAAAGCGCA AATGACAAAG AAGCTTCTAA ACGTCGACGA CCGATTCGTC ACCAAACCAA TGATTGTACC GAAGATCGAT GAAGAAGAAG AAGAAGATGC GAATGTTCGT ACGGATCGCG CAATGCGAGA CGTGATTGCG AGTGTGCAAG CAGTAAAAGA CCGCGCGCCC GCGTTATCCG AATGGAAAGT TGCTATCCCA AACGCCCTCG ATACGTTGGC GCGATTTCAA GAAGAAGAGC ACACCGCGGC GGCAACTTTA TTTCGCGAAG CCGAGATGGA TAATGATGAT CTCGATTGGA CTTGGGACGA CATGATGGAT AAGAACGATG AACCACGAAC GACGAAAGAA CCGCCAAAGT CAGCGCCTGA AGCCATTCCA AAGCCACCGA CGCTGAGTGA ATCGCTGTCT CTTTCGAAGA TTGAGAAACA CAAGGATGCG TGCTTGATCG ATTTCGTCGA GCTTGACCTT GATCCAGAAC ACACATATGT GATCAAAGTG CTCGCGGAAC GTTATGGTTA CATATCGCAA ATTTTACCTC TGGATATTCA GCGAGCTATT CCCCGATTTC AAGTCGGCGA ATCTACGGAG CTCGCGCACG CAATGAAACA CTTTGCGAAT GAGTCGTCGT CAATCAATCG ACATTTGAAG AACTTGCTGT GCTGTCAAAG TGCGGCGAAT TTAATTCATG TGTATGGAAT TCATCAAGCG CACATTTTGA TGCGGAGTTA CGCTAAGGAT GAACAAGATT TATCAGACAT CAAGAGCCTC GTCGAAAACA ACGACGTTGC CGTGGAGCGC GGCGAGTTTG ACGATCATCC CAAGTTGAGA ATGATCAGAG CGTACATCGC AGATCTCATA GTGACTCAAT CGAAAATGCT CATTATCTTT CCCGACGCGA TGGGTATACT CTCCATCGTG CGATTTATCA GTAAGCTTGG TACAAAAGCA GTTCAGTTTG ATGGTAAGCA AGAGTTTAAA AATCTACACG AAGACGATGT TGAGGATTTC GCTCGCGCGG TTACGCTCGC GACGGCCAAT GTACAAGTTT TAATCGCTCT CGAGGCTCAT ATTGCTCATG AGGCATTTCC ATTGGAGCTA TTTTCGACGT GCGTTTATTA CGCTCCGTCG CCGAATGCGC TTCAAGATCT TCATCTGCTT GGTTCTTCGC GCCACGCTGC AAATCGCTTT GTGAGAGTGC TCGCCATGAA ACGGGGTGTC TCTATACCTA GTGAAGATCC AGAGCCTGCG TCATCGGCGT TGCGTCATTC ACCGTACGTG GAGAATAATC AACATGCTGC AAGCGTCGTG CGCGACGAAG ATCCCAATGT ATCGCCTCCT GTACAATTCG ATGGCGTTCC GAGGCACGTC GTCGTGCTCA ACGTCGCTAG ACAGATTATC AAGGTACGAG AGCAACTATT TGGTCAAGTT GAGCGAAACT TACTCGACGA CGGGTGCGAC GTGATTTTGC GAGAGTTTGG GATCGACGTT GACGCATGCT TCACCGTCGG CAAGCAAGTT CACGCAGTGA TATTGATCGT GCCAGAATAT TTTGTTGAAG GGTTACCGAC ACAAATTGAA CTTTTTTCGT TGGTGGAAGA TTTGATGGTT GCTATGGCGC ACTCGTTTGT GTCAGGAACG ATGGTCTTTG AGGGTGATTT GGACTTTTTA AACATTGCGC AAACGCTTGA TCGAAGAATC CATTCAGACG CCGCGCACAT GCATTTTTCC ATCGATCTCA GGTACAGTCT CGAAGATGAA ACCGCGTCGA TGCTACATGA GATTCTGCAG CCACAAGATG GTCGTTTCTC AGAATTTTCT GCGCCAATGC CAGAGAGCCG AACGATTGAA GAAATGGAGT TGTGTGATAT GTTTCCTTTG CTCAATCCAA TCTCCGCATG CGCTCTATTG GCGGATGAAT TCGTTTGCGA TGAACTCCAA CGCATTGGTA ATCTGTCTTC GAATACTAAG GCGTACTTGC AACGCGATGG ATACCTTGGA TGCCCACTCG ATGTCTTCGG TGCCGAAGAG CCGGCGAAGG ATGTACCACC GCCGTATTCG CCCCGCGGCG GAGCTGCGCA AGAACGAAGG CAAGTCTTCA CGGAACGTCC AATCACAGAG TTTTACTCGC CCATTTCGAA GCGACCACGA CTCGACGAAT GGGAGCTCGA ATCTTTGTCT TTAGATTCGC CGACAAACAC CGCGACTCCG CATTATTACC AATCTTTGCA TCGGCGCAAT ATCGATACGC CGTCTCCAAT CGTGGACTTC AAGACGTCTC CCACGTCGGC GCCGCGATCA CTCTTACGAT CACCGCCAGC GGCGTCACCA CGAAGAGCGC CCCCTGTCAA TGTTGGCGCT GTCGGCGGTG GTCGACTCTG GCAACCGACC CGGACGAGCG ACGGACCCTC GACGAGCGCA AACATGATTC TGGAGAAGCA TCGGCAGCAA CCTGAACCGT CACTTGAGGA ATTCAATCAA ATGCTCGAGT CTTACCGAAT GCCCGCGCCT CCCGAGCGTG GGCGAAGAAA CTCGACGCTT GATTGTTTCG ATTCGAACAG AGACTTAACG CAGCACCGAT TCACGAGAAT ATCATCATCG CGAACGCAAC AATTTCCAAA CACTTGGAAA TGA
|
Protein sequence | MTHIGAKPSE DGGGKWEAKA RAIEGDATTA GEDENEATSE TSGGSVTLDD VATGGGRTKV RCASLTAMDI IAAVPVEKCV ETMDWVETGR ETRDERAMAF GAVASRPVKI SVDHGKKRGG IDKSLVVAMD SRWMERMETP VIVDEDETRR ALRRAIREAS FSTNSESCLF RLNEDALEAL DCVENDLQKR YDVERESEGS KEFESTTKVG KLKDFMSLFP RPLETSFAMD EPSLLASPIT FLLSMAKKIK VLPWENEIKI DSSCKEYLKN LLFQGRIPEP STKDPFQAVP TKIAPLICED TAPRFMENVD PDSMDIALDS KLKTLTTPVE EHTLAVSNDF SSKAKAQMTK KLLNVDDRFV TKPMIVPKID EEEEEDANVR TDRAMRDVIA SVQAVKDRAP ALSEWKVAIP NALDTLARFQ EEEHTAAATL FREAEMDNDD LDWTWDDMMD KNDEPRTTKE PPKSAPEAIP KPPTLSESLS LSKIEKHKDA CLIDFVELDL DPEHTYVIKV LAERYGYISQ ILPLDIQRAI PRFQVGESTE LAHAMKHFAN ESSSINRHLK NLLCCQSAAN LIHVYGIHQA HILMRSYAKD EQDLSDIKSL VENNDVAVER GEFDDHPKLR MIRAYIADLI VTQSKMLIIF PDAMGILSIV RFISKLGTKA VQFDGKQEFK NLHEDDVEDF ARAVTLATAN VQVLIALEAH IAHEAFPLEL FSTCVYYAPS PNALQDLHLL GSSRHAANRF VRVLAMKRGV SIPSEDPEPA SSALRHSPYV ENNQHAASVV RDEDPNVSPP VQFDGVPRHV VVLNVARQII KVREQLFGQV ERNLLDDGCD VILREFGIDV DACFTVGKQV HAVILIVPEY FVEGLPTQIE LFSLVEDLMV AMAHSFVSGT MVFEGDLDFL NIAQTLDRRI HSDAAHMHFS IDLRYSLEDE TASMLHEILQ PQDGRFSEFS APMPESRTIE EMELCDMFPL LNPISACALL ADEFVCDELQ RIGNLSSNTK AYLQRDGYLG CPLDVFGAEE PAKDVPPPYS PRGGAAQERR QVFTERPITE FYSPISKRPR LDEWELESLS LDSPTNTATP HYYQSLHRRN IDTPSPIVDF KTSPTSAPRS LLRSPPAASP RRAPPVNVGA VGGGRLWQPT RTSDGPSTSA NMILEKHRQQ PEPSLEEFNQ MLESYRMPAP PERGRRNSTL DCFDSNRDLT QHRFTRISSS RTQQFPNTWK
|
| |