Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24607 |
Symbol | |
ID | 5002021 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 755884 |
End bp | 759191 |
Gene Length | 3308 bp |
Protein Length | 999 aa |
Translation table | |
GC content | 62% |
IMG OID | 640417442 |
Product | predicted protein |
Protein accession | XP_001418100 |
Protein GI | 145347277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.230568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGACGA CGACGGCGAG GGAGACGGGC GACGCGGCGA AGGCGAGCGA AGGGAAGACG GCGAAGGCGC TCGCGGCGAA GCCGGCGGCG GAGTCGTCTT CGACAGCCGT GGAGCTCGTC GCGACGGCGA ATTTAGGCGG TAAGTCCAAG GTTTTGTTTG AGATCGAAGG CGCCGGAGAG GCGGTGGATT TAGACGGCGA TACCGGCGCC GTGGGTCGTT GGCTCGCCGA GTCCTCGCGC GCGCTCAAGG TTGACATGAA GGGCGTGATG TACAACGCGC GCGTCGTCTC GAGCGCCGGC ACCGTCGTCG TCGTCGCCGT CAACGCCGAC GTAGCCAAGA TCGAGAGCGT ACACCGCGAG TTCGTGCAGC TTCGCGAAGA CCCGAGCGCG ATGGGCGGCA TGGAGAATTT CGGCGTCGGT TCGCTCTTCG ACCAGGACGA AGACGTCGAC GACGAAGACG TCGCCGTCGG CGCCAAGCGC AAGCGCGCCG CGGAGACCTC CAAAGCCTCG GGCGCGTCCG CTCGCAGGCC CGCGACGAAG CGTAAACCGG CGCAGAAGCG CAAGCCAGCC GTCAGGCGCA AGAAGTAGCT CGAGTTGTCA TTCGATTCCT TCGTCGCCCG CCTCGCGCGC CCCGGCGGTT GACTGCCAAC CTCCCGTCGC GTCACTCCGC GTCCCGCGAG CGTCCCTCGT CGCGCGCGCG CCCGCGCCCG CGCCCGCGGC ACGTCGACGC GCGCATCGAG CGTCGTTCGG CGCCGCGTTT GCGGACGCGC CGCGACCGCG CGCGCGGTTT CGCGTCGCGC CGCCCGCGCC GTGTCGAGTC CGCGAACGCG GCGCGATGCC CGCCCCCGCG GACGCCAAGC GCGTGTTCAA GCTCGCGCTC GGCGCGGACG GGACGAAGAA ACAGCGATAC GTCGTCGAGA CGCGCTCGGC GGGCGGCGGA CGGCTCGGGG TGATTCTGAT CGGACTGAAG AGCCCAGATC GGGTGCTGGT GAGCTCGAGC GCGGCGCGCG CGGCGCTGAA CGCGGAAGAA TTCGTGTGCG TCGACGCGAA CGTGGAGCGA GAGCTGGTGA ACGCGCGAGC GATACGCGTG GTGCGAGTGG AACGCGGCGA ACGGGTGTGC GTGCTGGATG ATGGTGAGGC GACGAACGCG AACGTCGGGG CGCCGGCGAA CCGCGGGGAG ACGTCGACGG AGGGCTCGCC GAGCGGACCG CCGCCGGCGA AGATGATGGG AGCGACGAAC CAAGACGTGT TGGGGATGTT GAGCGGGATC ATGAAAGGGG CGAGTCCGCG GATTTTGAGG AAGGATGAAG CGGGAGGGAT AATACAGAGC GAGCCGCCGG CGCCGACGCG CGCGGCGGGG CAACCGACGG GGCAACCGAC TCGGACGAAG TCGCCGATGT TGACGCCGCG AACGACGCGC GGGGGCACCG CGACTCGTGG GCCGACGTCG ACGAACGGGC AGACACCGCC GATGGGGAGT GCGTCCGGCC CTGGTGCGTC GTCGCTGCCG CTGGGTACGT TTGGCGCCGC GTCGCCGTCG TTCTTTTCGC CAGCCTTCGC GTCGCCGAGC GAAGCCGCGC CAGCGCGAGC GAGCGGTGAT GATTACATCG AAAGTATGAT GAGTGATTTT TCCGCGGCGC TTCTCGGAGG TGATACAGGA GACGCGCGCG TGGGCGCGAA CATCCCTCTC GGATACACTT CACCGTCGGT GAACTCGCAT CAACAACTCG AGCACAACGT CCCGTCAGCG GGTGACAGAG AATTGAGTCC GTTTTTCTCT TCCGTGGACG CCTCAGCGAG TGGTGTCCTC GGAGGCGCGC TAGAAGATGG TGTCGACGTG GCGCCGCACG AAGACATTTC CGGACTTCGT GGCGGCGATG CAGACGAAAT CGACCTCGCC GAACTCACAG CCGGTCGATT CGAAGCGGAT GCACTGGGTC TGGACGCCAC CCCGGCCGAA CTCACAGATC TGTTGAATCG ATCGCGCATA CTTCGTCAAG CCACGGAGAC GTTGGGCGAT TCTGCGCTCG ACGACGATTC CATCAAGGAG GAAGTTTCCG CGCCACCGAC GAACAGCGTG TTCCGCGATC TGCAGCGCGA GCCGTCGGCT GGTGAAATAT GCACTGCGTT TAATTTGCGC GGGGATAAGT TCGGATGCAA CGGGTGCGTG AGTCGACACG TGTGTCAACT CTGCGAGTCG CCGCGTCACA CGTTTGGAAT GTGTCCGTCA CTCTACGCCA ACATTCATAA ACGCGTGGAG AACAAAGTGT GCTTGGATTA TTACTTGAAC TCCGTGGACG ACACCGGCGG AGCTTTGCGT AACGGATGGG ATCAAGAAAA GCACGTTTGG AATTTGTGTC CCCGCAGCGA CAAGTGCATG CTCGAGCACG TGTGCGGCGC GTGCGGAAAG AGCGGCAACG ACGCGCATTT ACCGACGTGT CGCTTACACG CCGTCTTCGC CGACTCTGCT CCAGTCGATT TGTGTCGAAA GTACTACTTG CACTCTCTCG GAGACTATTT CAAGATGGAG AGCGACTCGC AAAAGTTTAA AATCGGAAGC AGCGGTGGCT GGGCGTTGTG CGGCAAGGGC GATCGATGTC ACTCTCGACA CATTTGCGGT CACTGCGGTG AAGAAGGACG CGGCAAACAC AAGCCAGAGT GTAAACTGCA CTCTGTGTGC GGTGTGATTC CGGACGCGAA CAAGCAACCA TGCTTTTTAT ACTTCATGAA GTCCATGGTC GGCATTCGCG AGTTTGAAAA GGCCCTCCTC GGCGGTAGCG CGAAGGGATT TTGCTCGCAG AAGAAAGGCG AGTGCTCTGG ATACCACGTG TGCGGTTCTT GTGGGAAGGA AGACGTGTCG CCGTACAACG CGGAAGGCCA CGCCGCGTCG TGTCGCATGC GAACGATTTC CAACGAAATC GATCCGACCC TCAACCCGCG AACGAAACAA ATGCTCAAAG ATTACGAAGA AGAACTCGCA CGCGTTCGCG CGAGCGCCAA GTCGAAGGAA ACTACCGCGA CGCAAACCAT CGGTGACGAT GGCTCGAAGG CGGAATCGAA ATCCGATCAA GTCGACGATG ACGCTTTAAC GCCGATCGAC GTGAAACGAT GTCGACGCCT CCGATCGGAA ATTGACGCCA TCCTTCGAGG CATCGGCGAC GACATCGATT TGCTCGTCGA AGAAGCGGTG GAATGCGCGG GTAAGGAAAT GAAAAAGAGC GCGAATCCGA CGCACAAACT AGAACAAATT AGAGAGAACT TTCGAAAGAT GATGTGATGT AATATGTCGA TGATGACAAA ATATTGTA
|
Protein sequence | MGTTTARETG DAAKASEGKT AKALAAKPAA ESSSTAVELV ATANLGGKSK VLFEIEGAGE AVDLDGDTGA VGRWLAESSR ALKVDMKGVM YNARVVSSAG TVVVVAVNAD VAKIESVHRE FVQLREDPSA MGGMENFGVG SLFDQDEDVD DEDVAVGAKR KRAAETSKAS GASARRPATK LRERGAMPAP ADAKRVFKLA LGADGTKKQR YVVETRSAGG GRLGVILIGL KSPDRVLVSS SAARAALNAE EFVCVDANVE RELVNARAIR VVRVERGERV CVLDDGEATN ANVGAPANRG ETSTEGSPSG PPPAKMMGAT NQDVLGMLSG IMKGASPRIL RKDEAGGIIQ SEPPAPTRAA GQPTGQPTRT KSPMLTPRTT RGGTATRGPT STNGQTPPMG SASGPGASSL PLGTFGAASP SFFSPAFASP SEAAPARASG DDYIESMMSD FSAALLGGDT GDARVGANIP LGYTSPSVNS HQQLEHNVPS AGDRELSPFF SSVDASASGV LGGALEDGVD VAPHEDISGL RGGDADEIDL AELTAGRFEA DALGLDATPA ELTDLLNRSR ILRQATETLG DSALDDDSIK EEVSAPPTNS VFRDLQREPS AGEICTAFNL RGDKFGCNGC VSRHVCQLCE SPRHTFGMCP SLYANIHKRV ENKVCLDYYL NSVDDTGGAL RNGWDQEKHV WNLCPRSDKC MLEHVCGACG KSGNDAHLPT CRLHAVFADS APVDLCRKYY LHSLGDYFKM ESDSQKFKIG SSGGWALCGK GDRCHSRHIC GHCGEEGRGK HKPECKLHSV CGVIPDANKQ PCFLYFMKSM VGIREFEKAL LGGSAKGFCS QKKGECSGYH VCGSCGKEDV SPYNAEGHAA SCRMRTISNE IDPTLNPRTK QMLKDYEEEL ARVRASAKSK ETTATQTIGD DGSKAESKSD QVDDDALTPI DVKRCRRLRS EIDAILRGIG DDIDLLVEEA VECAGKEMKK SANPTHKLEQ IRENFRKMM
|
| |