Gene Information |
Locus tag | OSTLU_14946 |
Symbol | |
ID | 5001218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 241541 |
End bp | 243544 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | |
GC content | 55% |
IMG OID | 640416639 |
Product | predicted protein |
Protein accession | XP_001417186 |
Protein GI | 145345370 |
COG category | |
COG ID | |
TIGRFAM ID | |
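The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose sequence ends with a stop codon (both consistent with this record, but not stated by it). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon which does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(241541, 243544)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values listed for OSTLU_14946, this reproduces the Gene Length and Protein Length fields of the record.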
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.000739979 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCAGACG CCCTTGACGG CGCTGACCGC GACGCCAGCT CGACGAGTGA TCGTCCCGCG GCTTCTTCCC TGGATACCGT GCTCGAACTC GAGCGTTTCG TCGATGTCGA CGACGACCGC GCGGTGTTGG CAGACGATTA TTCCAACGAC GTCCCCGGGT CTTCCGAGCT CGATTTGACG TCTAAATACT GCCGAGACTG CCACGTCGCG AGTCAACGGT TGTATCACGT CGCTGCACTG TCGCGCGTGT GCGAACGGAC GCTAGAAGAC GACGTTGGGG CGTTCCCAAT CGATTTAATC GCAGAGCTGG CCGAGGATGA TGACGACGAT GTGCGCCGAA GCGTGGCCGA GCAACTCGAC CGATTCGCAG CGTGTGTGGG GAGGTTGAGC GAAAGATTAG ATCATGTATG CGCGATTAAA CTGCTCGGAG TGACTTTTAT GTTGATAGAG GATGATCGCG AGGAGGTTGT CAGCGCCGCA GAACAAACGT GCGGCGCAGT GGCTTCGTTG CTGTCACCCG AGAAGCAACG CAGTTTGTTG CTTCCAACGT TGAAATCGTT CTCGGAGAGC GATGAAGAAG AAATTAGAAT GAGTGCGGCG AAGGTGCTGG GGAGATTAGC AGCTGTTGTC GGCGTAGAGA TGACGAAAAG TGATTTGCTG CCGAGTCTGT TAAATTTGGC GAGTGACGCA GAATATAGAG TTAGAGAAGC CGTCGCCTCC GCGCTGAGCG ACACATTTGA GATTTTAGAT GCTAACGACA CGTTGGAATC GACTTTGCCG ACGTTTGCGC GCTTGTCCAG GGACAGTGTA TGGGCAGTCC GAGCCACATG CGCGAAGCAC GTCGTGCAAC TCGTCAGAGC GGTGCCGACG GACAGAATGC TCGACGTTGC TTCGGAGACT TTCGAGCCTT TGGCCAATGA CGTCAGCTTC AAAGTGCGTA CAGCAGCGTT GGAGCAGCTC GGGCAACTTA TTTTCGCGTT GAGCTCTGTT GAGGTGCCGA CTATCTTTGT GGACTACTTC ACGAGCATGG CGGAGAGTTC GACAAGCAGC AGCGCGCTAC AAGAAACTTG CGCATACAAC CTACCAGGCG TGGTGCTCTC GTTGACATCA GCGAGATGGA CAGAATTACG CCCGGCGTTT CGATTGTTAG CCGCAAGCTT GAATTGGAGA GTCCGTCGAA CGTTGGGTTG TTCGCTTCAC GAGATAGCGA CCATCATCGG TCGAGACAAC GCAGAGAAAG ATTTACTACC AGTGTTGGAG AGTTTTCTAG AAGACACTGA CGAAGTCAAG ATCGGTGTGA TTGAACATTT GAGTGAAATC TTCGCCGTCA TCGGTTCGGC GTCTCGGCTT TCGCTGATCC GACTTTTGTC GACCTTTGAG TCGGAGGATG CCGAAAAGAT CGGGAATTGG AGGATTCGTT TTGCGTTGGC CAAACAGATT CTCCCCGTGG CCGAATTGTT GTCTGGTGCG GCTACTGCTG AGTTTATCGT GCCAATCTTG CTCGCGTATT TAGATGACAC CGCGGCCGTT GTGCGCGAGA AAACCGTCGA GATCGCTGGC AGAGTTCTCT TCAACGCGGC GAGGGAATTG TCGTGGTACT CGGAAACCAT CGTGAACGCG ATGTCTTCCA TTAAACTACT CGCCACGAGT CACAGATGGT CCGACCGCAG AGCATACATC AACATTTGCC TCGCGCTCGC GAGAGAAGTC GAGCAACGAT TCGTGGTAGA TGAATTGTTA CCGCTGTTGG TCATGCTCGC CGAGGATTCC GTCGCGGCGG TTCGAGTGGC TCTGAGTCAC 
TTCTTAACCG AAGTTCTTCG ACTCGATCCC ACATACATTT CTCTGCCGGA TATCAGCGCG GCGATCATCA TACTGAAGGC TGATAGCGAC CCCACGGTAG TCGCTGCGGC GCAAAGCATG AAGGCCGAAG CCGAGGCGAA GGTGAAGGAC GACGCTGCGT TCACGTCCGC GCGAGACGTG CGTTTTTCCC CGACAGGCTT GTGA
Protein sequence | MSDALDGADR DASSTSDRPA ASSLDTVLEL ERFVDVDDDR AVLADDYSND VPGSSELDLT SKYCRDCHVA SQRLYHVAAL SRVCERTLED DVGAFPIDLI AELAEDDDDD VRRSVAEQLD RFAACVGRLS ERLDHVCAIK LLGVTFMLIE DDREEVVSAA EQTCGAVASL LSPEKQRSLL LPTLKSFSES DEEEIRMSAA KVLGRLAAVV GVEMTKSDLL PSLLNLASDA EYRVREAVAS ALSDTFEILD ANDTLESTLP TFARLSRDSV WAVRATCAKH VVQLVRAVPT DRMLDVASET FEPLANDVSF KVRTAALEQL GQLIFALSSV EVPTIFVDYF TSMAESSTSS SALQETCAYN LPGVVLSLTS ARWTELRPAF RLLAASLNWR VRRTLGCSLH EIATIIGRDN AEKDLLPVLE SFLEDTDEVK IGVIEHLSEI FAVIGSASRL SLIRLLSTFE SEDAEKIGNW RIRFALAKQI LPVAELLSGA ATAEFIVPIL LAYLDDTAAV VREKTVEIAG RVLFNAAREL SWYSETIVNA MSSIKLLATS HRWSDRRAYI NICLALAREV EQRFVVDELL PLLVMLAEDS VAAVRVALSH FLTEVLRLDP TYISLPDISA AIIILKADSD PTVVAAAQSM KAEAEAKVKD DAAFTSARDV RFSPTGL
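The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard genetic code, since the Translation table field in this record is empty. It strips the whitespace that separates the 10-character blocks and stops at the first stop codon. The names `CODON_TABLE`, `clean`, and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (assumption: the record does not state a translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from this record.
print(translate("ATGTCAGACG CCCTTGACGG"))
```

Applied to the first two blocks of the gene sequence, this yields the first six residues of the listed protein. Running it on the full gene sequence and comparing the result with `clean()` of the protein sequence would confirm the two fields agree under the assumed code.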