Gene Information |
Locus tag | OSTLU_16663 |
Symbol | |
ID | 5003642 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 89074 |
End bp | 92392 |
Gene Length | 3319 bp |
Protein Length | 1048 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419063 |
Product | predicted protein |
Protein accession | XP_001419704 |
Protein GI | 145350630 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.207195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGC TGTCGAGCGT CGCGCGCGCG CGCGCGTCGG GAACGACGGT GCGCACGCGC CGTCGCTCGC GGCGCGCGCG CGGGACGCGA AGGGCGACGA ACGAGGACGA CGCGGACGCT GGGTTCGACG TGGAAATCGC GGAAAAACCG ACGCACTGGC GCGTCGTCGA TGGCTTCGTG AGCGATTTCG CGAACGATTT GCGCGCCGTG TACGATGGAC GCACGCGCGA TCCGCTGCGC GTGGATCCGG AGCGGTTCGT GTGGGATTAC TGGCACGTGC CGAATCAGTA CACGCTGCTG CGAACGCCCG CGGAGGACTA TTTCGGCGCC GAGCGGTACG GCGAGTTGGA AAAGGCGCTG TTGGAGTACG GACGGAGAGA GTTGGGATGC TCGGCGATCA CGCCGGTGTG GATGTCGTGT TACGTGCACG GGATGCGGCA AGAGTTGCAC GCCGATGTGC CGCACGGACC GTGGGCCTTC GTGCTCAGTT TGACAAAGAA CGACGCGGAG GATGGGTCGT ACGGCGCACA TTTTAGGGGC GGTGAAACGC AAATAATGCG CCCCGAGCGA TTGAATTATT GGAAGAATTT TGACGCCGGC GAGGTCGTCG AGCGGTCGCA AATCATGGAG ACCATCGCGC CGACGTTTGG GCGTCTCGTG GCGTTCGACC CGCGCCTACC GCACGGCGTC ACTGAAGTTT TCGGGACTCA AGATCCGAGG CACGGAAGGT TGGTACTACA CGGATGGTTT AAAGACCCCG AACCGTCGTT TTCTGGCGCA TTGACGGAGG AGGACGCGTC GCCGACGCTC GAGGGCGCGC TCGGGGAACT GTACGGACGT CTCGTCGAGC TTCCCCGAGC GAGCGGCATG GTTTGCGCCG CGCTCACCGT AGCTCCAGAT GGTGCGGTGA CCAATCTACG ATGGACGTGC GATACCCTGA CGCCGATTCC TTTGCCCGAG CTCCCGAGTG AGACGGACAT TCGCGACGCC ATCATGCTCG ACATCGCGAG CGCGCTGTTG GAGTTGAAAT TTCCGGCTCG CGATGCCGAG TCCGTCATCA CCTTACCGTT TTGTTTCGAA TGAGCGTCAC AGACGCACTT TTTTCACGAT CGCACGTAGA TTTTAGGCGA TAGTTTTGTA TCACAAACGT AGAGTAGCCA CTCGCCGCCG AGCGTCCAAC CGCCGAGGCA CAGCGCGCCT CCCGCGCCGA CTCCATGACA GCACAGAAAT GACCCCGACC GCGTCCGACG GCGCCGACGA TGATATTGTC GACGTGCACG CCTTCAAAAC CCCTCGCGGA GTCGTGCGCG TGCGCACGCG GCGTCCGAAA CTCTTTCCAG AAGACACTTC AACCGCCGAG TGCGACGAGA CTGGACGCGT CGCCTGGCGC GCTCTCCCAG CGCTCTGCGC GTATTTAGCC AGCGACGAAG GATACTCGAT CGTGGTCGAG CGCTCGCGCG TGCTCGAGCT CGGCGCGGGG CTGGGGACGC CTGGGATGCT GTGCTGGCTG AGCGGCGCCG CGCGCGAGAC GACGCTGACG GACGGAAACG CCGACGTCGC GAGCGATCTA CGGCGATCGA TCGAGATGAA TCGAAACTTT GTAGATGACA ATGTATTGAC GTCGCTCGGG AGCGCGACGG CGGGGACGCT GGAGTGGGGG CGAGGGGACG CAGATGATTT CGCGAGAAAA ACGTTTCCAC TCGTCGTCGC GAGCGACGTG GTGTACAGCG AGGCATCGGC GCGCGATGTG TTAGACGTCG TGCACACAAA GCTCGAGCGT GATAATGGGA TGCTTTTACT 
CGCGTACGTG TCGCGTTGGG CGCACGTGGA TCGGGCGCTG TACGAGGCAA TCGTAGCCGG TGGGTGGACC GCAACGCTGA TTCCCGTGGC GGATTTCGAC GAGCTCGCTC GCAAAGAATC GTTGGAAGGT CGACCTTGTT TGTTCGAAAT CACACGCGGT GGGTGCGGCG CGCGCGGGCA AATTCGAGCA AGAATGCCGA AACGCGCTCG TGAGTGGTTT GATGCTGAAT TAAACGTGCT CAAGATCACA CCCGCGGACG CGTTTACGGA GCGCTTTGCT GATGACTTGT GTTCAACACT CGAGACGCAC GGAGGCGCGA TCGAGAGTCT GGAGATTAAC GCGAAAGGGC CGTTTAGAAT ACATCAAGAC ACGCTTCGTT GTTTGCTCGA CGTGTTTGCG AAACACGCGT CACGAGTGAA GCTAATTAGG TTGAAGTTGT GCGAAACTTG GCTCGATGTC GACGGATGGC GCGCCTTGGG AGATTTTCTC TGCGAATCGG ACGTGCGCGA TTTGGAGATT ATCGGTGAGG ATGTCGACGA GGAAATTTTG CACGCGATGA CGAATTCGAG TGGAAACGTT GCAAACTCGT GGGTCAGTCG ATTGAAGAGC TTAAAGTTCA GCCGTTGCGA CCGATTGAAC GCTCAAGCCA TTGTCGCGCT TCGACAGTCG TGGTTTCCTC CGCCCGCTCT CGCACGCGAG TTGGAATGTT TCGAAGTAAG CTTCTGCCCA ATCGGCGACG AAGGCATAAA ATCAATCTGT GACATCGAAT TTGTTGGCTT GCGCGAGCTT CGTCTGGCGA ACGTTGGCGT GTCCGCGATT GGCGGCGCCG ACGTCGCTTC AAGACTCGTG GCGCGGTGCG AACAATTACG AAACCTCGAT CTGAGCGGCA ACGATTGTTT TGACGCCAAC GGCGCGGCCG AGCTTTCTAC ATATCTCGAA ACCATAGGGA AATCACTCGT GATCTTGGAT CTGAGTGGTT GTTCTCTAGG AGATAACGGA TTGATTTGGT TGTGCGAAGC GCCTCGAGGG CTGAAAACGC TGCGTCGCCT CGAGACTCTT CGTCTTGGTT CGAACGGAAT CGGCGACTCG GCGATGCCAG CGCTGGCCAA TCTCTTCCAA AATGGTCACT TTCCTATGCT AAAGTGTTGC GATCTGTCCA TGAACGTCAT CTCATGGCGC GGCACGTACG ATTTTACCGA AGCCTTCGAC GTCGCCGCCG CCGCCGCCGG CGCGTCGCCT ACTCCCCTCG AGATCCTCAG CCTCCGCGGA AACCACATCG GCGATGACGG CATCGACGCC ATCACCGATA TTCTCCCCAA TCTCCCTCAC TTGCACACGC TCGACGTCTC CGACTGTGAC CTCACCGCCG TCGCCGTCCG CCGTCTCGCG GAATCGTCCT CGTCTCGCAC GTTCGCCGCG ACGCTCTCCC GCAATCCCGG CATCGACCGA ACCGCCGTGG CCGCTCTCGC GCGCGAATTC GATGATTTAG CTTTTGCTCG CGGACTAGAT GGATTTTAA
|
Protein sequence | MSALSSVARA RASGTTVRTR RRSRRARGTR RATNEDDADA GFDVEIAEKP THWRVVDGFV SDFANDLRAV YDGRTRDPLR VDPERFVWDY WHVPNQYTLL RTPAEDYFGA ERYGELEKAL LEYGRRELGC SAITPVWMSC YVHGMRQELH ADVPHGPWAF VLSLTKNDAE DGSYGAHFRG GETQIMRPER LNYWKNFDAG EVVERSQIME TIAPTFGRLV AFDPRLPHGV TEVFGTQDPR HGRLVLHGWF KDPEPSFSGA LTEEDASPTL EGALGELYGR LVELPRASGM VCAALTVAPD GAVTNLRWTC DTLTPIPLPE LPSETDIRDA IMLDIASALL ELKFPARDAD TEMTPTASDG ADDDIVDVHA FKTPRGVVRV RTRRPKLFPE DTSTAECDET GRVAWRALPA LCAYLASDEG YSIVVERSRV LELGAGLGTP GMLCWLSGAA RETTLTDGNA DVASDLRRSI EMNRNFVDDN VLTSLGSATA GTLEWGRGDA DDFARKTFPL VVASDVVYSE ASARDVLDVV HTKLERDNGM LLLAYVSRWA HVDRALYEAI VAGGWTATLI PVADFDELAR KESLEGRPCL FEITRGGCGA RGQIRARMPK RAREWFDAEL NVLKITPADA FTERFADDLC STLETHGGAI ESLEINAKGP FRIHQDTLRC LLDVFAKHAS RVKLIRLKLC ETWLDVDGWR ALGDFLCESD VRDLEIIGED VDEEILHAMT NSSGNVANSW VSRLKSLKFS RCDRLNAQAI VALRQSWFPP PALARELECF EVSFCPIGDE GIKSICDIEF VGLRELRLAN VGVSAIGGAD VASRLVARCE QLRNLDLSGN DCFDANGAAE LSTYLETIGK SLVILDLSGC SLGDNGLIWL CEAPRGLKTL RRLETLRLGS NGIGDSAMPA LANLFQNGHF PMLKCCDLSM NVISWRGTYD FTEAFDVAAA AAGASPTPLE ILSLRGNHIG DDGIDAITDI LPNLPHLHTL DVSDCDLTAV AVRRLAESSS SRTFAATLSR NPGIDRTAVA ALAREFDDLA FARGLDGF
|
| |
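The coordinate and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names `clean_sequence`, `gc_fraction`, and `span_length` are hypothetical, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that the spaces in the sequence fields are only display grouping.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp copied from the record; the result matches the listed Gene Length.
assert span_length(89074, 92392) == 3319

# Illustrative fragment only: the first three blocks of the gene sequence.
fragment = clean_sequence("ATGTCGGCGC TGTCGAGCGT CGCGCGCGCG")
print(len(fragment), round(gc_fraction(fragment), 2))
```

To compare against the listed GC content, pass the full Gene sequence field to `clean_sequence` and then to `gc_fraction`. Because the record marks the gene as spliced, the gene length is not expected to equal three times the protein length.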