Gene Information |
Locus tag | OSTLU_23918 |
Symbol | |
ID | 4999523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 654964 |
End bp | 658082 |
Gene Length | 3119 bp |
Protein Length | 1000 aa |
Translation table | |
GC content | 68% |
IMG OID | 640414944 |
Product | predicted protein |
Protein accession | XP_001415560 |
Protein GI | 145340910 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.160353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTCG GTGGATTCGG TGGGTTCGGC GCGGCGCAGC AGCCGCCGTC GCCGTTCGGC GCGGCGCAGC AGCCGCACAG CGGCGCGTTC GGCGCGACGG GGGCTCGACC GGCGACGACG GGCGCGTTCG GCGGTGGATT CGGGGGGCAG GGCGCGGCGA GCGGGGGGTT CGGGAGCGCG ATCGGCGGCG CTGGGAGCGG CGCGCCGGGC GCGGCGCGGG CGGTGGCGGA CGCGTGCAAG TTTTGGAACC GCGGGAGCTG TAAATTCGGG GCGCGGTGTA ACAATAAGCA CTGCTGTTCA AAGTGCGGAT CGACGGCGCA TCGCGCGGTG AGCTGCTCGA TGACGCCGGG GGTGAGCGGA TATCGAGCGA CGCGGGTGAT GGATAAGGAG CTGTTGACGC CGGGGACGTC GTACGCGATG GTGCACTCGA TCAGCGCGAT GCCGGAGAAT TTGGGGAAGA GCGCGGAGGA GATTCGGGCG GCGGCGTACG CGAGCGCGGA TCAGCCGGCG ACGAGCGCGG GTGGGTTCGG GGCGCCGGCG GCGACGAGTC CGTTCGGGGG GACGACGGGG GGAGGAAGCG CGTTCGGGGG GGCGAGCGGG GGAGCGTTCG GAGCGAGCGC GACGCCGGCG AGTCCGTTTG GGGCGCCGAG CGGGGGTGCC TTCGGAGCGT CGACGTCGAC GCCGGGCGGT TTCGGGGCGT CAGCCGCGCC GAGCGCTTTC GGCGCACCGT CGGGAGGCGG AGCGTTCGGA TCATCGCCCA CGGGTGGGTT CGGGGCGCCC GCGGCGGCGC CGAGCCCGTT TGGCGGCGCC GCGACGCCGT CCGCTTTCGG AGCGCCCGCG AGTTCGGCGC CGAGCGGCGG TTTGTTCGGG TCCACGACAG GCGGTTTCGG CGCTTCTCCG GCGTCTTCCG CGTTCGGGGC GCCGTCGACC ACGAGCGCGT TCGGGGCGAG CGCGCCGACG CCGGGTGCTT TCGGCGCCAC GCCGTCGGCG AGTCCGTTTG GGGCGGCGCC TTCGACGCCG GGCGCGTTTG GGGCGCCGGC TTCGACGCCC GCTTTCGGCG CATCGGGCGC GTTTGGCGCC GCACCGACGC CGAGTGCGTT CGGAGCGCCG TCGTCTACGC CCGCGTTCGG TGCAGCGCCC GCGTCTAGCC CTTTCGGCGC CGCGCCCGCG GCGGCGAGTC CGTTCGGCGC GGCGCCGTCG ACGCCGGCAT TCGGTGCGGC GCCGACACCG GGCGCGTTCG GCGCGGCGCC ATCATCCGGT GGCGGACTGT TCGGCGCGGC GCCTTCGACG GGTGGTGGTT TGTTCGGTGC GTCTGCGCCT TCCACTCCAG GCGCGTTCGG CGCGAGCACG CCCGCGCCCG GAGGATTTGG CGCTCCAAAG CCCGCGGGCG GGCTGTTCGG CGCTGCGCCG TCGACGCCAG CGACTGGGGG GCTGTTTGGT GCTAGTACAG GTGCAACCAC TCCTGGCTTC GCCGGTTCAA CGCCAGGGTT CGGCGCAGCA CCGTCCACGG GTGGTTTGTT TGGCGCCTCG GCTCCGGCAA CCGGAGGCGG CGGTTTGTTT GGCGCCGCGC CGTCTGCGGC GGCGACGCCT GCGTTCGGTG GATTTGGCGC GTCTGCGGCG ACACCCGCGT TTGGTGCATC TTCTGCGGCG TCCGGTGGTT TGTTCGGTGC ATCCGCGCCG ACTTCTGCGC CAAGCGGTGG ACTGTTTGGC GCCGCGCCGA CGTCTTCGCC CGGCGGCGGT TTGTTCGGCG CTTCGGCACC GGCGACCGGC GGTGCCTTCG GTGGCGGACT TTTCGGTGCC 
GCGAAGCCCG CGAGCGGTGG ACTTTTCGGC GCCGCCCCGA CGACCGGAGC TGCTCCTGGT GGCGGACTGT TTGGTGCGTC GGCTCCGACG TCGGCGCCGA GCGGCGGTTT GTTTGGCGCG GCTCCGACGG CGGGTGGTGG ATTGTTTGGC GCTTCCGCAC CCGCCGCTGG AGGTAGCATT TTCGGCGCAG GGGGGCTTGG AGCCTCGCAG CCGGCGATGG GTGCGCTCGT ACCTTTCGGC GCTCAGCCTG CCGTTGCGTC CACTCCGTAC GGGTCACCAC TTCAGCCTCC GCAGCTCGCG CTGGCGACGA CGGACCAAGC CCAAGCGAAG AGATCTTTGC TCGGTAAGAT GCAAAGCCCA GCCGCTGGTG CCGCGGCGAC GGCATTCCCA GTGCGTTCGC TGACGCCACG CTCTCCTTGG CTCACGAGCC GCGGTGGGTT AAGCCGCGAA CCGAAGCCGC GTGGCGGCGC GTCTTTGCCG TCTCCTGGAG GCGCGGCACA AGCCATCGGA ACACCCACTT CGGCTGCTCG CCCTGGGAGC GCGGTGTCCG TGGCCACGCC ATCGAGTGGT GGGTGGTTGT TCAAGCCACG CGAGAATCCT CGCGCGCTCT TCATTCGCCC AGAGGGTGTT TCTGCTTCGC CAGCGCCGTC AGTCGCGCTC GCGGTTTCGC CGACGCGCGC AGATCAAAGC CCGGAACGTC GTCAAGTCCA TTTCGCAGAG GACAAAGAAA ACACTCCGGG TAAAGGCGCA CCGGTTAGTC ACAATATCAT CATGCCGAAG CTCACTTCTG AAGGCTACAG CATGAACCCG ACGCTTGAAC AAATGGAAAG AATGTTTGAG CGCAGCGGCG ACGACGCCCT CGCGGCGGTG GACAACTTTT CTGTGTCGAA CGAAGCTTTC GGTCGCGTAA AGTGGCTCGA GCCCGTCGAT GTTCGTGGCT TGGATCTCGA TAAAATCGTG TCTTTCGAGC AGGCGTGCTT GTGCCTGTAC CCCGAAGATC AAGGTATCGA GCCGCCGGAA GAGGGTGAAG GTTTGAAGAA GCGCGCCGAG GTAACGCTGT ACGGAATCTT GCCGAAGAAG TCTGGCACCG CCGCCAAGGA AAAGTATCGC GAGAAGATTG TGAAACAAAC CGAAAAAGCG GGCGCCGAAC TCGTAGAATA CAACCCAGAT ACAGGGATTT GGAAATTCAT CCTGCAACTT TGATGAACGG GATGAACGAT TTGACACGTC GCGAAAAACG GACCGCGCGA AGGAAAGATC CTTCCTAAGC CACCACGGTG AGCGACAAAT AAGCACAACA TCATTTAGTA GGTTTTGTA
|
Protein sequence | MAFGGFGGFG AAQQPPSPFG AAQQPHSGAF GATGARPATT GAFGGGFGGQ GAASGGFGSA IGGAGSGAPG AARAVADACK FWNRGSCKFG ARCNNKHCCS KCGSTAHRAV SCSMTPGVSG YRATRVMDKE LLTPGTSYAM VHSISAMPEN LGKSAEEIRA AAYASADQPA TSAGGFGAPA ATSPFGGTTG GGSAFGGASG GAFGASATPA SPFGAPSGGA FGASTSTPGG FGASAAPSAF GAPSGGGAFG SSPTGGFGAP AAAPSPFGGA ATPSAFGAPA SSAPSGGLFG STTGGFGASP ASSAFGAPST TSAFGASAPT PGAFGATPSA SPFGAAPSTP GAFGAPASTP AFGASGAFGA APTPSAFGAP SSTPAFGAAP ASSPFGAAPA AASPFGAAPS TPAFGAAPTP GAFGAAPSSG GGLFGAAPST GGGLFGASAP STPGAFGAST PAPGGFGAPK PAGGLFGAAP STPATGGLFG ASTGATTPGF AGSTPGFGAA PSTGGLFGAS APATGGGGLF GAAPSAAATP AFGGFGASAA TPAFGASSAA SGGLFGASAP TSAPSGGLFG AAPTSSPGGG LFGASAPATG GAFGGGLFGA AKPASGGLFG AAPTTGAAPG GGLFGASAPT SAPSGGLFGA APTAGGGLFG ASAPAAGGSI FGAGGLGASQ PAMGALVPFG AQPAVASTPY GSPLQPPQLA LATTDQAQAK RSLLGKMQSP AAGAAATAFP VRSLTPRSPW LTSRGGLSRE PKPRGGASLP SPGGAAQAIG TPTSAARPGS AVSVATPSSG GWLFKPRENP RALFIRPEGV SASPAPSVAL AVSPTRADQS PERRQVHFAE DKENTPGKGA PVSHNIIMPK LTSEGYSMNP TLEQMERMFE RSGDDALAAV DNFSVSNEAF GRVKWLEPVD VRGLDLDKIV SFEQACLCLY PEDQGIEPPE EGEGLKKRAE VTLYGILPKK SGTAAKEKYR EKIVKQTEKA GAELVEYNPD TGIWKFILQL