Gene Information |
Locus tag | OSTLU_18196 |
Symbol | |
ID | 5005236 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 616376 |
End bp | 618909 |
Gene Length | 2534 bp |
Protein Length | 604 aa |
Translation table | |
GC content | 67% |
IMG OID | 640420657 |
Product | predicted protein |
Protein accession | XP_001421340 |
Protein GI | 145354117 |
COG category | |
COG ID | |
TIGRFAM ID | |
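The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names (`gene_length`, `min_cds_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Coding nucleotides for a protein of this length plus one stop codon."""
    return (protein_aa + 1) * 3


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values copied from the record above.
    print(gene_length(616376, 618909))  # 2534
    print(min_cds_length(604))          # 1815
```

Under these assumptions the span matches the listed gene length of 2534 bp, and the 1815 coding bases are fewer than the span, which is consistent with the gene being marked as spliced. Passing the full gene sequence to `gc_percent` would give a value to compare with the listed GC content.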
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGAG ACCGGTGCGT CGACGCGCTG CGAGCGGCGA GCTCGAGGGA GGCGGACGAC GAGGCGGACG CGTACTACGC CGCGCTGAGC CTGCACCGAG CGCTCGCGAC GCGGGCGAAT CGAGACGACG ACGAGGCGAT GGCGCGATGC GTCGCGACGT GCGAGGAGGT GGTAGAGAGG TACGCGCGAC GCCGCGGCGG GCGCGACGGC GCGATGCGGC GGGAGACGGT GGCGCTGGCG GCGTTGACGG TGGCGCGCGC GAGAGGACCG GCGGCGGCGG CGGAGACGCT GAGGGGCGGC GACGAGCGCG CGAGCGTGCT CGGCGCCGCC GTCGCGCGGT GGTTAGCGCT GCGATTGGTG GACGTGGACG CGGAGAGGGA GCTCACGGTG GCGGATTTGA CGCGATTGGC GGTTATTTTG TCAAAGGCGC GGGATTCGCG CGCGGCGACG GCGTTTTGCG AGTGCGGCGG CGTGTCGTCG TTGTGCGCGT TGATCGACGT CGAATCGTCC CCGCCGCTGA GCGCGGGGAC GATATTTGAT TGCGCCAAAG CCATCGCGGG TTTTTGCGCG GCGCTTTCGG ATGGTTCGAC GACGACGAGG TTCGACGTCG AAGGCGTGTC GCGCGAGTTG AGGGCGATTC TCTCGTCGGC TTCGGGTAAG ATGGCCGATC CCCTTCCCGC GGTCGCCGCC ACCGCGTGTC GCCAAGCGTT GGCGTCTTTG GAAAAGTACG CGGCGACCCG AGCGGGAGCG GTGTCCACGG TGGAGCGGCA GTCGTCGTAC GGGGCGCTCG GCGATTGCGT CACCGCGGCG AAGGAGCTCG ATTTCGCCGA CGCCTCGCGC TCGGGTTCGC ACGCCAGTCT ACACCGCGCG CAAAAATACG ACGGTTCGTT TGGATCGCAA ATCGATCTCT CGCGCACGCG CTCGGAATCG CTCAACTCCT TGGATGCCTC GTTTAGTTCG CCGACGAAGC TCATCCCCGC GGCGTCCGTT CCCGAAGCCT CCGACACCGC CACCGCGCTC TTCACCACCC CACCGCCGCT CGCGCGAAAA CGATCCGACT CCGGCACATC ACCTCTGCTC AAGTCCAAGT CCCCGGCGCG CGCGCGCTCG AAGAAGCGCG CCGTCGCGCT TCGCCGCTTC CTCTTCTTCG TCTTCGCCGT CGGCGTCGTC TTCGCCTTCG CCGACCGTCT CCGCGCCCAT CGCGCGTGAT TTCCTCCGTT CGTCCGTCCA TTCCGCCGTT CGTCGTCCGT TCGCTTCCCC GTTTCCTCGC CGCCGTCTGT TCCTTCGGGC GTCGACCGCG TCGATTCGCC GCCGAGTCGC GTCGCTCGAG TTAGTTAGTT AGTTGAAAAG ATCCTCCAGC GACTAAACCC CAACACTTCG GCGCCACATC ATCGCGCCGC GTCGACGCGC GCGATGCGCT GCGCCGCCGC GCCGCGCGAG ATGCGCGCGT GCGCGACCTC CTCGCGCGCG CGCGCGACGC GAGACCGCCG CGCGACGCCG CGAGCGGCGC GATCGCGCCG CCGCGCGGCA CCCAGGAAAA GCGCTCGATC GGCGCTCGCG CGCGCCGCGG ACGGCGCGCG CGACGACGAA ACGACGCACG AGTGGTGCGC GGGAGATGCG TCGTCGACGG ACGCGCGCGA AAGCGTCGAC GCGCGCGACG CGGGCGCGGA CTGACCTCGA CGCGTCGAGC ACTCGCGCGC AGCGTCGTCA TCGAAAACCG AGCGGCGAGC CTAGACGGCG ATAAGCGCAC GCTCGTCGTC GCGATCGAGG ATAAACAGAC GGAATCGAAC TTGCGTAAGA 
GCCCGGGGAG CGCGGTGTAC GGAAATGGGA ACGGGAAGGT GTGGACGGAG ATGTACGACG AGCCGGGGCA GTACGTCAGG GCGCGGTGCG GGTGCGGCGC GGAGACGCGC TTGCCGATAG CGCGATCGCC GTATCACGTG CGGTACGACT CGGCGAGGTT AGACAGCGCG AAGGTTGAAT TCTTGGTGGA TTCGAGTCAT CATCCGAACG CGCTGACGGG CGCGAAACCG GGCGACGTGT TTCACGTGAG CGAGCCGCGC GGCGTCGGCT TCTCCAACGT GTTGTTCGCG GAGCGATCGC TCGAGGCGGC GATGCGTAAA AACCATCCAT TGGTCCTTCT CGCGAACGGT ACCGACGGTC TCGCGAGCGT GCGCTCGCTA TTAGATTGGC AACCAGTGAT GGCGTACGCG GACGCGCATC CTGTGACGTT ATTTTACCTG TGCGAGAGTC AAGAGAGCGC CGCGCTCTTG TCCATCCACG ACGAGTGGCG GGAGGAGGGC TTCAAAATCA TTCCGTGCTA CGGCGCTTTG GACGACCAAC TCTTCTTGAT GGAGCAGTGT TTCCTCACCG GCGCCGTCGC GGCGGGTGGC AAGCCAACCA TTCTCGGCGC CGACCCCGCG GCGTGTTCCG TCTTGCTCGC CGGCGCCGAG GGCGACGTCG CCGGGAGCAT CTTGAAGCTC CTGAACGCCC GAGGTATCGC GCGCGACAAC ATCTTGACGA GTGACTTTTT TTAA
|
Protein sequence | MTRDRCVDAL RAASSREADD EADAYYAALS LHRALATRAN RDDDEAMARC VATCEEVVER YARRRGGRDG AMRRETVALA ALTVARARGP AAAAETLRGG DERASVLGAA VARWLALRLV DVDAERELTV ADLTRLAVIL SKARDSRAAT AFCECGGVSS LCALIDVESS PPLSAGTIFD CAKAIAGFCA ALSDGSTTTR FDVEGVSREL RAILSSASGK MADPLPAVAA TACRQALASL EKYAATRAGA VSTVERQSSY GALGDCVTAA KELDFADASR SGSHASLHRA QKYDGSFGSQ IDLSRTRSES LNSLDASFSS PTKLIPAAVV IENRAASLDG DKRTLVVAIE DKQTESNLRK SPGSAVYGNG NGKVWTEMYD EPGQYVRARC GCGAETRLPI ARSPYHVRYD SARLDSAKVE FLVDSSHHPN ALTGAKPGDV FHVSEPRGVG FSNVLFAERS LEAAMRKNHP LVLLANGTDG LASVRSLLDW QPVMAYADAH PVTLFYLCES QESAALLSIH DEWREEGFKI IPCYGALDDQ LFLMEQCFLT GAVAAGGKPT ILGADPAACS VLLAGAEGDV AGSILKLLNA RGIARDNILT SDFF
|