Gene Information |
Locus tag | OSTLU_24632 |
Symbol | |
ID | 5002237 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 55604 |
End bp | 58563 |
Gene Length | 2960 bp |
Protein Length | 539 aa |
Translation table | |
GC content | 65% |
IMG OID | 640417658 |
Product | predicted protein |
Protein accession | XP_001418360 |
Protein GI | 145347822 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.272058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.135769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCGACGACGA GACGGCCAAG TTTCACGATT ACACCGTTCG GTCGGCGGGA ATGATCTTCA CGCCGGACGC GCATCAACTC ACGACACCGC GCTCTCGGTT GATCCATGGT AGACGGACGA ACGTGCGCAA CGTCGAGCTC GGTAAGAAGT ATCCCGCGCG CGACGCTTTT CGCTTCTCTT CGTCGTCCCC GGACGAAGCC GTCGCGATCC TCGCCGAAGT CATGCGCGCC GCGACGCCGC AAGTCATGAC CAAGGCGCAA ACCTAAACGC TAAAATGTAG TCACTAAAAG AGAAACATAC GCGTTCGAGC GACTCATGCG TCGTAAACCA CGATGGCGCG CGCCGCGCCG CCGCCGCGTT CGCGCACGAT GGTCGTCGTC GCGGGCGCGT CGACGCCGCT CCATCGAGCG AGCGTCGACT CGTCGATGGC GTGTTGCTTG TGCGAGCACG TCGATCCGCA CTCCGGCCAT CTGTACGGCA CCGACACGAC GGACACACGG CACAGCTCGA ACATGCGACG CACGAAATCG CTCGCGGCGT CGAAATCCAA GTGTTCCAAG ACTTGCATGC AGAGACACAA ATCGTACGAC GGCGCGAGGC TCGCGTTGGC GAAATCCCCC ACGACGTACC GCACGTCGTC CGTCGCGTCG ATCGAGTCGT CGCCCGCCGT CGTCTTGTAC CCGGCGTAAT ACGGTGCGAC GCACGTCTTA TCGCGAATCC AAGAGAGGTG TCGCGCGAAC GCCGGTTTCA CGCACCCGAT TTCGAGCGCG CTCTGCGCGC GCGGCGACGA CGACAACGCG GCCCCGAGCG CGGCGTCGTA ATATCGCAGT CCCTCGCGCG CCGTTCCCAC CGGCCCGAGC GCGCCGAACG CGAACGGCAC CCGCGCGCGC TCGACGACGT CCACCGTTTC GCGCCTCGAA CACCGGACGA ACGCCGCCGC CGCGACGCAC GCCGCCACCG CCGCCGCCGC CGCGCGCCGC ATCGCCGCCG CGCGCGCGCC CGCGCGACGG AATATTTTCT CCAGCGCGCG CGCCTCGCGT TTCGTCCTCT CGCGCCCGCG CGCGCGCGGG ACCGCGCCTC GAAACGTGAA AAATGTGAAA CGTCACAGTG CCCAGCGGTC GCCGAGGCGT CGGTCACACG ATCGCGCGTC GACCGTGGAT TCGCGTCGGT CCTCGCCGCG CGCGCGTCGA CGCTCGAGGA TGCTCAAGAC GATGTTCCTG AACGATAAGA GCGACCGCTT GGCGTACAAA ATCGTCGATC ACGTGCGTGC GCGACGCTGT GACGAATTCT GATTCTGTCG GAGACGACGC GACGACTGAC GACGACGACG ACGCGCGACG CGACGCAGGC GTGCAAATCG GGCGCGAGAC GAACGGTCGA TGACGCGCTG GAGATTAATC TGCGGCTGTG CGACTGCGTG AACGATGATT TCGTCGCGCA CGGGAAGGAT TGCGTCAAGG CGCTGCGGGC AAAGCTGACG GCGCCGACGA AGGGACGGGC GGTGATGGAC GCGGACGCGA CGCTCAAGGC GCTGTTCGCG CTGGAGATGT GCATGAAGAA CTGCGGAGGA CGGTTTCACG CGATGGCGGT GGCGAAGGAG GTGCCGGAGA CGATGGTGCG GTTGTGCGAG AGAGCGCCGA ATTTGGAGGT TCGCGACAAG ACCTTGGCGT TGGTTCACGA GTGGGCGGTA AATTTGAGGC GAGAGCCCGC GTTCGCGGGG GCGTTTCATC AGTTGCGCGC GCGTGGGTTT CAGTTTCCCG AGGTTGAGCG ACGGTCGGTG GCGCGGGGGG CGACGTCGTC 
GGCGGCTCGC GAGGACTGGA ACGGCGGCGG CGACTGGTCG AAGAGAGAAG ATATCTCCGA GGAGGATCGA GCCGCGATCG CGCGCGCGCT CGAGGAAGAC GAAGAAGAAG CAATGACACC GGTGCAGCGT CAAAGACACG GGATTTACCC TGGGAACATC GCCATGGGTA CGCCCGTGCG AGCGCCGCAT CTGAGCTCGC CGCACGCCTC GGGACGACTC GCGGACCGCG CTGAGAGCGA ATTACAGAGA GCGCTTCGCG AAAGCGAGCG CACAGCCGCT GCCGAGGCAT CTCGCGCCAC GACATCCGTG TCGACGACGT ACAATTCGCA AGACGTAGAA AAATTGAAGG GCGACATCGC CGTCGCGACG AATTCGCTCA AGGTGTTCAC GCAAGTCCTC GACGGCTGCG TCGCCCTTCG CCCGCCGTCT CCGTCTAGTT TAGCGAACGA GCTCTCGGAG CAGTGCCGCG CGATGCAGCC ACGTCTGATC GAGCTTATCT CGAACGCCGA AGACGAGGGC TTGCTCGCGA GCGCCATACA TTTGAACGAC GAGTTGACCA AGGAGATGGA ACGATACGAT TTGCTCGTGA AAGCCGCGGC GGGCGACGTC GCATCGCGGG CTCGACTCGC CGCGCCAGTG GCCGCGACGC GAACGCACAC GGCGGAAAGT TTACTGAGCG AACTCGACGA CGTCATGCGA GCGTCCACCG CCGCCACGAC GACGAGCCAA AGCTCCGCGA ACCCTTTCGG CGACGTCTTG CCGACGGCTT CCATCGCCGG ACCGTCGTCG TCGTCGTCCA ATCCATTTGC CGCCGGTCCC ACCCCGCCGC GTCCGATCGT TCACTTCCCG TCGACGACGT CACCGTCGCC GCCTCCGATG CAGACCAACC CGATGTTCGC CGACGAGCGG TATCGACCGG CCAGACCGCT CCCGGAGGGC ATGCGACCGA TGAATCCCGC CGCGTTCGCG CGCGACTCCG TCTCGCCGCC GACGCCCTCT CGCGCCGCGC CCTCGCGATC GCCCGCGAGC GATCCTTTCG CAGATTTGAG CCAAGCCGCC GCCTCGCGCG TGCGCTCGCC GTCGCGCAAC CTTCCGCACG TTTAGCCCGT TCGCCGCCGA CGCCTCGGCG CGTTTGAAAA AACTGTTGCA AATCCATCGA
|
Protein sequence | MLKTMFLNDK SDRLAYKIVD HACKSGARRT VDDALEINLR LCDCVNDDFV AHGKDCVKAL RAKLTAPTKG RAVMDADATL KALFALEMCM KNCGGRFHAM AVAKEVPETM VRLCERAPNL EVRDKTLALV HEWAVNLRRE PAFAGAFHQL RARGFQFPEV ERRSVARGAT SSAAREDWNG GGDWSKREDI SEEDRAAIAR ALEEDEEEAM TPVQRQRHGI YPGNIAMGTP VRAPHLSSPH ASGRLADRAE SELQRALRES ERTAAAEASR ATTSVSTTYN SQDVEKLKGD IAVATNSLKV FTQVLDGCVA LRPPSPSSLA NELSEQCRAM QPRLIELISN AEDEGLLASA IHLNDELTKE MERYDLLVKA AAGDVASRAR LAAPVAATRT HTAESLLSEL DDVMRASTAA TTTSQSSANP FGDVLPTASI AGPSSSSSNP FAAGPTPPRP IVHFPSTTSP SPPPMQTNPM FADERYRPAR PLPEGMRPMN PAAFARDSVS PPTPSRAAPS RSPASDPFAD LSQAAASRVR SPSRNLPHV
|
| |
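The coordinate and length fields in this record agree with each other if Start bp and End bp are read as a 1-based, inclusive interval. That reading is an assumption; the record does not state its coordinate convention. A minimal Python sketch of the check follows. The helper names `span_length` and `residue_count` are hypothetical, and the stop-codon arithmetic assumes the usual convention that Protein Length excludes the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def residue_count(grouped_sequence: str) -> int:
    """Count residues in a sequence printed as space-separated blocks."""
    return len("".join(grouped_sequence.split()))


# Values copied from the Gene Information table above.
gene_length = span_length(55604, 58563)

# Assumption: 539 residues plus one stop codon, three bases per codon.
coding_bases = (539 + 1) * 3

print(gene_length, coding_bases)  # prints "2960 1620"
```

Under these assumptions the protein accounts for 1620 of the 2960 bases. The remainder would be non-coding sequence such as introns, which fits the record marking the gene as spliced. Passing the full Protein sequence field to `residue_count` is one way to confirm the reported 539 aa.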