Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17845 |
Symbol | |
ID | 5004957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 262338 |
End bp | 265304 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420378 |
Product | predicted protein |
Protein accession | XP_001421101 |
Protein GI | 145353612 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0578537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAACG GAGAGGGGGC GCTGGGCGCG ATTTGGACGA CGGCGACGGC GGCGCTCGGG ACGCTGCTGA CGAGAGGGAG CTGGCGGACG AGCGCGACGG TCGGGACGTC GTGCGCGCTG CTCGCGGTGA ATGGGGTGAA TTTGTTTTTA TTTTGCTCGT GGTGCACGCT GCAATTTCGG TGGATACACG AGTCGCACGC GGGCGCGGCG GCGACGTGCG AGCGTTTGAT TTTCGCGCTG TGTCCGCCGA CGACGACGAC GATTCTGACG TGGGCGTTCG CGAGCGCGAG CGCGGGAGCT GAGAGAGCGG CGTTTTACGG CTCGGTCGTG TCGTTGATCA CGCATCGGAT GTTTTTGTTT CCGTGCGCGA GCGCGTGCGC GGCGGTGACG AAGACGAACG GGCCTCCGTC GGACAAGCGC TCGGCGTTGA CCGAGAGCGA CGCCGAGTAC GCCACCGTGG CCACGCTCGG CTTACCCGTC GCGTTGTATT TATGGACGAA CCTCGACACA CTGTTCAAGT CCCTCGATCA CGTCTTCGCG TGTGGGGCGT TGGTGACGGT CCCGGTGTTG TATTTAATAG CCGCCGGCAC GGAGAAGTCG TTGTGGTGGC GCGCGCGCGG CGTCGACGTG GCGTCGAAAA GGATGGAAAC CGCCGTGCTC CTCTTCGCAC TCACTGGATT TGCGGTGAGC GTTGAAGGTG GAATAATATT TAGTGAATTT GCCGAGTACA TCGAAATCAT GGCTCCTTTG AATTACATCA TGGTGACGGT TTCCGTCCAC AGCGCGCTGG CAGTTTTCGC CGCGTGCTAC GCAAACGCCG TCGGCGACGG CGTCCCGACG GGTGCGGTGA AGGCGACGTT GGCTCTGTCT ACGTCGACCG CAATTTGTGC TTTGGGTGCA CCTCTGTGGA TGATTCCGAT CCCCATCGCC GGATCATCGT CATTCGTCAA ATATTATTAC GAGGACCGAG AGCCGAAGGA TTACGGCGTG TTTGCCGCGA GTTGCGTCGG ATGCTTTTCG TGGTTTCTTT CGAAGAATTT CTGGTCACTC GATGTGCGCG TGGGAGCTTT CGACGTCAAA CAATTGTGTG TCGCCATATT ACTCCTCGCC GTGGCGGCTT TGGCTTTGCC AGCGGTGTTG AATACGAAAA GCGCGCGCGC GCCGACGGTT GGCGTCCTCG TCGTCTGTTA CGTCTCGGCG CTCGCAACGA TTGAACAAAT CCTGAGTCAA GCGACGCATG ACGACGATTC TTTGATTTAT CCACCATATT TGGTCATCGT CACTTCCATC AGCGGCTTCT TGGCGAGTCG AGGCCTCGTC ATCAGCGGGA GAATCAGTCG CGAATTTGGC TGGGTGATGC AGAGCGTGTG CGGCGCAAAG CTCTCCATGC TCTTTGTTCG CGGTTTGAAG GAAATGTTCA GCGTCTTGGT GGTCGTGCTC GCCATCACCG CTCCGCACGC GATGTCACGG CGAATGACTC AATTATCTCC GGGCGCGAGC GTTGGTTATT GCGTCGCTTT GGTGTTCTCT TTGGTATTCG CTCGGTTTGC GATGTTTGAC GTTATATTTG AACTCTCGGG TCACCGACCG ACGGACGCGA CGCTCTTCGG TGGACTCCTC CTGATCACGG GGGCGAGCTT GGCGTCCGTG GTAACGCGAC AAAGTTACGG TGATGACATG TTTAGCAAAC GATTGATGAT GCTCTTGAGT TTCTGTGGCG TCTTTCTCAT CACCTTCCGA CCGCCGATGC CTTGGAAGGG CGAGGTGGGC ATGTGGTATG ACGCCGAACA CGTTCCCGAC TCTGAAGAAG ACGAAGCGAG AATGTACGGC GTGCGCGAGA ACGCGCATCA TGGATGGCCG AGCTGGTTGC TGATGTTAGC CGCGCTCACC GCGATATTCG CCGTCTCGTC TCCACGACAA CAGACAAAAT CAACGTCAAC GATTCGAATC GCCTTAAGCG CCGTGTGCGG TGGGAGCGTT GGTTTATATA TGGCGCTGGA ATTCTTCGTC CAGCAAGTGG CGCTGACGGC ACTACTATTT GTCGCGTGTG CGCTGGTGGG AGTGTTCTTG TCTTTCACGT ACAGCCCTTC GCCGAAGTCT TCGCGCTGGT TGCCTTACGT GTATTTATCG TTCGTCTCCG TTCTCGGCTT GGCGTACGTC ACACAAATGG GTGGCTCAGA CGAGACTGTG GACGACCATC AGGCGAGAAT GGAAGGAAAA TTCGGTGTCG TCGGCGTTTT CGCCGGGACT TCGCTGCAAA TCGCGTTTGC TCTGAAGTTG AGAATCAAAA CGAGCCTGGA GAGCGTGCAA CATCGACGAC GTCAAGGCGG TACTTCACCG TTCCTTCCCG CCACTGGTCG CAGTCGTCCG GAATATTTCC GCGGTGTCGC GAGCAGAAAC GAGCACAGAG AACTCAAGGC AAAGGCGATC GCTTGGATGC CAATCATCGG GAACATCGCC ACGCTCACGT CCTTTCTCGC GTGCGTGGTG CTGAGCGATG AGTTGGCCGA TGGCTCCGCG TTTTCGGTCT TCGTCCTCGC GCCCATTCTG CTTCTTCTTC ACCAAGACTC GGTGATATTT CCTATACTGG AAGACAGCCA ACGATACGCT CCACCGCTCG CGATGATCGT GGGTAAGATG TGCTGGGACG CCGTCGCCGC CATCCTCGCC GGCCCGAACC GAGTTCACGT TCTCGCCGCG ACCGCCTCCA AGTTGCCGTG GATGACGCTC AACGCGTTGA GTCTGCTCCT CGCTTCGGTG AATAGCATCA ATTTGGTGCA CTACCTCGCC ACGAGCGTTC GCACGGACGG GATGACGCTC ATCTTGACCG CCCCGCTCGC CGTCGTGGCG CCGTTTCTTT CAAAAATTCC CTCCGTGCGC GCGCTCGCCT TCACCAGTCT CATCGCCGTC GTCACCCAGC ACACCCTCCA GCGTCGAGCG AAGATCGTCG GGCTGAAGTA TTTATAG
|
Protein sequence | MGNGEGALGA IWTTATAALG TLLTRGSWRT SATVGTSCAL LAVNGVNLFL FCSWCTLQFR WIHESHAGAA ATCERLIFAL CPPTTTTILT WAFASASAGA ERAAFYGSVV SLITHRMFLF PCASACAAVT KTNGPPSDKR SALTESDAEY ATVATLGLPV ALYLWTNLDT LFKSLDHVFA CGALVTVPVL YLIAAGTEKS LWWRARGVDV ASKRMETAVL LFALTGFAVS VEGGIIFSEF AEYIEIMAPL NYIMVTVSVH SALAVFAACY ANAVGDGVPT GAVKATLALS TSTAICALGA PLWMIPIPIA GSSSFVKYYY EDREPKDYGV FAASCVGCFS WFLSKNFWSL DVRVGAFDVK QLCVAILLLA VAALALPAVL NTKSARAPTV GVLVVCYVSA LATIEQILSQ ATHDDDSLIY PPYLVIVTSI SGFLASRGLV ISGRISREFG WVMQSVCGAK LSMLFVRGLK EMFSVLVVVL AITAPHAMSR RMTQLSPGAS VGYCVALVFS LVFARFAMFD VIFELSGHRP TDATLFGGLL LITGASLASV VTRQSYGDDM FSKRLMMLLS FCGVFLITFR PPMPWKGEVG MWYDAEHVPD SEEDEARMYG VRENAHHGWP SWLLMLAALT AIFAVSSPRQ QTKSTSTIRI ALSAVCGGSV GLYMALEFFV QQVALTALLF VACALVGVFL SFTYSPSPKS SRWLPYVYLS FVSVLGLAYV TQMGGSDETV DDHQARMEGK FGVVGVFAGT SLQIAFALKL RIKTSLESVQ HRRRQGGTSP FLPATGRSRP EYFRGVASRN EHRELKAKAI AWMPIIGNIA TLTSFLACVV LSDELADGSA FSVFVLAPIL LLLHQDSVIF PILEDSQRYA PPLAMIVGKM CWDAVAAILA GPNRVHVLAA TASKLPWMTL NALSLLLASV NSINLVHYLA TSVRTDGMTL ILTAPLAVVA PFLSKIPSVR ALAFTSLIAV VTQHTLQRRA KIVGLKYL
|
| |