Gene Information |
Locus tag | OSTLU_16762 |
Symbol | |
ID | 5003602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 287939 |
End bp | 290725 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419023 |
Product | predicted protein |
Protein accession | XP_001419766 |
Protein GI | 145350759 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
|
|
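The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 287939
end_bp = 290725

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced: No")
# and ends with one stop codon that is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(f"gene length:    {gene_length} bp")
print(f"codons:         {codons}")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (2787 bp) and Protein Length (928 aa) fields.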
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.905302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.169694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGG CCGCGGCGAC GGACGACGTC GCGGACGTCG TCGTCGTCGA CGGCGCGGAC GCCGACGTCG TGGAGCTGCG CGCGGTCGAG CGCCGGGCGA CGCGCGCGAC GTCGCGCGCG GCGACGACGT TCGGCGCGCT CGAGGACGGA CCGGTGGCGC GCGCGGACGC GGCGATGACC GCCGCGAGGC GGGCGCACGC GCGCGCGGGC GTGCTGGAGT ACGACGTCGA TCATCGCGGC GTGCAGGTGC CGTCGAGCGC GCGGCGATTT TCGATGACGG CGAGTCACAG ACGCGTGGGC GCGAGCAGCG TGTCGCTGGC GCTGTACCTG GACACGGTGC TGCAGTTCGG TTTGCTCGCG GTGGTGCTCG CGGGAATCAA CGCGTACTCG TTGACGACGA ACGTGACGGA TAGGGCGTTC ACGGCGTCGT ACGCGGTGAC GGGGTGGGAC GCGACGGCGG GAGCCGACGC GACGACGACG TGCGCGAGGA CGTACGAGAC GGGCGGCTTT GTGCTGAGCA CGAGTCAAGG GTCGAGGTGT GGAGTCCCGA CGTTGGCGAG TTTTTATAAC TGCCCGATGC GATGCGATTA CGAAGGAGCG ATCGGGACGT TTGACGCGAC GAATGCGTGC CAGACGCACT ATCCGTGCAC GCTCGAGAAG GTGCTGACGA CGACGCAGAC CGCGGTGTGT TGCGTACCTC GGGTCGCCGC CACTGTGAAT AAAACGCCTA AGGAGTTTCA GGCGGTTTCA ATCGTCGTGA CTGTGGTGTT TTTACTCTTT GACATGTTTT ACACTCGTAA TCGCGCGACG GCGGCGGCGA TCATCAAGTC GAGTGTGGTG ACCGCGAGCG ACTACTCGGT GCTCGTCACC GGGCTCGGAA AAGGGAACGA GTGGACTCGA CAGCAAATCG CGGATTTTTT CTCGCATTAC GGCGAGGTTG TTTCTGTTTG TCATCTTACG AACACTCGAC GACTCGTGAC TATGGAGAGA GAAATCAAAA ACGCCGTGCA GCGAAGAAAC GAAATTCAGG CGATTCTCGA TGACAACGAA GGCGCGAACA TGGACGAGAA GATCAATAAA TCCATCTTAT TGCGTCGACA ATTGTATCGA ATGATCGCAC TCAAAGGTAC GAAACCAACG CGAGAAGCGT TAGTAAAGTT GGATGCAAAA ATTGCCGAAC TCAAGGCGAA GGTCGTGGCG CTCGGGGAAA ACGGCAACTC GCATCTGGGG AGCGCCGTGG TGACGTTCAA TTACGAGCAG CACGCGATCA ACTGTTGCGA AGATCACTGC GGCGATTACG GAGAATCGAT CATCGATCGA TGCACCGGCA AACATTCACC AGACTTTAAC GGACGTCGAC TCATCGTCAC TCGCGCGCCC GAACCGAGCG ATATCAACTG GCAGCACCTT CGTCGACGAG ACAGTAACTG GGAGGTTGTG GCTACCGCCA TCGGGGCGAA GTTCGTGCTC GCGGGCGCTT TGTGCGTTGG AGGCGTTATT CAGTACGTTT TCGAGGTCTT GCGTAGTAAT CAACTCGAAA CCATTACGAA CGAGGTCGCT TTCCAATCTA CGGCTTCGAC GACTTCGTCC ATCAGACTGC AACTCGTCGC ATCTTCCACG TCACTCGTAG TGGTTATCAT TAACGGTTTG CTGGATCGAT TGACGGTTTA TCTCGCGCAA CTTGAGGTTT ATAAGACGAA AACGATTGAG ACAAACATGC TGATCGCGAC GTTGACGTTT GTTAACTTGC TCAACTACGT CGTCGTTCCG ATCATCACCA ATCGATGCTC GTCGAAGGCA 
GATGGTGTGT GTAACTGGTA CGTTCCCGGT GGTTTCGTCG AATATGCGTT CTATTTGCAA GTGTTCAACG TTCTCTTGTT ACCTTTAAGG AACATGAACA TCAAACACAT TATCCAAGTG AAAATACTCG CCCCAATGTC GAAGACGTTG TCGATGCAAG AGAGTCTTGT GCAACCTCCA GCTTTCCAGC TGGCGCGTTC GTACGCCGAG TTGCTCAAGA TTCTCGGTCT ATCGGCAATT TACGCGCCGG CACTTCCCGT GAGTTACGCC GTAGGCGTCT TTGGGATATT CGTGCTGTAC TGGAGCAAAA AGTACCAAGG TTTGTATCTC ACCACGGCGC CGCCGAAACT CCGCGAGGAC TCCTTCGGCA TCACGATTAC TGCTCGAGTG ATCAACCTGC TGCAAATCCT GTTTGGCTGT CTTGTGTTTT ATCGCTTTGA TGACGGAATC TCTACCACAC TCTGGGCGAA CATTGGTATC TGGGCGGTCG CTCTCATTCC CATTCGAAGA ATCAGGCGAT TGTGTATTGC GCAGACTCAG CTCGCGAATT CCACAGATGA TGTTTCTTTC GTTAAAAACG CGGGTCTACA CGGCGGGTCG ACGGACGATA AAACAGACCA CGAGATGCCC GAAATGCGAG TCACACAATC GCCAGAAGTA ACAGAGACTC GATCACGACG AGTTCGCACA GCATTCCTGT GTCGCATGTA CCAGTGCGAG AAGAGCGAGT TGATGTCGAA AGGGCGTTTG GGTTTGTATC ATCCCCCGAT TCCAACACAC GCAAGTCCAG AACAGTTAGA GAGGCTTCTG AAGAACTACG AGCCGTTCGG CGCGCTCGTA CCTGCTAACG CAAACTACTT GCCTGGACAG ACACAATCCA CTGGTGGCGA CAACACGGCG CCACCATTCT CTGAGAAAAC ACGGGCAAAA CTGGATATTC TCGCGTCGTT TCAGAGGCGA AAGAGCCAGC AGGAGCACCA TCTCTGA
|
Protein sequence | MATAAATDDV ADVVVVDGAD ADVVELRAVE RRATRATSRA ATTFGALEDG PVARADAAMT AARRAHARAG VLEYDVDHRG VQVPSSARRF SMTASHRRVG ASSVSLALYL DTVLQFGLLA VVLAGINAYS LTTNVTDRAF TASYAVTGWD ATAGADATTT CARTYETGGF VLSTSQGSRC GVPTLASFYN CPMRCDYEGA IGTFDATNAC QTHYPCTLEK VLTTTQTAVC CVPRVAATVN KTPKEFQAVS IVVTVVFLLF DMFYTRNRAT AAAIIKSSVV TASDYSVLVT GLGKGNEWTR QQIADFFSHY GEVVSVCHLT NTRRLVTMER EIKNAVQRRN EIQAILDDNE GANMDEKINK SILLRRQLYR MIALKGTKPT REALVKLDAK IAELKAKVVA LGENGNSHLG SAVVTFNYEQ HAINCCEDHC GDYGESIIDR CTGKHSPDFN GRRLIVTRAP EPSDINWQHL RRRDSNWEVV ATAIGAKFVL AGALCVGGVI QYVFEVLRSN QLETITNEVA FQSTASTTSS IRLQLVASST SLVVVIINGL LDRLTVYLAQ LEVYKTKTIE TNMLIATLTF VNLLNYVVVP IITNRCSSKA DGVCNWYVPG GFVEYAFYLQ VFNVLLLPLR NMNIKHIIQV KILAPMSKTL SMQESLVQPP AFQLARSYAE LLKILGLSAI YAPALPVSYA VGVFGIFVLY WSKKYQGLYL TTAPPKLRED SFGITITARV INLLQILFGC LVFYRFDDGI STTLWANIGI WAVALIPIRR IRRLCIAQTQ LANSTDDVSF VKNAGLHGGS TDDKTDHEMP EMRVTQSPEV TETRSRRVRT AFLCRMYQCE KSELMSKGRL GLYHPPIPTH ASPEQLERLL KNYEPFGALV PANANYLPGQ TQSTGGDNTA PPFSEKTRAK LDILASFQRR KSQQEHHL
|
| |
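The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, measured for GC content, and translated is given below. The Translation table field in the record is empty, so the standard genetic code is assumed here; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical and do not come from the source database.

```python
from itertools import product

# Assumption: standard genetic code, written in the conventional
# T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence listed above.
first_blocks = "ATGGCGACGG CCGCGGCGAC GGACGACGTC"
print(translate(first_blocks))
print(round(gc_fraction(first_blocks), 3))
```

The gene sequence is listed starting from its ATG start codon, so no reverse complementing is applied here even though the record gives the strand as minus. Passing the full Gene sequence field to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence fields.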