Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_48152 |
Symbol | |
ID | 5006953 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 98121 |
End bp | 101332 |
Gene Length | 3212 bp |
Protein Length | 979 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422374 |
Product | predicted protein |
Protein accession | XP_001422806 |
Protein GI | 145357194 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0169632 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGACGCGA CGCGCGCGCG CGGCGACATG GCGCCGTTTC TCGGTGGTAT GCGCGGCCTG ACGGTGTTCG TGCAAGACGT GCGGAACTGC AGTAATAAGG TGCGGCGAGC GCGCGCGCGC GAGGGATGAC GACGACGGGC GAGGGCATTC GATCGAACGA AACGACGCGC GGTGACGCGT TGAGCGGCGA GAGCGCGAGA GAGAGGCGCG CGGACTGACG ACGAGGGTCG CGCGCGTCGC GAACGCGAAC GCAGGAGCAA GAGCGCGCGC GCGTGGAGAA GGAACTGGCG AATATTCGAC GGAAATTCAA TAAGACCCAC CGCGCGCTCA CGGCGTACGA ACGGAAAAAG TACGTGCTGA AGCTGCTGTA CATATACATG CTCGGGTATA ACGTGGACTT TGGACACACC GAGGCGCTGA AGTTGATATC GGCGTCGTCG TACGCGGAGA AACAGGTTGG GTACATGACG ACGTCGGTGA TATTGAACGA GAGAAACGAG TTTTTGAGAA TGGCCATCAA TAGCATACGC ACGGATGTGA TCTCGAGCAA TGAGACGAAT CAGTGCTTGG GGTTGTCGTG CATCGCGAAC GTCGGGGGGC GGGAGTTCGC GGATTCGTTA GCTGGGGACG TGGAGACGAT TGTGATGACG CCGACGATTC GGCCGGTGGT TCGGAAAAAG GCGGCGCTGT GTCTGTTGAG GTTGTTTCGT AAGAATCCTG AAATTTTACT CGCGGAAACG TTCGCGTCAA AAATGACCGA CTTACTCGAC GCCGAGCGCG ATTTGGGGGT GCTCATGGGC GTCTTGGGTT TGTTGCTGGG TCTCGTGCAG CACGATTACC GAGGGTACGA GGCGTGCGTG CCCAAGGTCA TCGCGTTGTT GGAACGATTG ACGAGGAATA AGGACATTCC GCCCGAGTAT TTGTACTACG GTATTCCCTC TCCATGGTTA CAGGTGAAGT GCATGAAGAT TTTGCAGTAC TTTCCCACAC CAGACGATCA GGCGCTGCTC GATTCGCAGC TCATCGCCAT GCGAAACATC CTCACCAAGA CGGACACGGT GAAAAACTTC AATAAGAACA ATGCGCTGCA CGCCATCTTG TTCGAGGCGA TCAATTTAGT TACTAGCATG GACTACGCGC ACGAACTGTT GGACCCGTGC GTGGAGATTC TCGGGAATTT TCTCGACATG AAGGAACCGA ATATTCGCTA CTTGGCTCTC AACACGCTCA ACGCCCTCGC GGCGATGGCG GATTTGCGAG AAGCCATAAA GGTGTACCAA GAGCAAGTCG TGGCTGCGTT GCACGACGCG GACATTTCCA TTCGTCGCCG CGCGTTGACT TTATTGTTTT CTATGTGCGA TGCTTCCAAC GTGCACTCTG TCATCGAGGA GCTCATCAAG TACTTCGTCA CCGCTGATTT TGACATTCGC GAGGAACTGG CGCTCAAAAC GGCCATCTTG GCCGAGCGCT ACAGCGTGAA CGATCGCATG TGGTTCATTG AGATCGCGAT GCAAATGATA GACAAGGCGG GCGATTTCAT CAACGACGAC TTGTGGCATC GCATGGTGCA AATCGCAACC AACGACGCGT CGCTTCACGG TCGCACGGCG CAATTGATGT TCGTCAAGTT GCGCGACGAG GGCGCGTCGA ACGAACTCAT GCTTCGCGCG ATGTCGTACT GCATCGGAGA GTTTGGGTAT TTGCTTCCCA TTCCCGCGTC GCAGTACGTC GATCTCTTAG TGCCACTGTT CCAGGATACG GATGAGGTCA CGCAGGGCAT CATGCTCACA GCCTTCGTCA AGGTTGCGAT GCACAAGAAT TGCGATCAGG CGTCGATGGG TAAGATCGTG AAGGTGTTCA CCGACATGAG CTCATCGTTT GACGTCGAGT TGCAGCAACG TGCAAACGAA TATCTGAAGC TCTTGCGTCT CGGACCGAAC ATGCGACCGA TTCTCGAGCC CATGCCCGAG TACCCTGAAC GTTCGAGCGT GTTGGAGAAG CACATACAAG TAGAAAACGT CGCCTCGGAC GTCGCCGCGG GAGTTCGTAA ACTTGCCATG AGTGGTGTCG TGACGGCGAG AGAGCAACCC CGCGCTCAGG CGCGGTCGGC GCCGGCGCTT CCTGCAGCCG CAGCACCGCC GGTCGATGCC GTCACAGATT TGCTCGGCAA CTTGATGGGC GACGGATCTT CGGCGCCGGC GGCGCTTCCG CCGTCGTCGA CTGGGATGAA TCTCGACGAG CTTCTCGGAA ACGCCCCTCC CGCACTTCCA GCGGTAGAAG AGCGTCTCGC ACTTCCGAGT TCCACGTCAC CTCCCGCGGG GCCGGTGACG ACCACATCGT CCGCAGACGC TTTAGACGAT TTACTAGGCT TAGGCGCGCT CGCGGCGACG CAACCGCCGC CGGCGACGCA CGGCGACGCC TTAGACGCCT TTGGAGCTCT GGGTGCGCCG GCGCCCGCGG CGCCGGCACC GACGCAACCC GTGGCACCGG TACAATCATT GACGTCGAGC GACGGTATTC AACCCACGGT GAACGTTCAA GACTGCGCGA AACGGTTCCT CATCGCCGAC AACGGCTTGC TGTATGAAGA CGCGAACGTA CAGATTGGCG TGAAATCGCA GTGGCAAGGG TCTCAAGGTC GCGTGATGTT CTACGTTGGG AACAAGTCCG CGAGCGCGGA TCTGCAAAAC TTCAGAATGG TCATACCGTC GATCGAAGGC TTGCGTCACA GTCTTCAACC CTTCCCCGCG TCCATCGGAC CGAAGCGCCA GGTGCAGTTG ATGTTGCAAG TAGCGATTAC GTCGGCGTTT GCCTCGGCGC CAAAACTCGA GTTTTCGTAC ACGTCCACCG CCGTCGCGGC GGCGTGTGCC AGGTCTCTGG AGTTGCCCGT ACGTGTGACC AAGTTTTTGA GCCCGATGAC CATCGCTTCG CCGCAAGAGT TCATCGCCAA GTGGCACCAG ATGGCGTCCG CCGGGCAGCA ACAGAAAATT ATGGACGTGT CGCAGCAGTA CGCGACGAGC ATCGAAAGCG TGTCAAACGC CTTCTCGGGC ATGCGGCTCG TCGTACATAA AGGCTTAGAT CCAAACCCCG CAAACTTAAT CGCGGGAAGC CGGTTCGTCG GCGAACGATG CGGTGAAGTC TTTGTGGGCG TTCGCGTGGA GAGCGACGCG AACGTGCGCG GACGATATAG ATTCACCGTC GCTTCGATGG AC
|
Protein sequence | MAPFLGGMRG LTVFVQDVRN CSNKEQERAR VEKELANIRR KFNKTHRALT AYERKKYVLK LLYIYMLGYN VDFGHTEALK LISASSYAEK QVGYMTTSVI LNERNEFLRM AINSIRTDVI SSNETNQCLG LSCIANVGGR EFADSLAGDV ETIVMTPTIR PVVRKKAALC LLRLFRKNPE ILLAETFASK MTDLLDAERD LGVLMGVLGL LLGLVQHDYR GYEACVPKVI ALLERLTRNK DIPPEYLYYG IPSPWLQVKC MKILQYFPTP DDQALLDSQL IAMRNILTKT DTVKNFNKNN ALHAILFEAI NLVTSMDYAH ELLDPCVEIL GNFLDMKEPN IRYLALNTLN ALAAMADLRE AIKVYQEQVV AALHDADISI RRRALTLLFS MCDASNVHSV IEELIKYFVT ADFDIREELA LKTAILAERY SVNDRMWFIE IAMQMIDKAG DFINDDLWHR MVQIATNDAS LHGRTAQLMF VKLRDEGASN ELMLRAMSYC IGEFGYLLPI PASQYVDLLV PLFQDTDEVT QGIMLTAFVK VAMHKNCDQA SMGKIVKVFT DMSSSFDVEL QQRANEYLKL LRLGPNMRPI LEPMPEYPER SSVLEKHIQV ENVASDVAAG VRKLAMSDLL GNLMGDGSSA PAALPPSSTG MNLDELLGNA PPALPAVEER LALPSSTSPP AGPVTTTSSA DALDDLLGLG ALAATQPPPA THGDALDAFG ALGAPAPAAP APTQPVAPVQ SLTSSDGIQP TVNVQDCAKR FLIADNGLLY EDANVQIGVK SQWQGSQGRV MFYVGNKSAS ADLQNFRMVI PSIEGLRHSL QPFPASIGPK RQVQLMLQVA ITSAFASAPK LEFSYTSTAV AAACARSLEL PVRVTKFLSP MTIASPQEFI AKWHQMASAG QQQKIMDVSQ QYATSIESVS NAFSGMRLVV HKGLDPNPAN LIAGSRFVGE RCGEVFVGVR VESDANVRGR YRFTVASMD
|
| |