Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_23932 |
Symbol | |
ID | 4999774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 770595 |
End bp | 773650 |
Gene Length | 3056 bp |
Protein Length | 1009 aa |
Translation table | |
GC content | 60% |
IMG OID | 640415195 |
Product | predicted protein |
Protein accession | XP_001415928 |
Protein GI | 145341670 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACGGC GACGACCGGC GACGGCGGCG CGGGCGATCG TGGGGGGGGT GCTCGCGCTC GCGAGCGTCG CGGTGCCGGC GCGAGGGGCG TTCAGCGTGA TGCCGTCGAC GGGGACGGGG ACGACGGTGA CGTACGCGGG GACGGCGGCG TTTCGCGAAC GCGTCGTGGA CGCGTGCGGG ACGTGCGGCG GGGACGGCGC GGCGTGTCAG GGATGCGATG GCGTGACGAA CAGTGGGAAG GTGTTCGATG CGTGCGGGGC GTGCGGGACG GCGTGCGATA AGAGCGATAC GTCCATAACG TGTACGTTTA ACGCCACGTG CGAGGACTGC GCGAGCGTCA CGGGCGGCGC CGCGACGACG GACGCGTGCG GGAATTGTAA AGCGCCGACG GACGCGACGT TTTCGCGAGA GGGGGTCCCG GATCATCATC TCGGTGGCTG CGTCGGGTGC GATGGAGTCG CGAATAGCGG AGCGGTGATG GACGTGTGCG GGGTGTGCGG CGGGAACGGG TGCTCGGGAA CGGATCCGTC GCTGTGGAGC TGGTGCTGCG ATTGCGCCGG CGTGGCTTTC GGGACGAGCG CGCAAGATTT GTGTTGTGAA TGCATCGACG GCGCATCGTA CTACCCCAAC GGTGTGAAAC CAGCCGAGCA CACGCAGATA GAAAATTTAT GGACGCAAGG GCAAGAGGCG TTCGCGGCGG CGGCGGCGAC GATGACGACG TTTCGCGCGC TGGTAGAACC AAGACACAAG GCTGCGGCGC AGACGCTCTA TGAACAGGCA GAACAGGCGT TCAAAGATGC ATGGGCGCTC GTCGCCACGC AGGCCGCTGT TCCAGGTACG ACGATGTGCT ATCCAGAGCT CACGGCTTTA CCGCAAGTGA CAAACAATCG CGACGCGTGC GGCGTTTGCA AGGGGCAGCT TTCGTCTTGT TTAGGTTGTG CGACGAGCGC GACGCCGCTT CCGGTAGGGC CGTTAGAGGG CTTGTATCCA GATGATTGCG ATGTGTGTGG TGGATCGACT GAGGTGGATG TGTGTGGGAT TTGTGGAGGG AGTAGCAACG GCTTGGATTG CGTCGGTTGC GACGGCATCG TCGCGAGCGG TAAAGTGCAA GACGCGTGCT ACGACTCTGA CGACACAAGA ATGTATAGTA TCGACGCCGT TTCGGGAGCG AAGACGCTGC TCGTACAAGT TGGCGAGGTT GGTAGCGGGT GTAGCGACCC CGATGCTTTC ATCACCGCGT GTGCAGCGGG GGGCGGCGGA TGTTGCGGTT GTGACGGCGT GCCGAATAGC GGTAAAACGC TCGACGCATG TGGCTCGTGT TTAGCGGCGG ATGCCACGGA TAGGAAGACC AACGCGAATG CGTGCGAAGA AATCTTCCTC GTTAAACTTC CAAGCGGTGT AGTGATCGGT CCATTCACGA AAGAACAGAT CCGAGGAGGA TCTTTGACGT ACACCGAATC GTCGACGAAC ACGGAGACGA TCTACACCAT CGAGCCGGAT TCGCAGATTG CTAACGCTGC AGTTATTGAT GTCACTGAAA GCGTCGAATA CGGCTACACT TCCGTCGAGC AAGAATATTT AGATTCTTGG CAGTCGATTT TTGTGAGCGG ACGAGCGGAT GTGATCAGCT CCACGAGCGT GCCGAACACG GTGAACACAG TCGTCGTCAC TGGTAGTCGA GTGACGAATA CGAGCGAATA TCAAGCGTTG CAAGTGAAAC CGGAATTCGT CGGTGTTATT TATCCGTTTT GCACGGGGGC GACGTATCGG GCTGGTCATA GACTGCACGG CAAGAAAACG GGGTCGAACT ATTTAGGAAT CATCACCACG AGCGCGAGTA TGCCGGAAGA CTGGACCGAG GCACAGTCAC GTCTCGATCG TGACTGGAAC CCGTATTCGG AGAGAGTGTT GGACTACGTC GCAGAATGGG CCGATCAGCG GTGTACGTGC GTCGCCGACT GGAGAACTGA ACCCGAGCCC GTACCTCCAA CATGTGAGCG GGTTTGGCTG CAAGCGAGCG AGGAAAAGAA AAGTTCAAAG ATGGAGCGGA GCTGGGCGAG CGGCTCGTGG CGCGTTACGG GTGGTTGGTG GACGCAAGAC TTTGGGCAGA AAACGACGAC CACGCAAAAT GCTGCTCGCG GACCAAGACG TATAGACTCA TTTGGGACGC TGCGCACACG ACTCTCGCGT ACGGCGAGCC AAGAAACCGA CGAGCTCGGG GCGTGCGACT ACAAAGCACT ACGGGTCATC GATCCCGTGC GAAACGCGAG CGGCGCGGTG TGGTATCCAC AGCAACAGCA AGTCGCGCAA GGATTCAACG TGACCTTCAA GTTCATGATC ACGCAACCGA CAGTCACATG TGACTACGCC GAGAGCGTCA GCGGTGCGTT CGTGCAGTCT TTACACACTA AGCTCTATGA AAAGTGTACG ACTTCGGGCG GCGACGGCTT TGCGTTTGTC ATTCGCGACG ATAGCGCTTC GGCGCCGGGA GCGACAGACA TCGGGTTCGA TGGTCCCGGA TTGGGATACG GTGGCATCAC CAATTCGATT GCGTTCGAGT TCGACACCGT GTTCACCGCA GCATATAACG AACCACGTGA GAGTCACGTC GCGATTCACA CTCGCGGTAA ATCCCCAAAC ACGGCGCATT CGGCGGCGAG CCTCGCGACC GTCGCTCTTG ACGGTTCCGT GCCCACGACG AGCATCACCG ACGGTCAGAT TCACGAAGTC TTCATCACGT ACAAGCCCAA CATCACCTCT GAGGAGATGT TTTTCGCCAT CGAGTCCGGT GAAATCACCG GTTTGTCCAC CGCCCTCAGC GCGCACACCG CCGACTCCCT CGGAGTCGTC TCCGTCTACC TCGACGACAT GTCCTCACCG CTGATGAGCG TCCCTTTCAA CATCGAGAGC ATCTTGCGCG ACTCCGCCAC GAGCGGCAGC GCGTGGGTCG GCTTCACCGC CGCCACGGGA GATCTCTGGC AAGCCGTCGA CATCTTAGAG TGGAACATGA CCTCGGTCTC CGTCTCGTGA CGGCGGCACG GTCAGACGGT TGTAAC
|
Protein sequence | MGRRRPATAA RAIVGGVLAL ASVAVPARGA FSVMPSTGTG TTVTYAGTAA FRERVVDACG TCGGDGAACQ GCDGVTNSGK VFDACGACGT ACDKSDTSIT CTFNATCEDC ASVTGGAATT DACGNCKAPT DATFSREGVP DHHLGGCVGC DGVANSGAVM DVCGVCGGNG CSGTDPSLWS WCCDCAGVAF GTSAQDLCCE CIDGASYYPN GVKPAEHTQI ENLWTQGQEA FAAAAATMTT FRALVEPRHK AAAQTLYEQA EQAFKDAWAL VATQAAVPGT TMCYPELTAL PQVTNNRDAC GVCKGQLSSC LGCATSATPL PVGPLEGLYP DDCDVCGGST EVDVCGICGG SSNGLDCVGC DGIVASGKVQ DACYDSDDTR MYSIDAVSGA KTLLVQVGEV GSGCSDPDAF ITACAAGGGG CCGCDGVPNS GKTLDACGSC LAADATDRKT NANACEEIFL VKLPSGVVIG PFTKEQIRGG SLTYTESSTN TETIYTIEPD SQIANAAVID VTESVEYGYT SVEQEYLDSW QSIFVSGRAD VISSTSVPNT VNTVVVTGSR VTNTSEYQAL QVKPEFVGVI YPFCTGATYR AGHRLHGKKT GSNYLGIITT SASMPEDWTE AQSRLDRDWN PYSERVLDYV AEWADQRCTC VADWRTEPEP VPPTCERVWL QASEEKKSSK MERSWASGSW RVTGGWWTQD FGQKTTTTQN AARGPRRIDS FGTLRTRLSR TASQETDELG ACDYKALRVI DPVRNASGAV WYPQQQQVAQ GFNVTFKFMI TQPTVTCDYA ESVSGAFVQS LHTKLYEKCT TSGGDGFAFV IRDDSASAPG ATDIGFDGPG LGYGGITNSI AFEFDTVFTA AYNEPRESHV AIHTRGKSPN TAHSAASLAT VALDGSVPTT SITDGQIHEV FITYKPNITS EEMFFAIESG EITGLSTALS AHTADSLGVV SVYLDDMSSP LMSVPFNIES ILRDSATSGS AWVGFTAATG DLWQAVDILE WNMTSVSVS
|
| |