Gene Information |
Locus tag | OSTLU_29977 |
Symbol | |
ID | 5000081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 310489 |
End bp | 313607 |
Gene Length | 3119 bp |
Protein Length | 1011 aa |
Translation table | |
GC content | 56% |
IMG OID | 640415502 |
Product | predicted protein |
Protein accession | XP_001416127 |
Protein GI | 145342088 |
COG category | [R] General function prediction only |
COG ID | [COG1033] Predicted exporters of the RND superfamily |
TIGRFAM ID | |
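The Gene Length value above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check, assuming that convention; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature whose coordinates are
    1-based and inclusive (an assumption about this record)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length(310489, 313607)
print(length)  # 3119, matching the Gene Length row
```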
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0323795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CCGACGATGC CCGCCGCCGA CGCGGACGCG CCCGCGCGCG ACGACGACGC GGGATTCGTC GTGCGCACCA TCTCGAGGCG TCCGTGCGTG ACCTGCGTCG CGACGCTCTT GTTCGCCGCG ATAATTTCCG CCGTCGGTTT GGTGACGGGG ACGCTCGAGG TGCAGACCGA AGGCTGGGAA ACGCGCGGCA CGCCGATCGC GAATCGCCAG GTGCCGTACA GCCTGTACAG CGAGAGCACG TTCGACGACG GCACGACGAA CAACGCGTAC CTGAGCGAAC CGAACGCGCG ACGAAAGTTG TTGTCTGGGA CGCCCGACTG CGGAACGGGG GGGCAGTACG ACTCGGAGAG CAACAGTAAC ATGGAGGACG ACAACAATTT TCACTTGATA TTCGAATCGA CGGGGGGGGA TTTGTTCACG CCGGAAGCGT TCGCGGATAT GTGCAAGGCG ACGGAGTCTT TCATGGATCA CGCCAACGGG GGCGGCGACG GCGACGACTT TGGAAGCCTG TGCGAGCGAC AGAGCGTATG CTCGAGCGCC AGCGATCCGC CGGGGACGAC GTCGTCGGAT TGGTCGTCGG GTGGGACGTA TAAAGGAACG TGGCAGAACT GGCAGATCGA AACTTCGGGC TCGCCTTCGA CGTACAGGCG ATGCTCGCGG GCCTTTTCGT ACCCCACGTA TCTGTACATG AACACCGCCG GGGCCTCGAG CTGCGACGAC TACTTAACAA ACTCTGCGTT GCAAACGGCT CTGACGAATT TGAAGACATA CTTACTCGCC TGCGCCCCCG TGAAGGCTGC TGATTCTGAG GCGGCTTGTT CGGGAACTTC GCCTGTGCCT TGGAAGTCGG CGGCGCTCGG TGAAGCTTTC GGCTTAAATG GCGTCACCGA TTTATCGCTC ACCCGCTCCA TCATCCCAAC GACTGGCAAC AGCGATATGA TCGCAACATG GGGTTTGAAC ACGTGGGAAG GCGGTTGGAC GTACTCGAGT TCGTATTTCA ACGTGTACTA TGACACCGCT AGTTCGGACG TAAAAGACGC GTATGTCGAC GAACAGCTGT TCAAAGACTT GGCGCTCGCC GTCAGCGCCA TCTCCATCAT TATCATCTTG ATGTGGGTGC ACATGGGTTC GGTGCTATTG ACCGTGGGTG GAATCGTACA AGTGCTACTG GCGTTCCCGT CGGCCATTTT CTTCACCAAG ACGCTCTGTC AACTCGACTT CTTTCCGTTT CTCAACTTCA TCGGTCTTTT TGTGATCGCC GGCATCGGCG CCGACGATTG CTTCGTGATG TACGACAAGT TCCAACAGTC CAAGTGTCGG TGCGTCCCCG GGGCGAACGC TACGGAAGTC ATGAAACGCG CGTATTGGGA TTCCGCGTGG GCCATGTTTT TGACGTCGAT GACTACGGCT GCAGCGTTCT ACTCGAACGC AATCATACCC ATCGCTCCCA TTCGCGTCTT CGCAATTTTC ATGGGCACGA TGGTCATATT TGACTATCTG TATGATATCA CGATATTTGC CGCTTTGCTG GCATGGCAGC ACGACGTCAT CGTTGGGTAC GAAAACACTG GCAAGAACAC GTTTGGGTCG TGGTTTCTGG ATTTTTATGG CTCAATCGCA CGCTTTCGAG CCAACTGCAA GAAAGGACCA CCAAAAACCC GCGACGCTGA CGATGTCATC GATTCCAACC GCGAACGAAG ATCGGTGGCT GAAGCTATGC TCGCAGACAA AGTGTTCCCC ATCATTCACA CACTCCGGTG GTTCCTCGTG GTGGCTCTCA TCGGTGCTTT TGCGGGCGGC 
ATGGTCGGCA CTTTGAAGCT GTCGACTCCG CGTGACTCGG AAGTCCAATT GCTCCCTGAC GACCACATGT TCACGAAATT TTCGTTCCTC AGGCGTTCAG GCTTCAAGAG CTCAACGGAG TCTCAGGTGT GGACGAAAGT TCTGTGGGGG GTTACTCCGT CGGACAATGG CGATCACTTC AACCCCGCGA GCCGAGCTTC GATTGAATGG GACACGTCGT TCGATCTTTC ACCCACTGCG AATCAAAACT GGTTGAAGAC TTTTTGCAGT GACACAAAGT CGAACATGGC GAACGATAAC GCAGCGTATT GTTGGTTCGA ATACTTTGAA ACTTGGCTCG GCTCGGCGGG TGGCCCGACC GAGTGTGGCG GCTACAAATT CATTCCAGTC CCTCAGGCCA ACTTTTACAG TTGCGTCAAG TATTACGCCG ATCAGAATCC AAGTGTACAG ATGATGTCTC GCCCACTCTC GAACGCGTTC TACACTAGTA ATGGCCAAAC AAGGATGAAA ATCATGTCGG TGCAGTTCGC CACAAACGTA TTGTGGACCG CGCCTACGGA AGACCTTGAA AAAGTCTGGC AAACTTGGGA GAACTATTTC GCGGCCAAGA TGGCGACAGC ACCAGCTGGA CTTAAGAACG GGTTTCAAAC GTCTGACGCA TGGGCTTGGA TGGATACCGT CGCTCAGATG CGAGACGGTG CGTACATCGC GGCCGGGACG ACGCTCGCAA TCGCTGCGGT GACGACGATG ATCTCCACGC AAAACGTCAT CATCACCTTG TACTCACTGT TGTGCATCTT GACGATTCTC GTCGTCACGG TCGCCGGGGT CGTTTCCATG GGCTGGAATC TCGGGTTCCT CGAAGGGATC TGTCTCGTCA TCTTAATCGG GCTTTCGGTT GATTACGTCG TTCACATCGG GCACGCGTAC GCGCACGCTG CTCGCCACGA AGGCGTCAGT CGTCGGGAAT GCGCTCGCTC GGCGCTCAGT GTCATGGGTT TCCCCGTCCT CGGTGCGGCG TTCACAACTC TGTGCGCTGC GCTTGCGTTA CTTCAAGCCG TGATCGTCTT TTTCACAAAG TTCGGCACCA TCGTCGTGCT CTCCGCCGTC TTCTCGTCCA TCGTCTCCGT CGTGTTGTTC ATCGGCCTGC TCGCGGCGGC GGGTCCGGTC AACGGTATGG GCGACGTCAC GCGCCTCTGT GGACGAAAAG CGTTGTCCAA GAGCAAGAGC ATTGCGTACT AATCGGTGCA ACTTGTACGG CTTTTTACAA ATTTTTCACT TTACTGTAAT TTATTACAAC ACTCTTCACA CGCCGGCCT
|
Protein sequence | MPAADADAPA RDDDAGFVVR TISRRPCVTC VATLLFAAII SAVGLVTGTL EVQTEGWETR GTPIANRQVP YSLYSESTFD DGTTNNAYLS EPNARRKLLS GTPDCGTGGQ YDSESNSNME DDNNFHLIFE STGGDLFTPE AFADMCKATE SFMDHANGGG DGDDFGSLCE RQSVCSSASD PPGTTSSDWS SGGTYKGTWQ NWQIETSGSP STYRRCSRAF SYPTYLYMNT AGASSCDDYL TNSALQTALT NLKTYLLACA PVKAADSEAA CSGTSPVPWK SAALGEAFGL NGVTDLSLTR SIIPTTGNSD MIATWGLNTW EGGWTYSSSY FNVYYDTASS DVKDAYVDEQ LFKDLALAVS AISIIIILMW VHMGSVLLTV GGIVQVLLAF PSAIFFTKTL CQLDFFPFLN FIGLFVIAGI GADDCFVMYD KFQQSKCRCV PGANATEVMK RAYWDSAWAM FLTSMTTAAA FYSNAIIPIA PIRVFAIFMG TMVIFDYLYD ITIFAALLAW QHDVIVGYEN TGKNTFGSWF LDFYGSIARF RANCKKGPPK TRDADDVIDS NRERRSVAEA MLADKVFPII HTLRWFLVVA LIGAFAGGMV GTLKLSTPRD SEVQLLPDDH MFTKFSFLRR SGFKSSTESQ VWTKVLWGVT PSDNGDHFNP ASRASIEWDT SFDLSPTANQ NWLKTFCSDT KSNMANDNAA YCWFEYFETW LGSAGGPTEC GGYKFIPVPQ ANFYSCVKYY ADQNPSVQMM SRPLSNAFYT SNGQTRMKIM SVQFATNVLW TAPTEDLEKV WQTWENYFAA KMATAPAGLK NGFQTSDAWA WMDTVAQMRD GAYIAAGTTL AIAAVTTMIS TQNVIITLYS LLCILTILVV TVAGVVSMGW NLGFLEGICL VILIGLSVDY VVHIGHAYAH AARHEGVSRR ECARSALSVM GFPVLGAAFT TLCAALALLQ AVIVFFTKFG TIVVLSAVFS SIVSVVLFIG LLAAAGPVNG MGDVTRLCGR KALSKSKSIA Y
|
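The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; the helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def normalize_sequence(display_text: str) -> str:
    """Remove all whitespace (block separators, line breaks)
    from a displayed sequence and upper-case it."""
    return "".join(display_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.
    Assumes the sequence contains only A, C, G, T characters."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two display blocks of the gene sequence, for illustration only.
fragment = normalize_sequence("CCGACGATGC CCGCCGCCGA")
print(len(fragment))
print(round(gc_fraction(fragment), 2))
```

Applied to the full gene and protein fields, `len(normalize_sequence(...))` can be compared with the Gene Length and Protein Length rows, and `gc_fraction` with the GC content row.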