Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37402 |
Symbol | |
ID | 5001610 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 205487 |
End bp | 208276 |
Gene Length | 2790 bp |
Protein Length | 890 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417031 |
Product | predicted protein |
Protein accession | XP_001417176 |
Protein GI | 145345348 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.174724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGGCG TCACCGCGTC TCCCCGCCTC GTCGTGCTCT ACGCCTTTGT CGTGCACTTG ATTTTCGTCT ACGCCACGTT TGACGTGCAC TTCCAGTCCC CGCTCGTCGC CGGCGTCGAG CGCGCCGACG CGCGACTGCG CGCGCCCGCG AGACGATTGG TGATATTCGT CGCCGACGGC GCGCGCGCGG ACGCGGTGTT CGACGAAGCG CGAGGCGCGG CGCACGTGCG AAGTCGCGCG CGCGGCGGCG CGTGGGGCGT CTCGCACGCG CGAGCGCCGA CGGAATCGAG ACCGGGACAC GTGGCGCTCT TGGGGGGGTT CTACGAGGAC CCGAGCGCGA TCACGAAAGG ATGGAGCGCG AACCCGGTGG AGTTCGATCA CCTGGTGAAT CAGAGTAACA ACGCGTGGGC GTGGGGCGCG CCGAGCGTGG TGCCGCTGTT CGCCGACGGC GTCGACGGAG CGAGACGGTT TTGTTACGAC GAGACGCTGG AAGATTTTGC GAGCGCGAAC GATCACGGGG CGTTAGATGA GTGGGTGTTC GACCGCGTGG TGCGGTTTTT AGAGAGTAAT GGGGTGGAAG GCTCGAGCGA GAGCGACGCG CTGGACGGCG ACGGGAACGT GTTTCTGTTG CACTTGCTGG GCTTGGATTC GAGTGGACAC GCGCACAAAC CGCATTCGAG CGAGTATTTC GAAAATATTA GGATAGTAGA TGAAGGTGTT CGCAGAGTGG AGGCGGCGTT CGTGGAGAGG TTTGGGGATG ATGGAAAGAC TGCTTTTGTT TTCACCGCCG ATCACGGAAT GTCGAATAAA GGCGCGCACG GCGACGGCGA TCCTGGGTGC ACGGAAACAC CGTTGGTGGT TTGGGGCGCG GGTGTCGCCA GTGGGTCACA AAAAGTCGCG GGAGCGTGTC GTGGGACGCC CGAAACGCCG AAGGATTGGG GCATGGATCC CGAGACGAGG TGCGACGTTG ATCAAGCCGA CGTCGCGCCT TTGGGTGCGA CGCTCATTGG ATTTCCACCG CCTCGTCACA ACTCGGGCTT GCTCCCCAGC GCGTACTTGA GCGATAAACC CGAAGATCTT AAGTCGTCCG CGATGATTGC GAACGCCAAG CAATTGCTGG CGCTTCACGA TCTCAAGGCG TCGCGCACTG CAAAGCGCGC GCTCAGCGCT TTGTTTTCTT TCAAGCTCCA CCCAGATATG ATTAGCGTCT CATCGCAAGT CGCGGAGTAC GAGCGACTCG ACCGGCTTGG TCGACACGAC GACGCGACAC GCGGTGCAAA TCAAGTCGCG CGCGCGTGTT TGAGAGGGTT GGAATATTTA CACACGTACG ACCGGGCGCT TTTGCAAGCC GTCGTCGTGT CGTGTTTTGC GACGTGGATG ATTTTACTTG CAGTCAATTT GCTTCCGCGA AGAAAAGGGG AATGCGAAAC GCCGACTGGT ATACCGTTGA CGCTGGCGCT GATTTATGCC GGTGTCGTGA GCGTGATTCT TCTGGCCCGC CGCGCGCCGC CCACGTACTT TTTGTACTTC GGTCTCCCGG CGTATTTTTT GTTGGGTATC GTGAACACGC TTTCCGCTGT GAAAATAGAA AGTATCGCCG ACATTAGCTT ACTGAACGTC GGTATTTGTC TTGTCGGCGC GTTTGCGGTG GCTGAAACGA TATGTAACGG TTTTCACGAT CGCTTCGTGT TTTCATACGC GTTTGCATTC GGGTCGGCGC TTTTAGTCGC ACTCGCGATA CGCTTGCTGT TTCGTCGAGA TTTTGACAAC GCTACACGTG CCGCTGTGAT GGCTATCAGC ACAGCTATGT TGGCGCCGTT CACCATGCTT TCAATTGAGC TGGAAGCGAA CACGGAGATG ATCGTCTCAG GATTGCTTGC GAGTGCGCTC CTCGGCACCA CGACGCACGT TTTGATTCGG CCGCTTGATA TTTTTCAGGA CGACGCCGTC GACAGGGAAA CTAAGCGCCG ACCGGGTCAG ACTGTTTTCG CGTTGCAAAT ATTAATGGTT CTTACTACTG GCGTATTGGT GACCGCCGTC GACGGTTTAC AGCGCGACAA GTTGCCCGTT CCAACGTCTA TGCACGCCGC ATCGTGGATC GTCGCGTTGA TGGCACCCAT CTTGCCCATG TTTTCACCAC CGCGAACGCT GCCGCGAATG ATTAGTGTGT TCCTTGGGTT TGCATCGACG TATGGACTGT TCTCCGTGTC GTACGAGTCC CTGTTCTACG CTTGTCTCGG GTTTTGTCTT CTTTCGTGGA TGATTCTGGA GCGCGGCTTA CAAGCGCCCT CACCGGCAAA GTCCAAAATG TTCACCCGTA CCATCATCCC GAGCGATCTC CGCCACGCCG CTATGTTCTT ATTTCTCATC GATGCAGCCT TCTTCGGCAC GGGCAATATC GCATCCATCG CATCCTTCGA CCTCAGCAGC GTCTACAGAT TCACCACCCG CTTCAACCCG TTCCTCATGG GCGCGCTCCT CGTCCTCAAG GTATTACTCC CCATGATCAC CGTCGCCGCC GCCTTTCTCG TCGTCTTGAA ATCCTCCCGC GTTCCTGCGT TCGAGTCGTA CTTCATGTTT CTCATCTTGA GCGACATCAT GGCTGTCAGA TTCTTCTTTC AAATCACCAC CGTCGGCAGT TGGCTCGACA TCGGCTCGAG TGTGTCGCGT TACGCGTTGA TGGGAACGCA AGTCGTCACC ATTTTACCAT TCTTAGCCTT GGCCCAGATC TTCACGCGAG CTTTGCCCGT GAACGGGCGT ACCGCAATCA TTCGCACCAA GCGCGATTAG
|
Protein sequence | MHGVTASPRL VVLYAFVVHL IFVYATFDVH FQSPLVAGVE RADARLRAPA RRLVIFVADG ARADAVFDEA RGAAHVRSRA RGGAWGVSHA RAPTESRPGH VALLGGFYED PSAITKGWSA NPVEFDHLVN QSNNAWAWGA PSVVPLFADG VDGARRFCYD ETLEDFASAN DHGALDEWVF DRVVRFLESN GVEGSSESDA LDGDGNVFLL HLLGLDSSGH AHKPHSSEYF ENIRIVDEGV RRVEAAFVER FGDDGKTAFV FTADHGMSNK GAHGDGDPGC TETPLVVWGA GVASGSQKVA GACRGTPETP KDWGMDPETR CDVDQADVAP LGATLIGFPP PRHNSGLLPS AYLSDKPEDL KSSAMIANAK QLLALHDLKA SRTAKRALSA LFSFKLHPDM ISVSSQVAEY ERLDRLGRHD DATRGANQVA RACLRGLEYL HTYDRALLQA VVVSCFATWM ILLAVNLLPR RKGECETPTG IPLTLALIYA GVVSVILLAR RAPPTYFLYF GLPAYFLLGI VNTLSAVKIE SIADISLLNV GICLVGAFAV AETICNGFHD RFVFSYAFAF GSALLVALAI RLLFRLLASA LLGTTTHVLI RPLDIFQDDA VDRETKRRPG QTVFALQILM VLTTGVLVTA VDGLQRDKLP VPTSMHAASW IVALMAPILP MFSPPRTLPR MISVFLGFAS TYGLFSVSYE SLFYACLGFC LLSWMILERG LQAPSPAKSK MFTRTIIPSD LRHAAMFLFL IDAAFFGTGN IASIASFDLS SVYRFTTRFN PFLMGALLVL KVLLPMITVA AAFLVVLKSS RVPAFESYFM FLILSDIMAV RFFFQITTVG SWLDIGSSVS RYALMGTQVV TILPFLALAQ IFTRALPVNG RTAIIRTKRD
|
| |