Gene Information |
Locus tag | OSTLU_28050 |
Symbol | |
ID | 5006031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 162056 |
End bp | 165130 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | |
GC content | 54% |
IMG OID | 640421452 |
Product | predicted protein |
Protein accession | XP_001421858 |
Protein GI | 145355209 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
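The start, end, gene length, and protein length values in the table above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
start_bp, end_bp = 162056, 165130

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3075
print(protein_length(length_bp))  # 1024
```

Both results match the Gene Length (3075 bp) and Protein Length (1024 aa) rows.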
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0618861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0695266 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGT CGACGACGAC GTTCACGCCG CCCGCGCCGT CGACGCCGAC GACGCTGTGC GATTCGCAAA ACGTCGCGTG CGTGAACACG GATGTCGCGG ATGCGGAAGT TTTGAGCGCG TTCGCCGCGT ACGTGTGCGC CGCGCTCGCG GTGTTGACCG CGTTCGGGAT CGCGAGGAAA TACGTGCCGA TTTATACCGG TCGAGAACAC CTGCGGTCGC TGAAGACGAG CGGGTGCGCA CCGCCGCGAT TCGACGCGAG CGCGAATCGC GGCGGCGCGC GAGAGGCGTG CTCGACCACG TACGGATGGA TCGCGCACGT GTTGACGGTG GCAGATTCTG ACATCGTGCA CACGGCAGGA TTGGATGCAT TAGTATTCTT AAGAATTGCG CAGTTCGGGA CGCAGTTGTT CGCGCCTTTA GCGTTGGTTG GAGTGCTGGC GCTCGCGCCG ACACACCTGT CGAGATCGTA TTACGAGACG ACGACGACGA GCGAGTCGTC TGCGGCGCGC GAGAGCCACG TGTTGATGCG AATGACGATC GCAAACGTGG AACCGACCAG TTCGTTGATG TGGATGCACG TAGTGATGTT TTGGGCGTTC ACGGCGTATG CGCTGTGGTT GCTGACGGCC CACTATCGCT CTTACGAGTT TTTACGTCAA GTGTACGGAA CGACGACGGG CGAGTCGAAT CCTTGGCGCG CGGTGCACAT CCCGCAAACC GTGCTACAGA AGTTTTTACA GCAAGGAATA AACACAAACA GAGAGTTCAT GACGGAGACT ATTGAGGAAG AGGAAAGAGA GGGCGGACAG ACTCCTGCGG CGAGAACCAT GCCGACGACG ATGCTTCTCG AAGCGTTGCT TGGGCCGAAA CGCTCGAACG GTCAGATGTC GACCGAGCAG AAGCTCAGAC GTTTTCCTAG GGCATCGTTC ATTGAAAGCA CTCGTGGACC ATCGATGGCG ACGCCATCAA GAGGCTATCA CTCCCACTTC GGTCCCCTTC GAGTGACGGA GACGCCGCGT GAAGAATCGT CTCGATCGGT GTCGGAGATT TCCATGGCGT CCATGAGTGA CTTTGACGAA ATGTCCAAAG AACATCACTT AGATAAATTG ACGAGCGATA GCGAGGATGT CGCGATAAGA CACAATTGGT GGGAAGGGTT AGACATCGCT GAAGAGGTAT GGAGCGACCA ATTGAGGAGT GGGAGCGACG GATTTGGATC GACCGATGCG CTCTCTGCTC CGCGGCAAAT CAAAGTTGAT ATCGAGGGGC GATTTCCTTG CAACGACGCA TCAACGTGCG ATATTAACCC AGTACCATCG ATAGATGATA GACGGTACGT CTCAGCCGTC GCGGACGAAG TCAGCGAAGA CGGCAGCGAA AAGGAAGTGG TGGTCAGCGT GTTGGTGCAA AACTATTGCG TCTTGATGAC AGACGTCGGT GGTAATCTTC CCGAGGGGGC CGCGGATCCG TGGGAAGGTG TGCGAGCGGT GGAGACATTT TTCGGAGGCC TGTTTCCAGA CGACTTCCTA ATGGTGATTC CTTTACAGGA CTACCGCCCT GTGGACGACT TACTCATCGA GCGCGACAAG CTCAAGAATG AAATCGAGAA ACAATCGATG TTGCAATCAA AACGGCATGG ACACCGTCGT ATGCGTAGAG GGAGCGGTTT TCGGGATGAA ATCACGGGTT TACGAGACAG AGTAGCAATT TTAGACCACT TGGTTGTTCA GGAGCGCACC AGAATTCTTC AAACCGAGCC CGGGTCGAGT TGTATCGTTG CTTTCAAAAG CCAGTATGCG 
GCGGCGTGCG CGGCGCAGTG CCGTATCACA TCGCGTCAGC GTGATCTTTT TGCGATCGAA CCCGCGCCGG GACCCGACAA TCTCAATTGG CAATCGGTAT TACTTCGAAG ACGTCAGCGT GAGATCCGAT CGATGGTGAT TTTCCCGCTC ATTCTCACCA TCATACTCAT TCCGACGGGA ATGTTCACTG GCGTGATGTC GTCGCTATGC GTAGCAAATC AATTCGGTGC AAATCACAAC GACGGCTTGA AGTGGTACTG CTCGAGCGAT TCCGCGCGGT ATCTACGAAT TCTAGTGCAA GGTATTTTAC CACCCATTCT GCTGACACTC TGGGAAACGT TTGTCGTTTC GTTCGGAATG ATGTATCTCG TTCAGGCACA GAGCAAGTAT TCTAGTCTGA GTAAAACAGA CGAGTCGTTT GCGGAGTACT ACTTTCTGTG GGCGTTTCTG AATGTGTTTT TCGGCACTGT ATCTGGTTAC GCCATTCAAC GATATTTGAA CGCGCTCAAC ACGAAAGGTC CGGATGCCAT GCTGCAACTT CTCGGTACGT CGCTGCCGCT CACAAGTAAT TTCTTCCTAC TTTGGATCGT ATTCAGAGGG GTATACCTCC CCACTCAGCG GTTGATTTTC CCTCATCCCG GAGTGCTATG CATGATCGTC AATCGCTGGC TGTGCTGTTT GGGATGCAAC GTGACCGCTC GAGATAGAAC GATCAAATAC AGCCCGAGAT CGGTTCGCCT TGGTCGCGAA GTCGGTGTGT TCGCCATGGT GATGATGATT GGTCTCGTCT TTTCCACAGT CGCACCTTTG ATCACATTAC TCTGCACCGT ATTTTTCGTC TTTAATTTTG TCATATGGCG TTATCACGTC CTATATGTGT ACGAACGCTC GTACGAAGCC GGCGGGGCGA TGTGGACAAC GTTTTGCAAC TTGACGATTT ACGCGCTGGT CATCGCGCAG AGCTTTTTGT CGTTTGTCCT CTTGTCCAAG CAAGCGTACG CCGGAGCACT CATTCTCTGG ATCACTGTCT TACCGGTTCT AAGCAAAGCC AGTCACAGAT TTCGATCGAT CGCGAGCGAG CTTCGCTGGT CCGTGCCCCT ACCACAGGCG TCCATCGCGC CTCGCGCCGA GTTCAACGCC GAGACTTACA TGCATCCAGC GCTCAAGCGC AACTCCATGG GATGGCACCC AGAAATCGGC AAGGTCTGGC GAGGGTACCC TAACGTCACC GTGAAAGAGA CTCGGATATT CAGAAGACGT CAACGACATA GATGA
|
Protein sequence | MTTSTTTFTP PAPSTPTTLC DSQNVACVNT DVADAEVLSA FAAYVCAALA VLTAFGIARK YVPIYTGREH LRSLKTSGCA PPRFDASANR GGAREACSTT YGWIAHVLTV ADSDIVHTAG LDALVFLRIA QFGTQLFAPL ALVGVLALAP THLSRSYYET TTTSESSAAR ESHVLMRMTI ANVEPTSSLM WMHVVMFWAF TAYALWLLTA HYRSYEFLRQ VYGTTTGESN PWRAVHIPQT VLQKFLQQGI NTNREFMTET IEEEEREGGQ TPAARTMPTT MLLEALLGPK RSNGQMSTEQ KLRRFPRASF IESTRGPSMA TPSRGYHSHF GPLRVTETPR EESSRSVSEI SMASMSDFDE MSKEHHLDKL TSDSEDVAIR HNWWEGLDIA EEVWSDQLRS GSDGFGSTDA LSAPRQIKVD IEGRFPCNDA STCDINPVPS IDDRRYVSAV ADEVSEDGSE KEVVVSVLVQ NYCVLMTDVG GNLPEGAADP WEGVRAVETF FGGLFPDDFL MVIPLQDYRP VDDLLIERDK LKNEIEKQSM LQSKRHGHRR MRRGSGFRDE ITGLRDRVAI LDHLVVQERT RILQTEPGSS CIVAFKSQYA AACAAQCRIT SRQRDLFAIE PAPGPDNLNW QSVLLRRRQR EIRSMVIFPL ILTIILIPTG MFTGVMSSLC VANQFGANHN DGLKWYCSSD SARYLRILVQ GILPPILLTL WETFVVSFGM MYLVQAQSKY SSLSKTDESF AEYYFLWAFL NVFFGTVSGY AIQRYLNALN TKGPDAMLQL LGTSLPLTSN FFLLWIVFRG VYLPTQRLIF PHPGVLCMIV NRWLCCLGCN VTARDRTIKY SPRSVRLGRE VGVFAMVMMI GLVFSTVAPL ITLLCTVFFV FNFVIWRYHV LYVYERSYEA GGAMWTTFCN LTIYALVIAQ SFLSFVLLSK QAYAGALILW ITVLPVLSKA SHRFRSIASE LRWSVPLPQA SIAPRAEFNA ETYMHPALKR NSMGWHPEIG KVWRGYPNVT VKETRIFRRR QRHR