Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24022 |
Symbol | |
ID | 5000072 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 175983 |
End bp | 180551 |
Gene Length | 4569 bp |
Protein Length | 1395 aa |
Translation table | |
GC content | 54% |
IMG OID | 640415493 |
Product | predicted protein |
Protein accession | XP_001416093 |
Protein GI | 145342014 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000888169 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGCCGCGA GGAAGCGCGC GTCTCGCGCC CATCGACGCG AAACCCTCCA TTGTCGCCAT CACCGCGCGC GCGCGTTCTC GCGGCGCGTC GTCGACGCGC GCGCGCCGTC CGAGTCGCCG AAAAGCGATC GCCGCCGCCG GTGCAGACGC GCGCGGTCGC GCCTAAAACG CGCGTCGAAC GCCATGCCGA CGTTCCGGGA CGACATGGCG GGGTCATCGC ACGACACCGC GGCGATTGAT TTCACGAAGA CGCCGTTCGC GCCGGCGTCG ACGCCGAGAA AGATTAGCGC GGTGCGGTTT TCGCTGATGA GCTCGAGCGA AATCGGACGC GCGGGGGTGT TTCACGTGTT CGAACGGAAT TTGTATCAGA TGCCGCAGCG AACGCCGATG CCGAACGGGA TATTAGATCC GAGAATGGGG ACGACGGATA AGCGCGGTGG GGAGTGCGCG ACGTGTCGAG GGAACATGGT GGATTGCGCG GGGCACTTTG GGTACATCAA GCTGGAGCTG CCGGTGTTTC ACATCGGATA CTTTAAAAAC ATCATCAGCG TGTTGCAGTG CGTGTGCAAG AATTGCTCGC GGATTTTACT GAACGAGGAT GATCAAGATA TGTTTTTGAA GCGCTATCGG AATCCAAGAC TTGAGATTAC GCAGAGGCGT CTGCTGTACA AAAAGTTGAC GGAAAAGTGC AAACGATGCC GAACGTGTCC GCATTGCGGA GATTACAACG GCGTGACCAA GCGTGCGGGG CAGACTTTGA AGATCGTTCA CGAAAAGTAT AGTAAAAACC CAGCGTTGTT GGAGGAGTTT ATGAAGGAGT TTGAAACAGC TTTGAAGTAC AACGAGCAGC TTCGAGCGTC ACTGCCAAAG GTGCAAGACG ATTTGAATCC CATTCGTGTG TTGGGCATAT TACAGCGCGT CACTTCGGCC GATTGCGAAT TACTAGACAT CGCAGATCGA CCGGAACACT TGCTCTTGAC GCACCTGCCC GTTCCGCCGT GTTGCATCAG GCCTTCGGTG GAGATGGACG GCGTGTCAGG GTCAAACGAA GACGATATCA CGATGAAGCT CATACAAATC ATCGAAGTCA ACAATGTTTT GCGGCAAGGC TTAGAGAAAG GGCTCGCTGT TAACAATATG ATGGAGAATT GGGATTTTTT ACAGATTCAA TGCGCCATGT ACATCAACAG CGAGCTTCCC GGTTTATCTC TGCAGTACCA GGGCCCTGGT AAGCCGCTGC GCGGCTTTGT GCAAAGACTC AAGGGTAAGC AAGGGCGCTT CCGCGGTAAC CTCAGCGGTA AGCGCGTAGA CTTCAGTGCG CGCACTGTGA TCTCACCAGA TCCGAATCTT CGGATCGACG AGGTTGGTGT ACCAATACAT ATAGCGAAGA CAATGACATA CCCAGAAGTC GTCAACAAAC ACAACATGGT GATGCTGAGA GAGCGCGTGC GCAATGGTAT GGCGAAGCAT CCTGGAGCGA ACTTTGTCAA GTTTGCGTCT GGTGGTATGC AGTACTTGAA GTACGGCGAT CGTCGCAAGA TTGCCAGCGA GTTGAAATTC GGAGACATTG TTGAGCGACA TTTGCACGAC GGCGACATTC TGTTGTTCAA TCGTCAGCCG AGTTTACACA AGATGTCAAT CATGGCGCAC AAGGCACGTA TCATGGAGTG CCGAACTCTA CGTTTTAACG AGTGCGTGTG TACGCCGTAC AACGCCGATT TCGACGGTGA CGAGATGAAT ATTCATTGCC CTCAAACGGA AGAAGCGCGC GCCGAAGCGA TGCAACTTAT GGGAGTGCAA CACAACCTGT GCACACCGAA AAACGGCGAG ATTCTCATCG CAGCGACGCA GGACTTCCTA ACTGCAGCGT TTCTTCTCAC CATTAAGGAT GCGTTTTTTG ATCGCTCTCA GTTTGGCTCG CTCGTCGCGT ACATGGGTGA TGCGCTCCTT TCCGTGGATC TTCCGACGCC AGCAATCTTG AAACCCATTG AGCTGTGGAC TGGAAAGCAA GTGTTCAGCG TTCTCATTCG ACCGAAAGCG TCCGATCACA TATATGTTAA TCTTGAAGTC GCTGAAAAGC TGTACAACAA GAAGGACAAA ACAATGTGTC CCGACGACGG GTACGTGTGC ATTCAGAATA GTGAAATAAT CTCAGGGCAA CTCGGCAAGG CGACACTCGG TAGTGGAAAT AAAAGTGGAC TATTCTACGT GCTCAACATA GAATACGGCG CCGAAGCCGC CGCAAATGCG ATGAATAGAC TCGCCAAGCT GAGCGCTCGA TGGCTGGGTA CGCGCGGGTT TAGCATTGGC ATCGACGACG TCTCACCGGC CGCGGAGTTG TCAGCCGAAA AGGGCCGACG TATCGAGGAC GGCTACAGAA CTTGCGACGA ACGAATCGCT TCGTATGAGA AAGGTACTTT ACCTCTTCAA GCGGGCTGCA ACGCGGAGGA AACGTTGGAG GCTGAAGTTC TCGGGGTCTT GTCTGCGGTG CGCGAAGCCG CGGGGAACGC GTGTTTAAAG GCGCTGCCGC GTCGCAACGC ACCACTTATC ATGGCTCTTT GCGGCAGTAA GGGTAGCACG ATCAACATTT CTCAGATGAT CGCGTGTGTT GGTCAACAAG CTGTGGGAGG CTCGCGACCT CCTGACGGTT TCGCCGAGCG ATCGCTCCCG CATTTCAGGC GCGGAGAAAA GACACCCGCC GCAAAGGGCT TTGTCGCCAA CTCTTTCTTT TCTGGCATGC GACCGACGGA ATTCTTCTTC CATACCATGG CTGGTCGTGA AGGTCTCGTC GATACTGCCG TAAAGACTGC AGAGACTGGA TACATGTCTC GTCGTCTCAT GAAGGCGCTA GAGGACCTAT CGCTTCAGTA CGACGGCACT GTGCGCAACT CGATGGGTGG CATTGTGCAG CTGCGCTACG GCGACGACGG CATGGAACCG ACTATGATGG AGAGTGAAGA CGCCCAACCG ATCGAGTTTA AGCGGTGTTT GATGAATGTG CGAGGTCTGA ACTCGGCACG AGGCGAACCG CCAGCGACTC GCTCGGCGTT AGATGCGGCG CTGGACCAGT TCAAGCGAAA CAACAGAATT ATCAATGTCG ACGAAGAAGG CGAAAGCGCC GAGACGCGCG AACTCGTGTA CAGCGCTGGT ATCTCGCAAT TATTCTACGA AAACATGAGA ACATTCATCA TCAACGAAGT GGAGTCGCCT TCGGATGGCG TATACATGTC GGATCGACAG ATGAAGGCAT TTTTGGACGC GTGCGCGCGA AAATACACTG AAAAACGAAT TGAGCCCGGC ACCGCAGTTG GCGCCATTGG CGCACAATCC ATCGGCGAGC CAGGCACACA AATGACGCTG AAGACGTTCC ATTTCGCCGG TGTGGCGTCG ATGAACATCA CCCTCGGCGT TCCTCGAATT AAGGAAATCA TCAACGCGTC CAAGAATATC AGTACGCCAA TCATCACGGC GTCGCTCATG AGCGACAGGG ACGTCAAGGC GGCGCGCGTG GTGAAGGGTA GAGTGGAGAA GACGACGTTG GGCGAAATCT GTTCCGAAAT CAATATTGTC GTTCGTCCGC ACGATCTTTA CCTCGAGCTT ATGTTAGATA TGGAGGCGAT CAATCAACTT CAGTTGGACG TCACCATTCA CTCCGTGCGC CTGGCTGTGC TCGGGGCGCC AAAGCTCAAG CTCAAGCCTC AACACGTGCT CATCGCCGGC GAGAATATCT TGCATGTCTT ACCCTCGGAA GAAGCACTGA TAGAAAAACG CGCGTTGTTC ACTCTGCAGC ACATGCGACT GGCCATTCCC GCCGTCATCG TGCAAGGCAT CCCATCCGTC GGTCGAGCCG TCATCAACGA CAAGGGCGAC GGAACGTACA ACCTCATCGT GGAAGGCGTG AACTTGCAGT CCGTCATGGG TATCGAGGGC ATCAAGGGAA CAGAGACACG AACAAACCAT GTCATGGAGT GTGAACGCAC GCTAGGCATC GAAGCGGCGC GCGCGTGCAT CATTGAAGAG ATTGACAGCA CGATGGGCGC TCACGGCATG TCAATCGACA ACCGACACTC GATGTTGCTC GCCGACGTCA TGACGTACAA GGGCGAGGTT CTCGGGATCA CGCGTTTCGG CATGGCAAAG ATGAAAGACT CTGTACTCAT GTTGGCTTCG TTCGAGAAAA CTACGGATCA TCTGTTCGAC GCAGCGCTGC ACGGGCGCAC GGATTACATC GACGGCGTCT CCGAGTGCAT CATCATGGGC ATTCCAATGC CCATCGGCAC GGGGATGTTC CGCTTGCAGC ACCGCGCGAT GAAACTCGAA GTTGACATCA ACTTGGTAGA AGAAGACGGA AGAGAAACGT TGATGACGAC TGTCACCCCC GAACAACGCG AAGAATTGCC GAAAAGACCG CCCTTGCTCT TGCAAGGCGG ATACTGCAAA CCATGCAGAC CTATCGAAGC TTAATACGTT GCGGTCCGCT CGAAATTTTA GAAATTTTAG TAGAATAGAC GGATTTTTAC AATTCAATTC GAGGGCGTTA GCCAAATGTC GCGATCGTGC GAGCGATGA
|
Protein sequence | MPTFRDDMAG SSHDTAAIDF TKTPFAPAST PRKISAVRFS LMSSSEIGRA GVFHVFERNL YQMPQRTPMP NGILDPRMGT TDKRGGECAT CRGNMVDCAG HFGYIKLELP VFHIGYFKNI ISVLQCVCKN CSRILLNEDD QDMFLKRYRN PRLEITQRRL LYKKLTEKCK RCRTCPHCGD YNGVTKRAGQ TLKIVHEKYS KNPALLEEFM KEFETALKYN EQLRASLPKV QDDLNPIRVL GILQRVTSAD CELLDIADRP EHLLLTHLPV PPCCIRPSVE MDGVSGSNED DITMKLIQII EVNNVLRQGL EKGLAVNNMM ENWDFLQIQC AMYINSELPG LSLQYQGPGK PLRGFVQRLK GKQGRFRGNL SGKRVDFSAR TVISPDPNLR IDEVGVPIHI AKTMTYPEVV NKHNMVMLRE RVRNGMAKHP GANFVKFASG GMQYLKYGDR RKIASELKFG DIVERHLHDG DILLFNRQPS LHKMSIMAHK ARIMECRTLR FNECVCTPYN ADFDGDEMNI HCPQTEEARA EAMQLMGVQH NLCTPKNGEI LIAATQDFLT AAFLLTIKDA FFDRSQFGSL VAYMGDALLS VDLPTPAILK PIELWTGKQV FSVLIRPKAS DHIYVNLEVA EKLYNKKDKT MCPDDGYVCI QNSEIISGQL GKATLGSGNK SGLFYVLNIE YGAEAAANAM NRLAKLSARW LGTRGFSIGI DDVSPAAELS AEKGRRIEDG YRTCDERIAS YEKGTLPLQA GCNAEETLEA EVLGVLSAVR EAAGNACLKA LPRRNAPLIM ALCGSKGSTI NISQMIACVG QQAVGGSRPP DGFAERSLPH FRRGEKTPAA KGFVANSFFS GMRPTEFFFH TMAGREGLVD TAVKTAETGY MSRRLMKALE DLSLQYDGTV RNSMGGIVQL RYGDDGMEPT MMESEDAQPI EFKRCLMNVR GLNSARGEPP ATRSALDAAL DQFKRNNRII NVDEEGESAE TRELVYSAGI SQLFYENMRT FIINEVESPS DGVYMSDRQM KAFLDACARK YTEKRIEPGT AVGAIGAQSI GEPGTQMTLK TFHFAGVASM NITLGVPRIK EIINASKNIS TPIITASLMS DRDVKAARVV KGRVEKTTLG EICSEINIVV RPHDLYLELM LDMEAINQLQ LDVTIHSVRL AVLGAPKLKL KPQHVLIAGE NILHVLPSEE ALIEKRALFT LQHMRLAIPA VIVQGIPSVG RAVINDKGDG TYNLIVEGVN LQSVMGIEGI KGTETRTNHV MECERTLGIE AARACIIEEI DSTMGAHGMS IDNRHSMLLA DVMTYKGEVL GITRFGMAKM KDSVLMLASF EKTTDHLFDA ALHGRTDYID GVSECIIMGI PMPIGTGMFR LQHRAMKLEV DINLGVSQMS RSCER
|
| |