Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28295 |
Symbol | |
ID | 5006172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 165988 |
End bp | 170172 |
Gene Length | 4185 bp |
Protein Length | 1394 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421593 |
Product | predicted protein |
Protein accession | XP_001422114 |
Protein GI | 145355751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.126458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.583249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCACG GCGGTAAGAC GACGAGCCGG TTGTCGATCG CGGATGAAAG TGAATTTCCG TCGCTCGCGT CGGGGGCGCG AAGGAACCTG GCGATGGCGT TGGACGGCAC GGCGAGCGGG GACGGAAAGC CGAGGCGAGT GGCGCCGACG CCGGTGGAGA CGAACGGAAC GAACGGAGCG AGACCGCCCG TGGGGAACGG ATTTCACGCG CAAGTGACGC ACTGTGCGTC GCCGATGGGA GGTGCGCGAC GAATCGCGCC GACGATGGTT TCGAGACCAG TGGATGGGGA TTTGTCGTCG TCGTTTACGG CGTCGGGCGC GGCGACGCCG GCGAGCAAGC CGAGTAGTTT ATCCGCGTCG CCGTCGTCCA TGATGAAACG CGCGGCGGCG ACTGCGACGG CGCTGACGCT GGAACGATTG TATGTGAACG AGCCGAGACC GATGGAGCCG ATAGAGTTGC CGACGCCCTT GGCGATGTTG GCGACGCTGC ACGCGGCGGC GCTGCGCGCG GGATTTCCGC TCGATCAGTC TCCGGAGCTC GCGTGGTTGT GTGCGCTCTT GGGCGCGCCC TCAGACTTGG TCGTCGAAAC GATCGAGGGT AGTGCAAAGC TCGTCATGCG TACTGGACGC GAGGCGCATG CGTACGCTTC AAAAGTGCTT TCGCTCGTGC CGGCGCTCGC GTGCGCGCTC GGCGAAGGCG CGCTCGAAGC GCTCGCCGAG TCCAACGTTT TAGCTTCGCA CGCTAGTGCG TTACATAGGG AAATAGTCCG TGAGTTGGGT GCCGTGCGTG TCGCTTCGGG CGAGTGCGAG CGGGCGTATC AAGTAGACGG TCGCGTGGGA AGCAAGCTGT ATGGATTCGA CTTGCTCGAC GCTTTGGCCA AGGCGGACGA GCGACGGGGT CGCGCAAACG CCAGACCAGG TCGTGCGATT CCACTGACGG AAAATCAGGC GATAGAACAC AACCGTGAAC GCACTCGAGA TGCGTTCTAT GCGCTCTTAC GCGCGTCGGC GAGCCGATCG GTGGGTTTAG GTCAAGATGC GTCCATCGCA AGCCGCGCTC GAACGCTCAT CGCGAACACG CATCCGGCAA ATATGCGGTG GCTTGCCGAT TTATTCGTCG CCCGTCTGTC ACAAAGCAGT GCGGTAGGCG AAGTAGACGA AGAGCTTGGA AACAGCATCA CGCCGGCGCG CTTATCACGC CTACACGAGC GCATAGTAGG AAGTAACGCA GCGACGCAAG CGCCGCAAGG AGGTAGGAGC GGCGCGGGCG CGCGAGTCGA TGGTAGAGGT GGCCGAGGAG GACGTGGACG TACTCTTGGT CGAGGAGGTG GAGAAAAATC CACGAACGGA GTGACGCAGA CCAACGGGTT TGGCGCTGAA GTGACGCACA CGTGGCCGTC ATTTGTGTGC TTGTTCCCGC CTGCGCAACA GCCCTTCGTG CGATTCATCG AAACGACGGA TTCGCACAAG TTTTGCGTCG CACTTCAACG CTCGCTTGTC GCAGCGTTAC ATGTTCTTGA CCCGAATGCG CACCCGAGTG AAGACGCCGC GCAGTCTCCG GCGAGTTCAT CGAACGCTGG GCAAACGGAA CGCATGTTGG CTGCTCACGC GATGGGCTCT TTCTTGGGTT TACTCACTTT TGGATTCGCT TCAGGCGCGA CTTCTGCGCG TGTGGCGTCA AATAATATGG CACTGCAAGG TATTGATTTG ACGTCGACGC TTCGGCGAGC CGGAAGACGT GGAGAGCTCG TCGTGAGCGC GCCGTGGGTG CTCGCGTTTT TGCGCTTCCT GATGTGGGAC GCAGATTCGC TCGATATCGC GCACTACGCC GAAGCACTGG CGTATTTGCG TGCAATCGCG GCATCACCAT GCTTAGATCC GTTCGCTGTA AAAGGCGACT TCAATTCATC GCGCATGTGT TTGCGATCCA TTTTAGCAAA TGGTTTGTCG GTGAATGCGC CGATTACGAC AGTGCAAGCC TCGACTTGGA CCTCCAACAA CGTGATGGGT CTACCGCCAC CACCGCTCGC GGTGATGGAT GCGTTGCCTA CGACAAGCAA GGAGATTATG CAATCGTACG AAGATCCTTT AGCGATTGAC AGTTGGGGCA GGGAATGGGT TCCATCGGCT TCGGCGAACG CTAGCGCCAC GGCGAATTCT CAAGAAGTAC CAGATTCGCC CTCCGAGTCG GACTCGAGGA TCGAAACACA AACTGCGCCA GACTTGAGTC GTGTGGGCGT AGATCGTCGC TACGTCGAAA CTGTGTGCCC AAGCGTCGAC CGTGCGGTGC GAACTTTGAA ACAACGAGGT GCCATCGCGC AAGCGCTCGT CGAAGAGAAA GAACAACTCA CAAAGGAAGC ACCTCGTTCT TTGATGCACA CGCCATCGCA ACCGACGAAT GTGCCTCGTC GAATGCAAGC GCAGCCACAG CGCGTAGAAC CAACTTCTGT GAGCAAGTCG ACGCAAGCAC AAGACTTCGG CGGCGTCTTT GAAGCCAAAA ACACGCTTTC GCCAACCTTT GGCGTGACTT CGCCAAACGC GGAGAGTGCA CTCGCAACAA TGCCGTCGAT GGATTTGAAA TCTTCCTCGA CTGAAATTAA ACATGCACTG CAGCGTGCTT TCTTGCAGAA CGTTCCAGGT CTGCGCCGTC TCGTCGACTT TGCCGTGGAC GCCGCAACGC TCGCTGCTGT CGATGACGCC ACCGCGGCTA TCACCGCAAG CGCGGTAGAG AGCGCGCAAG AGGTTGTGAG TGCAGCGGCG ACGCAGGCAG CGATGAAGTA TGCCGAGTCG GCGAAACGAA ACGGAACGTC GTTCGTTTCC CATGCGAGTT CTGTCATCGA AGAAGCTTGG ACACCAGAGT TTGAGCGTGC AGTCGAGCGA TCTGCAATGT CGGTGACGAG AGATGTCATA TCTTCTGTAG CAAGCACCGC CGCCGACGAC GCGGGTGCAC GCGCCGCCGC CGCGGTTGCG GCACTTTCAG GTGCCAACAC GAGCCGAGGC GGAGCAGCCG CGACTGGGGG ATCAAGCGCC GCGACGAGAG CAGCGTGCGG CGTCGCCGCT GACGCGGCGG CGTTCGCGGC GGCGGATCGG GTGCGACGAA ATTTACCTTT CGAGCTTCAC GCGCGCGTGG CGAGCGACGC ACGAAGGTTG TTGAAGTCAG CGCTCGGCGC GGCGAAAGAG GCCATGAGCG CACCGGCGAC TGAAGAGACA GACACTCAGA AGTCATCCTC GCGTGAAGCC GAGTACGCGA GCGTCGCCGC TTCGGCGATC GCAGCCGCGC TTTCTTCGGC GTCAAAGTGG CCCGCAGGCA CCTTACTTGC CGAGTGTTCA TCTTTAAAGC TGCTCGCTGA TCGTAAGGCG GACGCGTTAA CGCTCTCTCG CGGCATAAAA AGTGTCATCG AGTGCACCAG AAAAGTTCTC GAACAAGACG CCAAATTTAT TCGCGTGCAA ACGAGTCGTA AGAAGCCTTC GACGCACGTT GAAAAGCCGA TTCCCGACTC TAGACCTGGA GAGCAGCTCG CGTCGATGTT AGCCCAGCTC GCGAGAGCGT ACGTCAAGCT TCTTCTCGCG GCGCCCCAAA ACGCGCTCTC GAAGACGACA AAGGCAAAGG AGGCGGAGGA AGGCTGGGGA ATATCGCGGG ACGAAGTGTG CGAAGGTCTC ACCGCCACGC TCATCGCCAC GTTTACCGCA TCTGCACCGG CATTAAACCT CGTGTCGGCC TTAGCGCCAC CGGACTCACC CGATCCTTGG AAGCGTGTGT ACGACGAAAT GTTCGCCGGC GAACTCTGGC TCGCCGCGCT TCGCGAGTTC CCCGAGAGCC CGTTCGCTCA AAAGCGAATT GAACTCATCT TCGCCGAGGC CATCTCAAAA CTGCCGCGAC GCTCCGTGTT GCGCACTGCC TTTCTCCACC TCATTGACGA CGCCGACAAG CGCGTCGGTC GACTCGGGCA CGCCCTCGCG ATTCGCGGCG CGATTCGTTT CTGCGCCGTC GTCGCCGCGA GCGCGACGCA CGACGACGCC GAAGAAGTCA CCGCCATCGC CGCCTTCGCC GAAGATCTTC GTCAGCGATG CGAAGCCGCG GGCGAAAACA GGCTCGCTAG ACGCGCCGAG GGCGCGCACC GGTCGCTCAC GTCGGGATCC GTGTACGCCA ACATCAAATC CGTCGCGACT CCGACAAAGT CGTAG
|
Protein sequence | MRHGGKTTSR LSIADESEFP SLASGARRNL AMALDGTASG DGKPRRVAPT PVETNGTNGA RPPVGNGFHA QVTHCASPMG GARRIAPTMV SRPVDGDLSS SFTASGAATP ASKPSSLSAS PSSMMKRAAA TATALTLERL YVNEPRPMEP IELPTPLAML ATLHAAALRA GFPLDQSPEL AWLCALLGAP SDLVVETIEG SAKLVMRTGR EAHAYASKVL SLVPALACAL GEGALEALAE SNVLASHASA LHREIVRELG AVRVASGECE RAYQVDGRVG SKLYGFDLLD ALAKADERRG RANARPGRAI PLTENQAIEH NRERTRDAFY ALLRASASRS VGLGQDASIA SRARTLIANT HPANMRWLAD LFVARLSQSS AVGEVDEELG NSITPARLSR LHERIVGSNA ATQAPQGGRS GAGARVDGRG GRGGRGRTLG RGGGEKSTNG VTQTNGFGAE VTHTWPSFVC LFPPAQQPFV RFIETTDSHK FCVALQRSLV AALHVLDPNA HPSEDAAQSP ASSSNAGQTE RMLAAHAMGS FLGLLTFGFA SGATSARVAS NNMALQGIDL TSTLRRAGRR GELVVSAPWV LAFLRFLMWD ADSLDIAHYA EALAYLRAIA ASPCLDPFAV KGDFNSSRMC LRSILANGLS VNAPITTVQA STWTSNNVMG LPPPPLAVMD ALPTTSKEIM QSYEDPLAID SWGREWVPSA SANASATANS QEVPDSPSES DSRIETQTAP DLSRVGVDRR YVETVCPSVD RAVRTLKQRG AIAQALVEEK EQLTKEAPRS LMHTPSQPTN VPRRMQAQPQ RVEPTSVSKS TQAQDFGGVF EAKNTLSPTF GVTSPNAESA LATMPSMDLK SSSTEIKHAL QRAFLQNVPG LRRLVDFAVD AATLAAVDDA TAAITASAVE SAQEVVSAAA TQAAMKYAES AKRNGTSFVS HASSVIEEAW TPEFERAVER SAMSVTRDVI SSVASTAADD AGARAAAAVA ALSGANTSRG GAAATGGSSA ATRAACGVAA DAAAFAAADR VRRNLPFELH ARVASDARRL LKSALGAAKE AMSAPATEET DTQKSSSREA EYASVAASAI AAALSSASKW PAGTLLAECS SLKLLADRKA DALTLSRGIK SVIECTRKVL EQDAKFIRVQ TSRKKPSTHV EKPIPDSRPG EQLASMLAQL ARAYVKLLLA APQNALSKTT KAKEAEEGWG ISRDEVCEGL TATLIATFTA SAPALNLVSA LAPPDSPDPW KRVYDEMFAG ELWLAALREF PESPFAQKRI ELIFAEAISK LPRRSVLRTA FLHLIDDADK RVGRLGHALA IRGAIRFCAV VAASATHDDA EEVTAIAAFA EDLRQRCEAA GENRLARRAE GAHRSLTSGS VYANIKSVAT PTKS
|
| |