Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25726 |
Symbol | |
ID | 5006179 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 189310 |
End bp | 192558 |
Gene Length | 3249 bp |
Protein Length | 1015 aa |
Translation table | |
GC content | 57% |
IMG OID | 640421600 |
Product | predicted protein |
Protein accession | XP_001422121 |
Protein GI | 145355765 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5215] Karyopherin (importin) beta |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.779434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000347416 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGCTGA GGGAAGAGGC GGAGGCGAGG GTGACGCGAA AGACGTGCGA TTTGATTTAC GAAGTCGCGG CGGGGGCGAT GGAACGGGAA GAGCCGTGGG CGGAGTTGAT GCCGTTCATG TTCGGTGCGG TTTCTGAAGG GTCGGACCGA TTGAAGGAGA GCGCGTTGAT GATCTTTGCG ATGCTCGCGA GTTACATGAG CGATGCGTTG GTGCCGCAAA TTCCGACGCT GCACGCGACG CTGAGCGCGT GCTTGGCGTC GGCGGATACG AATGTGCGAC TGGCGGCGTT ACGAGCGACG TGTGCGTTCG TCGATGCGTT GGAAAATCCG AGCGATCGAA TGAAGTTCCA AGATTTGTTG CCGGCGATGC TCAACACGAT CGGTTCGGCG TTGAGAGGTC AAGATGAGAC GTCGGCGCAA GAGGCTCTCG CGTTGTTCAT CGAGCTCGCT GAAGCCGATC CGAGATTCGT GCGCAATCAT CTCGTCGAGC TCGTGGAAGC GATGCTCAGC ATCGCGGAAC ATAACGACCT CGAGGATGGC ACGCGAACGT TGGCGACTGA GTTCTTGGTG ACGCTCACCG AGGCGAGAGA TCGTGCGCCG GGTATGATGC GCAAGGTTCC AAATTTCGTC CAGCGCCTGT ACAACTGCCT CGTCTCCTTC TTGGTCAACG ACATCGAAGA CGATGAAGAT TGGCACACGA CGGAGAACGA AGAGGACGAG GGTATCGGTC AAGGTGATTT GTACGACGTC GGTCAAGAGT GCCTGGACCG CGTCTCCATC GCGCTCGGCG CCAACTCCAT GTTACCCGCG TGCGCCGCAA CGATGCCGTC GCTGATTGGC GACGCGGATT GGAAGCGTCG TCACGCCGCT CTCATCGCAC TCTCGCAAAT TGCCGAAGGT TGCGCCAAGG GAATGAAGAA GGACGTTGTC GGCGCCATTC AGCCGTGCTT ACACGCGCTC GCCACCGATC CCCATCCTCG CGTTCGTTGG GCCGCCATCA ACGGGTTGGG GCAAATGTGC ACCGATCTTG GCCCGAGATT GCAAGAACAG GCGCACGCTA ACGTCGTACC GTTGCTCCTG AACGCCATGG ATGACGTGAA GAATCCGCGT TGCCAGGCGC ACGCGGCCGC CGCGACGGTA AACTTCAGCG AAGACTGCCC TCCGGAGTGC ATGGCGCCGT ATTTGGACAC GCTCATGAAC AAGCTTTTGA GCTTACTGCA GTCTGGCAAC AAGTCTGTGC AAGAAGCGGC GCTCACGGCG CTCGCATCGA CCGCGGACAA CGCGCAAGAA TCCTTCATCA AGTACTACGA CACCGTCCTT CCGTACTTGA AGTCCATCTT GGTGAACGCC AACGGAAAGG AGTACCGCAT GTTGCGCGCC AAGGCTGTGG AGTGCATTTC TCTCGTCGGT ATGGCCGTTG GACGCGCCCG CTTCGCTCAA GATGCGCGTG AAGTCATGGA CATGTTGATG CGTCTCCAAT CTGGCGGATT CGAAGACGAC GACCCAACGG TGCAATACAT GCTTCAAGCC TGGACGCGTT TGTGCAAGTG CCTTGGCGAG GAATTCGTCC CGTATCTTGA AGTCGTGATG CAGCCGCTGT TGAAGTCTGC CAATTTGAAG GCGGACGTCA TCGTCACGAA CAAGGACGAT GAAGACGGGG GCGAGGAGGA GGAAGAGGAA AACGACGATT ACGAACAGGT CGAAGTCGGC GACAAGCGAG TTTCCATTCG CACGGCGGCT TTGGAAGAAA AAGCCACCGC GTGCAACATG TTGTGCTGCT ACGTCGACGA GCTCAAGGAC GGTATTTTGC CCTACTTGGA GCAAATTCTG CAAACCATGA TTCCCTCGCT CGAGTTTTAT TTCCATGAGG ACGTGCGTAG AGCCGCCGTG GCGTCGTTAC CGGATCTTCT CCGCGCCGGT AAGCTTGCTG TGAGCAAGGG AGCAAAGGAT AAAGCTTGGT TCCAGCAACT CGTCAATCAT ATCATTCCCC CCCTTATTCA AGCGATGGCG AAGGAACCGG ATATCGAGAT TCAGGTGCGT TACTCAAAGT CGACGTCGAC GAGAAGAAAC AAGTGATGAG ACTTATGGAT TCCCCTTCAA GACGCCTTGA GGCTCTGGTG ATAAATCCGA TAACAGTCCC TGATGTTTCT GATTGTTCCT CTTTTTTTTG CATTTTTCCG TCTCGAAAAT ACAGTCGTAC TGACCTAGCG CCCTCTTTTC TTTTTCGCCA CGCAGGCTCG CATGCTCGAA TCGCTCGCGG AGTCCGCCGG TGAAGCCGGT GACCTCGTGC GCGATCACTT ATCGGCAATG TTAGAAACAT TCAAGGTATT GCTCACGGAG TCGCTGGAGC GACGTGCGGA GCGGAACAAG CGCGCGGGCA CGGACGATTT CGATGAAGAG GAAATGCACG CGCTGGAGGA GGAACAAGAA GCCGAGGACG AAGTGTTTGA TCAATTCGCC GAATGCGTCG GGTCTTTGTT GAAAAGCTTC CACAGCGCGA TTCTTCCGTC TCTTGAGCCA TTGCTCGCGT TCATCGTGCC GCTGCTGGAC AAGAACAGGA GCCCCGCCGA GCGACGTATT GCGATTTGCG TGTTCGATGA CATTTTTGAA CACGCGAGTG ACGGTGGCGG CGCTCTCAAA TATCTCGACG GTTTCGTATC GCCTTGCATC GCTGGATGCA CGGACAACGA CGCGGATGTG CGCCAGGCGT CTGTTTACGG CGTCGGCGTG ATGTCCGAGC ACTGCGGCCA AAGCTTTAAC GCTCACGTGC CGAGCGCGCT CAGCGCGCTC GCGAGCGTCA TTCAAGCGCC GGGCGCTCGA GACGATGAGA ACATCTACGC CTTCGAGAAC GCGGTCGCCG CGCTCGGGAA GATGTGTGAA TTTCAGAACG CCGCCTTGGA CGCTAGCGTC ATCTTGCCTT CGTGGTTGGC GAGCTTGCCG CTGACGGAGG ACAGGGTTGA GGCGAGAAAC GTGCACGCGC AACTCATGCG CCTGCTCGAG AGCAACGGGC AAGCCTTGAT GGGTGCCTCT TACGAGCACC TTCCCCGCGT CGTCAGTGTA CTCGCGGATG TTTTACCAAC CTCCGGTTTG AGCGCGAAGC TCCGTCTCGT CGAGCCCGAA GTCGCGGCAA AGATGAAGGC GTTCCTCGTG CAAATGCAAT CGAGCCTTCC TCAAGACAAG CTCAGCGCCG CGTGGGGCAT CTTGAGCGCG GAGAAACAAG CCGCGCTGCA GGCGGCACTT CAAGGTTAG
|
Protein sequence | MALREEAEAR VTRKTCDLIY EVAAGAMERE EPWAELMPFM FGAVSEGSDR LKESALMIFA MLASYMSDAL VPQIPTLHAT LSACLASADT NVRLAALRAT CAFVDALENP SDRMKFQDLL PAMLNTIGSA LRGQDETSAQ EALALFIELA EADPRFVRNH LVELVEAMLS IAEHNDLEDG TRTLATEFLV TLTEARDRAP GMMRKVPNFV QRLYNCLVSF LVNDIEDDED WHTTENEEDE GIGQGDLYDV GQECLDRVSI ALGANSMLPA CAATMPSLIG DADWKRRHAA LIALSQIAEG CAKGMKKDVV GAIQPCLHAL ATDPHPRVRW AAINGLGQMC TDLGPRLQEQ AHANVVPLLL NAMDDVKNPR CQAHAAAATV NFSEDCPPEC MAPYLDTLMN KLLSLLQSGN KSVQEAALTA LASTADNAQE SFIKYYDTVL PYLKSILVNA NGKEYRMLRA KAVECISLVG MAVGRARFAQ DAREVMDMLM RLQSGGFEDD DPTVQYMLQA WTRLCKCLGE EFVPYLEVVM QPLLKSANLK ADVIVTNKDD EDGGEEEEEE NDDYEQVEVG DKRVSIRTAA LEEKATACNM LCCYVDELKD GILPYLEQIL QTMIPSLEFY FHEDVRRAAV ASLPDLLRAG KLAVSKGAKD KAWFQQLVNH IIPPLIQAMA KEPDIEIQAR MLESLAESAG EAGDLVRDHL SAMLETFKVL LTESLERRAE RNKRAGTDDF DEEEMHALEE EQEAEDEVFD QFAECVGSLL KSFHSAILPS LEPLLAFIVP LLDKNRSPAE RRIAICVFDD IFEHASDGGG ALKYLDGFVS PCIAGCTDND ADVRQASVYG VGVMSEHCGQ SFNAHVPSAL SALASVIQAP GARDDENIYA FENAVAALGK MCEFQNAALD ASVILPSWLA SLPLTEDRVE ARNVHAQLMR LLESNGQALM GASYEHLPRV VSVLADVLPT SGLSAKLRLV EPEVAAKMKA FLVQMQSSLP QDKLSAAWGI LSAEKQAALQ AALQG
|
| |