Gene OSTLU_25726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25726 
Symbol 
ID5006179 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp189310 
End bp192558 
Gene Length3249 bp 
Protein Length1015 aa 
Translation table 
GC content57% 
IMG OID640421600 
Productpredicted protein 
Protein accessionXP_001422121 
Protein GI145355765 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5215] Karyopherin (importin) beta 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.779434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000347416 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGCTGA GGGAAGAGGC GGAGGCGAGG GTGACGCGAA AGACGTGCGA TTTGATTTAC 
GAAGTCGCGG CGGGGGCGAT GGAACGGGAA GAGCCGTGGG CGGAGTTGAT GCCGTTCATG
TTCGGTGCGG TTTCTGAAGG GTCGGACCGA TTGAAGGAGA GCGCGTTGAT GATCTTTGCG
ATGCTCGCGA GTTACATGAG CGATGCGTTG GTGCCGCAAA TTCCGACGCT GCACGCGACG
CTGAGCGCGT GCTTGGCGTC GGCGGATACG AATGTGCGAC TGGCGGCGTT ACGAGCGACG
TGTGCGTTCG TCGATGCGTT GGAAAATCCG AGCGATCGAA TGAAGTTCCA AGATTTGTTG
CCGGCGATGC TCAACACGAT CGGTTCGGCG TTGAGAGGTC AAGATGAGAC GTCGGCGCAA
GAGGCTCTCG CGTTGTTCAT CGAGCTCGCT GAAGCCGATC CGAGATTCGT GCGCAATCAT
CTCGTCGAGC TCGTGGAAGC GATGCTCAGC ATCGCGGAAC ATAACGACCT CGAGGATGGC
ACGCGAACGT TGGCGACTGA GTTCTTGGTG ACGCTCACCG AGGCGAGAGA TCGTGCGCCG
GGTATGATGC GCAAGGTTCC AAATTTCGTC CAGCGCCTGT ACAACTGCCT CGTCTCCTTC
TTGGTCAACG ACATCGAAGA CGATGAAGAT TGGCACACGA CGGAGAACGA AGAGGACGAG
GGTATCGGTC AAGGTGATTT GTACGACGTC GGTCAAGAGT GCCTGGACCG CGTCTCCATC
GCGCTCGGCG CCAACTCCAT GTTACCCGCG TGCGCCGCAA CGATGCCGTC GCTGATTGGC
GACGCGGATT GGAAGCGTCG TCACGCCGCT CTCATCGCAC TCTCGCAAAT TGCCGAAGGT
TGCGCCAAGG GAATGAAGAA GGACGTTGTC GGCGCCATTC AGCCGTGCTT ACACGCGCTC
GCCACCGATC CCCATCCTCG CGTTCGTTGG GCCGCCATCA ACGGGTTGGG GCAAATGTGC
ACCGATCTTG GCCCGAGATT GCAAGAACAG GCGCACGCTA ACGTCGTACC GTTGCTCCTG
AACGCCATGG ATGACGTGAA GAATCCGCGT TGCCAGGCGC ACGCGGCCGC CGCGACGGTA
AACTTCAGCG AAGACTGCCC TCCGGAGTGC ATGGCGCCGT ATTTGGACAC GCTCATGAAC
AAGCTTTTGA GCTTACTGCA GTCTGGCAAC AAGTCTGTGC AAGAAGCGGC GCTCACGGCG
CTCGCATCGA CCGCGGACAA CGCGCAAGAA TCCTTCATCA AGTACTACGA CACCGTCCTT
CCGTACTTGA AGTCCATCTT GGTGAACGCC AACGGAAAGG AGTACCGCAT GTTGCGCGCC
AAGGCTGTGG AGTGCATTTC TCTCGTCGGT ATGGCCGTTG GACGCGCCCG CTTCGCTCAA
GATGCGCGTG AAGTCATGGA CATGTTGATG CGTCTCCAAT CTGGCGGATT CGAAGACGAC
GACCCAACGG TGCAATACAT GCTTCAAGCC TGGACGCGTT TGTGCAAGTG CCTTGGCGAG
GAATTCGTCC CGTATCTTGA AGTCGTGATG CAGCCGCTGT TGAAGTCTGC CAATTTGAAG
GCGGACGTCA TCGTCACGAA CAAGGACGAT GAAGACGGGG GCGAGGAGGA GGAAGAGGAA
AACGACGATT ACGAACAGGT CGAAGTCGGC GACAAGCGAG TTTCCATTCG CACGGCGGCT
TTGGAAGAAA AAGCCACCGC GTGCAACATG TTGTGCTGCT ACGTCGACGA GCTCAAGGAC
GGTATTTTGC CCTACTTGGA GCAAATTCTG CAAACCATGA TTCCCTCGCT CGAGTTTTAT
TTCCATGAGG ACGTGCGTAG AGCCGCCGTG GCGTCGTTAC CGGATCTTCT CCGCGCCGGT
AAGCTTGCTG TGAGCAAGGG AGCAAAGGAT AAAGCTTGGT TCCAGCAACT CGTCAATCAT
ATCATTCCCC CCCTTATTCA AGCGATGGCG AAGGAACCGG ATATCGAGAT TCAGGTGCGT
TACTCAAAGT CGACGTCGAC GAGAAGAAAC AAGTGATGAG ACTTATGGAT TCCCCTTCAA
GACGCCTTGA GGCTCTGGTG ATAAATCCGA TAACAGTCCC TGATGTTTCT GATTGTTCCT
CTTTTTTTTG CATTTTTCCG TCTCGAAAAT ACAGTCGTAC TGACCTAGCG CCCTCTTTTC
TTTTTCGCCA CGCAGGCTCG CATGCTCGAA TCGCTCGCGG AGTCCGCCGG TGAAGCCGGT
GACCTCGTGC GCGATCACTT ATCGGCAATG TTAGAAACAT TCAAGGTATT GCTCACGGAG
TCGCTGGAGC GACGTGCGGA GCGGAACAAG CGCGCGGGCA CGGACGATTT CGATGAAGAG
GAAATGCACG CGCTGGAGGA GGAACAAGAA GCCGAGGACG AAGTGTTTGA TCAATTCGCC
GAATGCGTCG GGTCTTTGTT GAAAAGCTTC CACAGCGCGA TTCTTCCGTC TCTTGAGCCA
TTGCTCGCGT TCATCGTGCC GCTGCTGGAC AAGAACAGGA GCCCCGCCGA GCGACGTATT
GCGATTTGCG TGTTCGATGA CATTTTTGAA CACGCGAGTG ACGGTGGCGG CGCTCTCAAA
TATCTCGACG GTTTCGTATC GCCTTGCATC GCTGGATGCA CGGACAACGA CGCGGATGTG
CGCCAGGCGT CTGTTTACGG CGTCGGCGTG ATGTCCGAGC ACTGCGGCCA AAGCTTTAAC
GCTCACGTGC CGAGCGCGCT CAGCGCGCTC GCGAGCGTCA TTCAAGCGCC GGGCGCTCGA
GACGATGAGA ACATCTACGC CTTCGAGAAC GCGGTCGCCG CGCTCGGGAA GATGTGTGAA
TTTCAGAACG CCGCCTTGGA CGCTAGCGTC ATCTTGCCTT CGTGGTTGGC GAGCTTGCCG
CTGACGGAGG ACAGGGTTGA GGCGAGAAAC GTGCACGCGC AACTCATGCG CCTGCTCGAG
AGCAACGGGC AAGCCTTGAT GGGTGCCTCT TACGAGCACC TTCCCCGCGT CGTCAGTGTA
CTCGCGGATG TTTTACCAAC CTCCGGTTTG AGCGCGAAGC TCCGTCTCGT CGAGCCCGAA
GTCGCGGCAA AGATGAAGGC GTTCCTCGTG CAAATGCAAT CGAGCCTTCC TCAAGACAAG
CTCAGCGCCG CGTGGGGCAT CTTGAGCGCG GAGAAACAAG CCGCGCTGCA GGCGGCACTT
CAAGGTTAG
 
Protein sequence
MALREEAEAR VTRKTCDLIY EVAAGAMERE EPWAELMPFM FGAVSEGSDR LKESALMIFA 
MLASYMSDAL VPQIPTLHAT LSACLASADT NVRLAALRAT CAFVDALENP SDRMKFQDLL
PAMLNTIGSA LRGQDETSAQ EALALFIELA EADPRFVRNH LVELVEAMLS IAEHNDLEDG
TRTLATEFLV TLTEARDRAP GMMRKVPNFV QRLYNCLVSF LVNDIEDDED WHTTENEEDE
GIGQGDLYDV GQECLDRVSI ALGANSMLPA CAATMPSLIG DADWKRRHAA LIALSQIAEG
CAKGMKKDVV GAIQPCLHAL ATDPHPRVRW AAINGLGQMC TDLGPRLQEQ AHANVVPLLL
NAMDDVKNPR CQAHAAAATV NFSEDCPPEC MAPYLDTLMN KLLSLLQSGN KSVQEAALTA
LASTADNAQE SFIKYYDTVL PYLKSILVNA NGKEYRMLRA KAVECISLVG MAVGRARFAQ
DAREVMDMLM RLQSGGFEDD DPTVQYMLQA WTRLCKCLGE EFVPYLEVVM QPLLKSANLK
ADVIVTNKDD EDGGEEEEEE NDDYEQVEVG DKRVSIRTAA LEEKATACNM LCCYVDELKD
GILPYLEQIL QTMIPSLEFY FHEDVRRAAV ASLPDLLRAG KLAVSKGAKD KAWFQQLVNH
IIPPLIQAMA KEPDIEIQAR MLESLAESAG EAGDLVRDHL SAMLETFKVL LTESLERRAE
RNKRAGTDDF DEEEMHALEE EQEAEDEVFD QFAECVGSLL KSFHSAILPS LEPLLAFIVP
LLDKNRSPAE RRIAICVFDD IFEHASDGGG ALKYLDGFVS PCIAGCTDND ADVRQASVYG
VGVMSEHCGQ SFNAHVPSAL SALASVIQAP GARDDENIYA FENAVAALGK MCEFQNAALD
ASVILPSWLA SLPLTEDRVE ARNVHAQLMR LLESNGQALM GASYEHLPRV VSVLADVLPT
SGLSAKLRLV EPEVAAKMKA FLVQMQSSLP QDKLSAAWGI LSAEKQAALQ AALQG