Gene OSTLU_29487 details

Gene Information

Locus tag: OSTLU_29487
Symbol:
ID: 5006612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 327762
End bp: 330849
Gene Length: 3088 bp
Protein Length: 910 aa
Translation table:
GC content: 56%
IMG OID: 640422033
Product: predicted protein
Protein accession: XP_001422715
Protein GI: 145357009
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5215] Karyopherin (importin) beta
TIGRFAM ID:
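
The coordinate fields are consistent with the stated gene length if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That convention is an assumption, since the record does not state it. A minimal Python sketch of the arithmetic, using a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(327762, 330849)
print(gene_length)  # 3088, the value in the Gene Length field

# Base pairs needed to encode 910 residues plus one stop codon.
coding_bp = 3 * (910 + 1)
print(coding_bp)  # 2733
```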


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.505299
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00488082
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage

Sequence

Gene sequence
ATGGCGACGG CGTGGACCCC GAACGGGGAC GGGGCGGCGC GAATCATTCA GATGATCGCC 
GAATACTTGG ACCCGCGCGC GAATCAGCGA GAGATGCTGG GCAGGCTGGA GCAATGCGCG
GGGTTTCCGG ACTTTAATAA CTACCTCGCG CACGTGCTGA CGAGCGACGA GGACGCGGGA
CGGAGGGAGG ACGTGCGACA GAGCGCGGGA CTGCTGTTGA AGAATAATCT GAAGACGTCG
TGGACGACGA CGATGAGCGA AGAGTACAGG ACGTACGTGC GAGAAACGCT GCTGAGAGCG
CTGGGACACC CGTCGAGATT GATCAGGGGG ACGTGCGGGA CGTGCGTGGC GGTGATCGTG
CGGTGCGGGG GGGTTGAAAA TTGGGGCGAC CTGTGGCCGA CGCTCGTGAG AGCGGTCGAG
GCGGGGGACG AGAATTCTCG AGACGGTGCC CTGGGCGCGC TGTACAAGGC GTGCGAGGAG
GTGAACGGGA GATTGGACGT CAAAGTGCCC GGGTTGCCGG ATTCGCCCGC GGGGATGGTG
ATTCCGCGAC TGTTCGCGTT GTTCTCCTCG CCGGCGGCCA AGGTGCGCCA GCAAGCGGTC
GGCGTGGTGA ACATGATCGC GCCGTGCTGG CCGGAAAACC ACTACGCGCT GTTGGATAGT
TACTTGCAAG GATTGTTTTC GCTCGCGAAC GATCCGGACA ACGACGTTCG AAGGCTGGTG
TGCTCGGGCT TGGTCATGTT GATTCACATT TGTCCAGAAA AGTTGGCGCC GAATTTGCGG
GAAATCATCG TGTACATGCT CGAAAGGCAG GACGACGAGG ATAAAGACGT CGCCATGGAG
TCGTGCGAGT TCTGGGGAGC GTTCTGTGAG GCCGAACTGG GTGATGATTA TGTGCAAATT
TTGCGCGAGT TTACGCCGAG ACTCATCCCA GTGTTGTTGA CAAACATGGC GTACACGGAG
GACGACGAGG AAGTGATTTC GGCCGAGGAC GACGAAGTGA ACGTGGGAAG GGAAGACAGA
GATCAAGACA TCAAACCCAC GTTTCGTGAT ACCAAGGATA AAGGCTCACA AGGAGAAGGA
GAGGATGACG GCCAAGACGA CAGCGATGAC TTCGTGTGGA ACCTCCGAAA GAGTTCTGCA
AATGGTTTGG ACATCCTGTC GAACGTCTTT GGCGACGAGC TTCTGCCTCT TCTGCTACCC
GTCGTCGAGC AGAGACTACG CGAGTCGAGG TGGGAGATCC GTGAGAGCGC TATTTTGGCC
CTTGGCGCCG TCGCTGAGGG CTGTTCAGGC GGCTTGCTGC AATACCTCCC GATGCTAATC
AATTTCCTCT TACCGATGCT GGATGACGCC CGTCCGTTGG TGCGCTCAAC GACTTGCTGG
ACGCTCAGTC GATTCTCTCG ATGGACGTTG CAGTGCGCGA GGCCGTCGAA CGATCCAAAC
GCGATGCCCC AGCAGCAAGG TATGGAGCAG CTCAACACGC TGACGACAGC GCTTTGCAAG
AGGTGCTTGG ATCACAACAA ACACGTTCAA GCCGCAGCTT GCGGCGCGAT CGCGACCCTT
CTCGCCGAAG GTCAAGACAC GCTGGCGCCT TGGACCGAAA CTATCGTGCA GACGCTCACC
CAGGCGCTGG CTACATACCA GCGGAAGAAT ATGCGCAACT TGTACGACGC ACTGACCATG
CTCGCGGAGA ATATCGGTCC GTCGATTGAG GATGCGCGGT ACGCCGGTGC GATCTTACCA
GGAATGCTTC AAAAGTGGGA GAATGCGAAC AAGGTGGACC CTGAGCTGTA TCATTTGCTC
GAGTGCCTCA CGGCGATAAT CGTCGGCCTC GGGCAGGCAT CGGCCGAGTT CTCGTCTGGG
ATTTTCGCAA AATGCATTTC CGCTTTGACA TACCAGCTTC AGCAGCGCAC TGCAGTGCAA
CGCGGCGAGA TGCCAGCCGA AGAGTACGCA ATCGACATCG TCATTTGTAC CTTGGACTTG
CTTTCTGGTT TATGCGAAGG CATGGGACAA GCCATCGAGC CGCTCGTCGC GCAGTCGCCT
ATTCGAGATA TTCTCATCGC TTCGTGCATG GATGAGTCCC CAGGAGTCAG ACGTAGCGCA
TTCGCACTCG TGGGCGATCT CACACGTTCG AGTACTGCGC ACTTGACTCC GTCTTTGCAA
CAGTTGATGG AGCTCATTGT TGCGCAGTTG CAGCCAGCGA TGGTCATATC CATGAACATG
TCTGTATGCA ATAACGCAAG CTGGGCCGCC GGCGAGATCG CCATTCGAAC GTCAAGCGAC
GTATTGCGTC CATTTGTAGC GCCACTGGCG CAATGTTTGG TTCAAATTCT CGACATGCGA
ATGGTGAACA GAGCCCTTGG CGAAAATGCC GCCATAAGTC TTGGTCGACT TTCGATGACG
TGTCCTGAAG AATTACAAGG TGGTCTCGCG CATTTCATCA CGTCTTGGTG CTCTGCTCTG
AGACGACTTC GCGATGGCGT TGAAAAAGAA CACGGCTTCA TGGGGCTTTG CAAGTTGATT
CAGATGAATC CGTCGGGCGC GACGAGTGGT TTGAGCGCAT TTGTCGAGGC CGTTGCGTCG
TGGAGACAGT GCCGCAACAA TGAACTCGTC GCGACCATGG GTCAACTTGT GCGCGGCTTC
AAGGATCACG TTGGGACCGA CCAGTGGGCG ATGGTTGTAC GGGATCTCGA ACCTGGTGTG
ATGAGAAAAT TAGCTGAGCA GTACGGCGTT TAAGTGGAGG AAGAGGAAGA GGAAGAGGAA
GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA
GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA
GAGGAAGCGG TTGCATGAGG ACGCAGCCAG ATTGGAAAAG GTCAGGGTAA GGACTTTTAG
GAATTGTTTA GGAAGTAGGG AGATGGTTGA AGGAGAGATA TGGGATTTGG AAGAGGATTG
GAAGAGGATT GGAAGAGGAT TGGAAGAGGA TTGGAAGAGG ATTGGAAGAG GATTGGAAGA
GGAAAAGAAA ACATCGCGCA GCGAGCTC
 
Protein sequence
MATAWTPNGD GAARIIQMIA EYLDPRANQR EMLGRLEQCA GFPDFNNYLA HVLTSDEDAG 
RREDVRQSAG LLLKNNLKTS WTTTMSEEYR TYVRETLLRA LGHPSRLIRG TCGTCVAVIV
RCGGVENWGD LWPTLVRAVE AGDENSRDGA LGALYKACEE VNGRLDVKVP GLPDSPAGMV
IPRLFALFSS PAAKVRQQAV GVVNMIAPCW PENHYALLDS YLQGLFSLAN DPDNDVRRLV
CSGLVMLIHI CPEKLAPNLR EIIVYMLERQ DDEDKDVAME SCEFWGAFCE AELGDDYVQI
LREFTPRLIP VLLTNMAYTE DDEEVISAED DEVNVGREDR DQDIKPTFRD TKDKGSQGEG
EDDGQDDSDD FVWNLRKSSA NGLDILSNVF GDELLPLLLP VVEQRLRESR WEIRESAILA
LGAVAEGCSG GLLQYLPMLI NFLLPMLDDA RPLVRSTTCW TLSRFSRWTL QCARPSNDPN
AMPQQQGMEQ LNTLTTALCK RCLDHNKHVQ AAACGAIATL LAEGQDTLAP WTETIVQTLT
QALATYQRKN MRNLYDALTM LAENIGPSIE DARYAGAILP GMLQKWENAN KVDPELYHLL
ECLTAIIVGL GQASAEFSSG IFAKCISALT YQLQQRTAVQ RGEMPAEEYA IDIVICTLDL
LSGLCEGMGQ AIEPLVAQSP IRDILIASCM DESPGVRRSA FALVGDLTRS STAHLTPSLQ
QLMELIVAQL QPAMVISMNM SVCNNASWAA GEIAIRTSSD VLRPFVAPLA QCLVQILDMR
MVNRALGENA AISLGRLSMT CPEELQGGLA HFITSWCSAL RRLRDGVEKE HGFMGLCKLI
QMNPSGATSG LSAFVEAVAS WRQCRNNELV ATMGQLVRGF KDHVGTDQWA MVVRDLEPGV
MRKLAEQYGV
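
Both sequences are printed in blocks of 10 characters. The following is a minimal Python sketch for removing that layout and translating a DNA excerpt. It assumes the standard genetic code, because the Translation table field in this record is empty. The helper names `strip_blocks` and `translate` are hypothetical. The two excerpts are copied from the gene sequence as displayed: its first line, and the start of the line where the displayed protein ends.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def strip_blocks(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first in-frame stop codon."""
    dna = strip_blocks(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCGACGG CGTGGACCCC GAACGGGGAC GGGGCGGCGC GAATCATTCA GATGATCGCC"
print(translate(first_line))  # MATAWTPNGDGAARIIQMIA

end_excerpt = "ATGAGAAAAT TAGCTGAGCA GTACGGCGTT TAAGTGGAGG"
print(translate(end_excerpt))  # MRKLAEQYGV
```

The first result matches the first 20 residues of the protein sequence, and the second matches its last 10 residues, with translation ending at the TAA codon.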