Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29487 |
Symbol | |
ID | 5006612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 327762 |
End bp | 330849 |
Gene Length | 3088 bp |
Protein Length | 910 aa |
Translation table | |
GC content | 56% |
IMG OID | 640422033 |
Product | predicted protein |
Protein accession | XP_001422715 |
Protein GI | 145357009 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5215] Karyopherin (importin) beta |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.505299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00488082 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGACGG CGTGGACCCC GAACGGGGAC GGGGCGGCGC GAATCATTCA GATGATCGCC GAATACTTGG ACCCGCGCGC GAATCAGCGA GAGATGCTGG GCAGGCTGGA GCAATGCGCG GGGTTTCCGG ACTTTAATAA CTACCTCGCG CACGTGCTGA CGAGCGACGA GGACGCGGGA CGGAGGGAGG ACGTGCGACA GAGCGCGGGA CTGCTGTTGA AGAATAATCT GAAGACGTCG TGGACGACGA CGATGAGCGA AGAGTACAGG ACGTACGTGC GAGAAACGCT GCTGAGAGCG CTGGGACACC CGTCGAGATT GATCAGGGGG ACGTGCGGGA CGTGCGTGGC GGTGATCGTG CGGTGCGGGG GGGTTGAAAA TTGGGGCGAC CTGTGGCCGA CGCTCGTGAG AGCGGTCGAG GCGGGGGACG AGAATTCTCG AGACGGTGCC CTGGGCGCGC TGTACAAGGC GTGCGAGGAG GTGAACGGGA GATTGGACGT CAAAGTGCCC GGGTTGCCGG ATTCGCCCGC GGGGATGGTG ATTCCGCGAC TGTTCGCGTT GTTCTCCTCG CCGGCGGCCA AGGTGCGCCA GCAAGCGGTC GGCGTGGTGA ACATGATCGC GCCGTGCTGG CCGGAAAACC ACTACGCGCT GTTGGATAGT TACTTGCAAG GATTGTTTTC GCTCGCGAAC GATCCGGACA ACGACGTTCG AAGGCTGGTG TGCTCGGGCT TGGTCATGTT GATTCACATT TGTCCAGAAA AGTTGGCGCC GAATTTGCGG GAAATCATCG TGTACATGCT CGAAAGGCAG GACGACGAGG ATAAAGACGT CGCCATGGAG TCGTGCGAGT TCTGGGGAGC GTTCTGTGAG GCCGAACTGG GTGATGATTA TGTGCAAATT TTGCGCGAGT TTACGCCGAG ACTCATCCCA GTGTTGTTGA CAAACATGGC GTACACGGAG GACGACGAGG AAGTGATTTC GGCCGAGGAC GACGAAGTGA ACGTGGGAAG GGAAGACAGA GATCAAGACA TCAAACCCAC GTTTCGTGAT ACCAAGGATA AAGGCTCACA AGGAGAAGGA GAGGATGACG GCCAAGACGA CAGCGATGAC TTCGTGTGGA ACCTCCGAAA GAGTTCTGCA AATGGTTTGG ACATCCTGTC GAACGTCTTT GGCGACGAGC TTCTGCCTCT TCTGCTACCC GTCGTCGAGC AGAGACTACG CGAGTCGAGG TGGGAGATCC GTGAGAGCGC TATTTTGGCC CTTGGCGCCG TCGCTGAGGG CTGTTCAGGC GGCTTGCTGC AATACCTCCC GATGCTAATC AATTTCCTCT TACCGATGCT GGATGACGCC CGTCCGTTGG TGCGCTCAAC GACTTGCTGG ACGCTCAGTC GATTCTCTCG ATGGACGTTG CAGTGCGCGA GGCCGTCGAA CGATCCAAAC GCGATGCCCC AGCAGCAAGG TATGGAGCAG CTCAACACGC TGACGACAGC GCTTTGCAAG AGGTGCTTGG ATCACAACAA ACACGTTCAA GCCGCAGCTT GCGGCGCGAT CGCGACCCTT CTCGCCGAAG GTCAAGACAC GCTGGCGCCT TGGACCGAAA CTATCGTGCA GACGCTCACC CAGGCGCTGG CTACATACCA GCGGAAGAAT ATGCGCAACT TGTACGACGC ACTGACCATG CTCGCGGAGA ATATCGGTCC GTCGATTGAG GATGCGCGGT ACGCCGGTGC GATCTTACCA GGAATGCTTC AAAAGTGGGA GAATGCGAAC AAGGTGGACC CTGAGCTGTA TCATTTGCTC GAGTGCCTCA CGGCGATAAT CGTCGGCCTC GGGCAGGCAT CGGCCGAGTT CTCGTCTGGG ATTTTCGCAA AATGCATTTC CGCTTTGACA TACCAGCTTC AGCAGCGCAC TGCAGTGCAA CGCGGCGAGA TGCCAGCCGA AGAGTACGCA ATCGACATCG TCATTTGTAC CTTGGACTTG CTTTCTGGTT TATGCGAAGG CATGGGACAA GCCATCGAGC CGCTCGTCGC GCAGTCGCCT ATTCGAGATA TTCTCATCGC TTCGTGCATG GATGAGTCCC CAGGAGTCAG ACGTAGCGCA TTCGCACTCG TGGGCGATCT CACACGTTCG AGTACTGCGC ACTTGACTCC GTCTTTGCAA CAGTTGATGG AGCTCATTGT TGCGCAGTTG CAGCCAGCGA TGGTCATATC CATGAACATG TCTGTATGCA ATAACGCAAG CTGGGCCGCC GGCGAGATCG CCATTCGAAC GTCAAGCGAC GTATTGCGTC CATTTGTAGC GCCACTGGCG CAATGTTTGG TTCAAATTCT CGACATGCGA ATGGTGAACA GAGCCCTTGG CGAAAATGCC GCCATAAGTC TTGGTCGACT TTCGATGACG TGTCCTGAAG AATTACAAGG TGGTCTCGCG CATTTCATCA CGTCTTGGTG CTCTGCTCTG AGACGACTTC GCGATGGCGT TGAAAAAGAA CACGGCTTCA TGGGGCTTTG CAAGTTGATT CAGATGAATC CGTCGGGCGC GACGAGTGGT TTGAGCGCAT TTGTCGAGGC CGTTGCGTCG TGGAGACAGT GCCGCAACAA TGAACTCGTC GCGACCATGG GTCAACTTGT GCGCGGCTTC AAGGATCACG TTGGGACCGA CCAGTGGGCG ATGGTTGTAC GGGATCTCGA ACCTGGTGTG ATGAGAAAAT TAGCTGAGCA GTACGGCGTT TAAGTGGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGAGG AAGAGGAAGA GGAAGAGGAA GAGGAAGCGG TTGCATGAGG ACGCAGCCAG ATTGGAAAAG GTCAGGGTAA GGACTTTTAG GAATTGTTTA GGAAGTAGGG AGATGGTTGA AGGAGAGATA TGGGATTTGG AAGAGGATTG GAAGAGGATT GGAAGAGGAT TGGAAGAGGA TTGGAAGAGG ATTGGAAGAG GATTGGAAGA GGAAAAGAAA ACATCGCGCA GCGAGCTC
|
Protein sequence | MATAWTPNGD GAARIIQMIA EYLDPRANQR EMLGRLEQCA GFPDFNNYLA HVLTSDEDAG RREDVRQSAG LLLKNNLKTS WTTTMSEEYR TYVRETLLRA LGHPSRLIRG TCGTCVAVIV RCGGVENWGD LWPTLVRAVE AGDENSRDGA LGALYKACEE VNGRLDVKVP GLPDSPAGMV IPRLFALFSS PAAKVRQQAV GVVNMIAPCW PENHYALLDS YLQGLFSLAN DPDNDVRRLV CSGLVMLIHI CPEKLAPNLR EIIVYMLERQ DDEDKDVAME SCEFWGAFCE AELGDDYVQI LREFTPRLIP VLLTNMAYTE DDEEVISAED DEVNVGREDR DQDIKPTFRD TKDKGSQGEG EDDGQDDSDD FVWNLRKSSA NGLDILSNVF GDELLPLLLP VVEQRLRESR WEIRESAILA LGAVAEGCSG GLLQYLPMLI NFLLPMLDDA RPLVRSTTCW TLSRFSRWTL QCARPSNDPN AMPQQQGMEQ LNTLTTALCK RCLDHNKHVQ AAACGAIATL LAEGQDTLAP WTETIVQTLT QALATYQRKN MRNLYDALTM LAENIGPSIE DARYAGAILP GMLQKWENAN KVDPELYHLL ECLTAIIVGL GQASAEFSSG IFAKCISALT YQLQQRTAVQ RGEMPAEEYA IDIVICTLDL LSGLCEGMGQ AIEPLVAQSP IRDILIASCM DESPGVRRSA FALVGDLTRS STAHLTPSLQ QLMELIVAQL QPAMVISMNM SVCNNASWAA GEIAIRTSSD VLRPFVAPLA QCLVQILDMR MVNRALGENA AISLGRLSMT CPEELQGGLA HFITSWCSAL RRLRDGVEKE HGFMGLCKLI QMNPSGATSG LSAFVEAVAS WRQCRNNELV ATMGQLVRGF KDHVGTDQWA MVVRDLEPGV MRKLAEQYGV
|
| |