Gene Information |
Locus tag | OSTLU_46552 |
Symbol | |
ID | 5003826 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 491391 |
End bp | 493407 |
Gene Length | 2017 bp |
Protein Length | 525 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419247 |
Product | predicted protein |
Protein accession | XP_001419614 |
Protein GI | 145350442 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5064] Karyopherin (importin) alpha |
TIGRFAM ID | |
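
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive positions on the replicon; the helper names `span_length` and `coding_bases` are hypothetical and not part of any database API. It compares the gene span with the number of bases needed to encode the reported protein plus one stop codon. Because the gene is marked as spliced, the difference is taken here to be non-coding positions within the span (introns and any untranslated flanks), which is an interpretation and not a value the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the record above.
start_bp, end_bp = 491391, 493407
protein_aa = 525

gene_len = span_length(start_bp, end_bp)   # matches the listed Gene Length
cds_len = coding_bases(protein_aa)         # codons for 525 aa plus a stop
noncoding = gene_len - cds_len             # assumed introns / untranslated flanks

print(f"gene span: {gene_len} bp")
print(f"coding bases (with stop): {cds_len} bp")
print(f"non-coding positions in span: {noncoding} bp")
```

The computed span agrees with the listed Gene Length of 2017 bp, which is consistent with the inclusive-coordinate assumption.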
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.105236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.308215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTTC GCCCGGGGTC CAAGGCGAGC GAACGGAAGA AGGGCTTCAA GAAAGCCATC GACGCCGATG AGGCGCGACG GAAGCGCGAA GGTGCGCGGG GTGACGCGCG CGAGCGATGA CGCGCGCGGA CGGGCGCGGA GACGGCGCGC GGGCGCGAAG CGAGGGAGAT TAAGTCGCGG TGGGATGATC TGGCGGCACG AAAACGGTGG ACGCGCGCGC GCGCAGCGCG CGCGCGAGGC GACGGCGACG CGGAGACGAC GAAGGACTGA CGATGCACGC GCGCGCGCGG TGGTCGGCGA ACGCAGATAA CATGATTCAG ATTCGTAAGG ATAAGCGCGA GGAGGCGATG ATGAAGAAGC GAAGGTGCGC GCGAGACGAC GACGCGCGCG AGGCGGAAAA CGCGAACGCG CGGTGATTCA TCGGACGGTG ACTGACGAAG TGGGTTTATG ATGACGCAGG GACGGTGCGA CTGGAAGCGT GGCGTCGGAT TCGACGGCGA TGACGGGTTC GCCGGGCGGG GGATCGGTGC AGAGCAAGCT CGCGCAGTTG CCGCAAATGC TTGAAGCGCT GAAGAACCCG GATCCGAACG TGCAACTCGA GGCAACGATT GCGTTTCGTA AACTTCTTTC CATCGAGCGA TCGCCGCCGA TCGATCAGGT GATTGAGACC GGCGCGACGC CGTTTTTCGT CGAGTTTTTA AAGCGTACCG ATGTTCCAAA GTTGCAGTTT GAAGCCGCGT GGGCGCTGAC GAACATCGCC TCTGGTACGA GCGAGCACAC GGCGATCGTG ATCGATCATG GGGCTGTGCC CATCTTTATC GCGCTGTTGG GTTCAGACAA CCCCGACGTG CGCGAGCAAG CGGTTTGGGC GCTCGGGAAC ATCGCGGGCG ACAGTCCGCG GTGCCGCGAC TTGGTGTTGC ACGCCAACGC GTTGCATCCG CTCCTCGCGC AACTCAACGC CGAAGCGAAG ATCCAGATGC TTCGTAACGC GACGTGGACT TTGTCGAACT TTTGCCGCGG TAAACCTCAG CCTGACTTTA GCGCGTTGCG AGCGGCGCTT CCGGCGCTCG CTCGCTTGGT GCACTCGAAC GACGAGGAGG TGCTCACTGA TGCGTGCTGG GCGCTCTCGT ACCTTAGCGA CGGTACGAAC GACAAGATCC AGGCCGTTAT CGAGGCTGGA GTGTGCCGAC GTCTCGTGGA GCTCTTGGCG AGCAACCATC CGAGCGTGTT GATTCCGGCG CTTCGAACGG TTGGTAACAT CGTGACCGGC GACGACTATC AGACTCAAAT CATCATCAAC TGCCACGCTC TGAAGGCGTT GCTCGGATTA TTGGCGGGAG ACTACAAGAA GAGCATTAAA AAGGAAGCGT GCTGGACGAT CTCGAACATC ACCGCCGGTA ACAAGGACCA AATCCAAAGC ATCATTGACG AGCAAATGGT GCCGCCGTTG GTCGAGTTGC TCGCCAACGC CGAATTCGAT ATTAAGAAGG AGGCGGCCTG GGCGATTTCT AACGCCACGA GCGGTGGTAC GCATCAACAG ATCAAGTATC TCGTCAGCCA AGGGTGCATC AAGCCGCTGT GTGATCTCAT CAACTGCAGC GACGCGCGCA TCGTCACCGT CGCGCTCGAG GGGTTGGAGA ACATTTTAAA AGTCGGCGAG GCTGATCGCG GCGACAACAT GGAAGCCCCG AACGTTTTCG CGCAATACAT CGACGAAGCC GAGGGGTTGG AGAAGATCGA ATCGTTACAG AATCACACCA ACGACGACAT TTACCAAAAG GCGATGCGTC TTTTGGAGAC 
GTATTTCGGT TTAGAGGACG ACGACGCGCA AAACCTCATG CCCGAGGTCC AGGGCGACCA GTTCGCCTTC GGCGCCGGCG CGCCCACGGG CGGGTTCAAC TTTTAGACTT TTTTCGCGCG ATGGATGAAA CGACTTTTTT CGCGCGATGG ATGAAACGAC GACGACGACG ACGACGACGA CGACGACGAC GATCACTAGC TCTTGAAATC AACGCAA
|
Protein sequence | MSLRPGSKAS ERKKGFKKAI DADEARRKRE DNMIQIRKDK REEAMMKKRS VASDSTAMTG SPGGGSVQSK LAQLPQMLEA LKNPDPNVQL EATIAFRKLL SIERSPPIDQ VIETGATPFF VEFLKRTDVP KLQFEAAWAL TNIASGTSEH TAIVIDHGAV PIFIALLGSD NPDVREQAVW ALGNIAGDSP RCRDLVLHAN ALHPLLAQLN AEAKIQMLRN ATWTLSNFCR GKPQPDFSAL RAALPALARL VHSNDEEVLT DACWALSYLS DGTNDKIQAV IEAGVCRRLV ELLASNHPSV LIPALRTVGN IVTGDDYQTQ IIINCHALKA LLGLLAGDYK KSIKKEACWT ISNITAGNKD QIQSIIDEQM VPPLVELLAN AEFDIKKEAA WAISNATSGG THQQIKYLVS QGCIKPLCDL INCSDARIVT VALEGLENIL KVGEADRGDN MEAPNVFAQY IDEAEGLEKI ESLQNHTNDD IYQKAMRLLE TYFGLEDDDA QNLMPEVQGD QFAFGAGAPT GGFNF