Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25624 |
Symbol | |
ID | 5005767 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 414920 |
End bp | 417989 |
Gene Length | 3070 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 53% |
IMG OID | 640421188 |
Product | predicted protein |
Protein accession | XP_001421657 |
Protein GI | 145354786 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5656] Importin, protein involved in nuclear import |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.925958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.511141 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCACGC GGCAGAGCGC GAGCATTTAC TTCAAGCATC TCGTCAATAA GTCGTGGACG CAGCGCGAAG GCGCGACGGC GACGACGGAG ACGAATCCGA TCTTGGACGA GGGCGACAAA GCGGCGGTTC GACGCGTCGC GCTGGAGGCG ATCGCGAACA CGCCGAGCAA GGTGCGAAGT CAGCTCGTGG AGGCGGTGCG AGTGATCGTT CATCATGATT TCCCCGGGCG TTGGCCGGAG GTGGCGAATC AGGTGCTGGA TGGGTTGAAC GCCGCGTCGT CGAGCGAGAG CGGAAAGCTG TGCGGGACGG TGTTGGTGTT GCACGCGCTG TGTCGAAAGT ATGAGTTCAA GGCGGTGGAT GAGCGAGCAG ACATCGAAGA GATGATACGC GTGGTGTTTC CAAAGTTGCT GGAGATTTTG AAGGCGTTGC TTGCGTATCA AGGCCCGCCG GACACGGAAT TGGAGGAGTT GAAGAAGGCG ATTTGCAAGA CGTACTTGAG CGCAACGTAT CTTAACGTCG GACCGTCTTT GCGCGAGGAA GGGACGTTTC GGGAATGGAT GGCGGCGTTT CACGCGATCA TCACCGCGCC AGTGCCGACA GAGAATATGC CGACGGACGA TAAGACTGAG TTGAAACATT GGCCGTGGTG GAAGACGAAG AAGTGGGCGA TGCACGTCGT CAATCGGATG TTCAATAGAT ACGGCAACTT GAAAAAGTGC CAACCTCACG ACAAGGCTCA AGCGACGGTG TATCGTGACA AATACGCTGG ACACTTTGTG ACAGTATACA TCCAATTACT GAGCTCCCTC GCCACGGGTG CGGTGATGCC CGACCGAGTC GTCAATCTCG CCGTGCACCA TCTCTCCACG GCGTTGGGGG TTCCGACGAT GTACAAGCAC ATGGAGCCGC ACCTCGATGC AATCTTCCAG CAAATCGTAT TTCCAATGCT ATGTTTCAGC GCTGAGGACG ACGAGCTGTG GAAGGATGAT CCCCAAGAGT ACGTTCGAAA ATCACAAGAT CTCATAGAAG ACATGTATTC GCCACGAACC GCGGCGTGCA GTTACACACA AGAATTGGTG ATTACCGGCA GGCGTCTGAA GGAAAACTTG CCAAAGGTGT TAGGCGCGAT GGTTCAAATA TTCACCAAAA ACTCTTCCAG CGTCAGATCC GGGCCGATGG ACGCTAGAGC GCGATACGAA CTCGATGGCG CGCTGCTCGT CATCACCACC CTGTCGCAAC TTTTATCCAC GCACCCGGAT TACGCAAAGG AAATCGAAGG TATGCTCATG ACGCATGTTG TTCCGGCATT TGGTTGTGTA CATGGTCATA TTCGCGCCAA GGCGGTATCG TGCGTATCAA AGTATTCGGA TATCACGTTC CGAGACCAGA ATAACTTTAT GCAGCTGTTT TCGAGCGTCG TAAATGCGAT GAAGGATCCC GAAATTCCGG TGAGGTTCGA GGCAGTCGTT GGGCTCGGAG CTTTTGTGCA AGCCACAGAC GACGTGAGCG CGCTGAAGGG TATTCTACCA CAGTTGTTAG ACGAGTTTTT CAAGCTCATG AACGAAGTGG AGTCGGAAGA TGTTGTGTAC ACGCTTGAAA CAATCACCGA AAAGTTTGGC GAAGACATCG CGCCCTTCGC TTTGGGCATG ACGCAGAACC TCGCCGCAGC GTTCTGGAAG GTTGTGCAAG AAGCTGAAGG AAAGGATGAC GATGAGTACG GCATGATGGC ATGCATGGGA TGTCTGCGCG CCATGTCGAC GATTCTCGAA TCCGTTTCTA GTTTACCGCA CATGTACCCC GAACTTGAAG CCGCCGTGTT CCCAATTTTG CATAAAATGA TTAGCGAAGA AGGATACGAC GTGTTTGAGG AAGTTTTGGA GATATTGTCC TACCTAACGT ATTTCACTCC GGTCGTGACA CCACGCATGT GGGAACTCTG GCCGCTGATG ATGCGCATGA TGGATGACTG GGCGCTGCAA TATTTCGAGA ATATGCTCAT TCCGCTCGAC AACTACATCA GTCGAGGCAC GGAGCACTTT CTCACCCCTG GCTCGAGTTA CGTGGAAGAT ACGTATAAGC TGTGCGAAAA GGTATTAGGA GGCGATTATC CCGAGCCCGA TTGCTTGCCA GCGCCAAAGT TGATGGAGTG CGTGATGACG AATTGCCGCG GCCGCGTCGA CGTCGTCATC GAACCGTACG TGAACATTGC ACTTGCGTGT CTCGCGACTG CGGAGTTGCC ACACTTTAGG GATTGGCTCA TGATGACTTA CGCGCATGCA CTTCATTATA ACGCATCGCT TGCGCTGGCA GCGACGAACC GAACTGGCAA AACGAACGAA GTGTTCGCGT TGTGGTCGAA CATGCTCGCC GAGCGCAGGA AGAGTGGAGA ACGTAAAAAC TTCACCTCTG AACACGCCAA GAAAGTTTGC GCGTTGGGAC TCATGGCGCT TTTGCAAGCG CCGGCAGAGT CGTTAACACC TGAGATTCGA GGTGCGCTCG GAGGTATTCT TGACACGCTC ATCTCTTTAC TCGAAGATTT GCGATTGCAA ATCACGGAGC GCAAGTCGGA TGAAGCGAGC GGCAAGAGTA GGCATCAATG GAATGGCCTT GGCCTCTTCG ACGGCGAAGA CGAATACGAG GAACACAACT TTGACGAAGA GGACGATGAT GGAGAAATGC ACTTTGACGC GACGACGCTC CGCGCGTTGG CGAAGCAAGC GCAAGACGCT GATCCGTACT CTCGCGCCGG CGACGTCGAC TCAGAAGACG ACGAGCATTT CTTCTTCGAC GACGATGACG ATTCGTGCCA AAGCCCGCTA GACGACATTG ACACGTTCAT CGTATTTTCG GAGTGCATGA ATCAGTTGCA TCGCACCGGT AGTCTCAATC CGAGCGCTGA ATCTCATTCA AAATTGCAGG AGTTGATCAA TCACGCCGCG ATTCGTGCCG AAGAGTTCCC TCGAGAGCGC GCGGAGGCGA AGAAGGAATC ACACGGCGTG TCTGGTAGTT GAACGAGTAC CAGCATACCG GCGGTACGTT AGACTGATAA TTATGTGAAT ATACTTAGTA
|
Protein sequence | MGTRQSASIY FKHLVNKSWT QREGATATTE TNPILDEGDK AAVRRVALEA IANTPSKVRS QLVEAVRVIV HHDFPGRWPE VANQVLDGLN AASSSESGKL CGTVLVLHAL CRKYEFKAVD ERADIEEMIR VVFPKLLEIL KALLAYQGPP DTELEELKKA ICKTYLSATY LNVGPSLREE GTFREWMAAF HAIITAPVPT ENMPTDDKTE LKHWPWWKTK KWAMHVVNRM FNRYGNLKKC QPHDKAQATV YRDKYAGHFV TVYIQLLSSL ATGAVMPDRV VNLAVHHLST ALGVPTMYKH MEPHLDAIFQ QIVFPMLCFS AEDDELWKDD PQEYVRKSQD LIEDMYSPRT AACSYTQELV ITGRRLKENL PKVLGAMVQI FTKNSSSVRS GPMDARARYE LDGALLVITT LSQLLSTHPD YAKEIEGMLM THVVPAFGCV HGHIRAKAVS CVSKYSDITF RDQNNFMQLF SSVVNAMKDP EIPVRFEAVV GLGAFVQATD DVSALKGILP QLLDEFFKLM NEVESEDVVY TLETITEKFG EDIAPFALGM TQNLAAAFWK VVQEAEGKDD DEYGMMACMG CLRAMSTILE SVSSLPHMYP ELEAAVFPIL HKMISEEGYD VFEEVLEILS YLTYFTPVVT PRMWELWPLM MRMMDDWALQ YFENMLIPLD NYISRGTEHF LTPGSSYVED TYKLCEKVLG GDYPEPDCLP APKLMECVMT NCRGRVDVVI EPYVNIALAW IGS
|
| |