Gene Information |
Locus tag | OSTLU_17095 |
Symbol | |
ID | 5004162 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 363789 |
End bp | 368482 |
Gene Length | 4694 bp |
Protein Length | 1428 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419583 |
Product | predicted protein |
Protein accession | XP_001419980 |
Protein GI | 145351215 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5308] Nuclear pore complex subunit |
TIGRFAM ID | |
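The coordinate and length fields above agree with each other if the coordinates are 1-based and inclusive. The record does not state this convention, so it is an assumption here. The sketch below checks that arithmetic. It also estimates how much of the gene span is non-coding, assuming the span runs from the start codon through the stop codon, which would fit the "Is gene spliced | Yes" field. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 363789
END_BP = 368482
PROTEIN_LENGTH_AA = 1428


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    span = gene_length(START_BP, END_BP)
    cds = coding_length(PROTEIN_LENGTH_AA)
    print("gene span (bp):", span)
    print("coding bases incl. stop (bp):", cds)
    # ASSUMPTION: the span starts at the start codon and ends at the stop codon,
    # so the remainder is attributed to introns.
    print("non-coding bases within span (bp):", span - cds)
```

Under these assumptions the span matches the listed Gene Length, and the difference between the span and the coding length gives a rough estimate of total intron length.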
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0366064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0215863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGACGC GCGATCGGGA CGCAGACGCG CGAGAGGCGA TGGAGACGGC GGCGAAGACG GTGTCGAAGC TCGCGGCGGA CGCCGAGCAA GCCGACTTGG TGGAACTGCT TCGAGAGGCG CCTCGGGAGG CGTATTCGTT TCAAAACGCG GGATGGCCGA GCGATGTGGT GGGTATGAAG AAGGAGGAAC TGCCGGGGGT GGTGCTGGAG CGGTACAACA CGCGGCAGTC GGTGTGCTTT TGCGGGGTGC TGCCGGAGAT TTCGCGCGCG TGGGCGAGCG TGGACAACGC GTTGTTTTTG TGGAGGTTGG ACGTCGCGGA CGACGTGCCG GTGGAGTACA GCGGCGAAGA GCAAGCGATC GTGGCGGTGG GGTTGGTGAA ACCGAAGAGT GGGGTGTTTT TAGAGGCGAT CTCGTACGTG CTCGTCATCG CGACGACGGT GGAGCTCGTC ATGGTGGGGG TGTGCTTGGA GGACGACGGA CGCGAGCTCA CGCTGCATCC GCTGCAGTAC TCGTGCCCGA CGGACGCGAC CATCATGAAC GACATCACGT CTACGCCGGA GGGTAGGATC TTCTTAGCCG GTGCGGACGA AGCGCTTTAT GAACTCGTGT ACGCGCAGAG CGACACGTGG CACAGCAAGC GATGCAAAAA GGTGCGACAC AGTCAAAACT TGTCCTCGCT GTTGCCTTCG GTGCTGCGAT TGAAGGGCTC GGACGCTTTG AAACAAGTCG TCGTGGACGC CAAGCGCGGG ATTTTGTACA CGCGCAGCGA ACAAGGCGTG GTGGTTGTGT ACGACGTCGG CGCCGCGGCG AAGGACGCGC CGAGAAAAGT CGCCGAGGTT AAAAACGTCG CGCAACTCGC CGCGCAGGCT CGCGGTCAAG GGTCTCTGTT TGCATCGGCG ACGTCATCGG TGAAGAAAGG GGCGAAGTTA GTGCACGTCG CATTGGTGCA TCCGGAGGAG TCGTCCGTCG TGACGTTGGT GGCGATTTGC GCGGATGGAA GACGCATTTA CCTCACCGCT CTTCCGCCGT CGCGCGGGTA CTCGTACGGC GTCGCCTCGG GTACGGGTAG CTCTCGACAA GGGCCTTCGC GACTGTCCGT GGTCGAACAA CGCGACCCGC CGCCGCAAGG AGGAAATCAG CGCGGTATGA CGACGGCGCA GGCGCTCTTG AACACCACGT CGAGGGCCTT GGAAATTGAA GCCGGGTTTT ACAGCAGCGG CGTTTTGTTA CTGTCGGACG CGACGCCGAA CGACTCTGAC GCGCGATTGA TACTTTCAAA CCGCGACCTG GCGCTACCCC CACACTTGCA GTTGCCGCCG CCGACGCCCC CACCGAGCTC GGGAAGCGGC ACTCGCGGTT TGCGCGAAGT CGTCACCTTG GAACAACTTG AAGGGCGCTG CGCGTCGAGT CTCGGATCGC TCGGTGAGAT TCCTATGCCA AAGAGCGTAC AGGACGCGAT CGATCCGCCA TACCCCACGG GGACGCTTCC AGAAGCACGT GTAAAATCGA CGGGACTACT CTCTGAGCTC GTCACGCAAT ACATGTGCCC GAGAAGAACG TTTGTGTTGA TGACCAACGC CGGGTTGGTT CGATTCGAGA AGGCGCGCCC GATTGACACG TTGAGAAGTG TTCTTGAAAA GAACATTCCG GAACAAATCG AGGAGTTCTT TAAGAGTTAC GGACCGATTG AAGCTGCGGC AATGTGCGTC GCGTTATCTG TTTCGGGATC CGAATCCAAC GCCGTGATCC TCGCCGCGAA GCGGGCCTTT GACGACCCGC GTCTCACCGG CGAGCCGAGC 
ATCGTGGAGG ATAGCTACGC GCAAAATCAA GAGAATAATG GTGGATCTTT CAACATGGGG CGTGCGATCG TGCAACCGGT GCTGACTTTT TCGTCGGCGC AGCGTGGACT GTACTTGTTC ACGGCTCGCA TAATGTCTAG CACGTGGGAA CGAGCGATCA TTGTGCCCGT TCGCGCGCCC GTTCAAACGA ATTTGAACGG CAACTCGCGA CCGCTATCGC CGGCGATGAA GATCGCGAAC AAAGCGCTCG GCGCGGTGCA AGCGGCAGCT CGGTACATGA GTGAAGAACC GGCGCTCAGA TGCTCCTTGG ATCCGACGTT GCTCAAGAAT TTGCACGATC GCTTGATGCC GTTGGTGACT TTCTTGAAGC AGCGTAGACC GCGCATTAGC AGCGGCGCCA CGATGTCGCA GACGAAGCGG CGTCGAGTGC GTTCATCCGG CAGCGAACTC ACGGCACTCC AGGAAGAAGA GCGCAGTCTC TCGGCCCTCT CCGCCTTGGT GAGCCGCACG GCGCAGGCTT TGTCTCTCAT TAGAATCATC ATCACTGATG AACGTTTCTC ACGAGTGGCC GACATGCTTC CCTCGGCGAT TCGCAAGGAG CTGTCACAGG TACGTTTCCG CGCTCCTACG CCTCGTCTTT TCTTTTTTGC CTGCGCGATG ATGATTTCCC ATATATCAAT CTGTCAATCC CCTGATTCAC AGTGATGACA ACTCATATGG TGCCCTTCCA CTGAGCGTCC ACTTTTACTT TTCCGCGCTG TGTTCTTCTG AGACTTTCGA CTGACTTTTT GTTTTACACT GCGACAGCTC ACCTTGAAGA AGTTGGTTTC GACGACGCAC GGGGCGCGTT TGGCGGGTGC GCTCATCGAA GCGATGATGT CTCACATAAT GTCTCACGCT CGTCACAGCG CCGAGGAATT AGCTGCGGAG CTCCAAAAGG GATGCCCAGA CTTTTTCGGC GCAGATTCGA GGACATTTTA CCACGCCAGA GACTTGTTGC AGTTGGCGCG CGACGCGCGC GCGAGAAAGG AAAACGTCTT GCGCGACCAA TACGTGAACG ATGCGATCGC TTTATTCATG AAGGTGCCGA CTTCTGGAGA CTTATCGGCC GTTTGTGCTG AACTTGTCGA TCTTCGCGCG TTTCATGGCG TCACCGCTGT GCCCCTCGCG GCGGCGGCGG CGCTCGAGGC GCGCGCAGAG GAAGCTCGTT TTACGATGCA TTCGCAACCG AATGTGGACA TGGTGGTAAG TTTATCTTGA CCTCAAATCT CGAAAGAACT TCGTTGCGAT GAAGACGAAT CTGATACGTG AACGTTCGGC TTACCGCCGG GCGTGCCGTG ATGATTAAAA ACTACAGAAC ATCTAAGCCT GAGCGACGGC GAATCATTCT CGGCTCGAGT CACGACGGGA TCGCGAAAAT CTCTTACTGA CGACAAATTT TTACCGCATT ATAGGATCTC CAATCGTGCT TTGAAGTCAC GTGTACCACG ATTCGGGCGC TTGCGACTGG AAGGGCTGAT GCGGACGCTG AACCAGGTTC GCTGAGTCGC GTCGCGGCGG AAGAACTTCC CGAGGACATT CGTGAACGAG GTCTCGTGAA GATTCTAGAG CAGCTTCAAC GAGTGTCTGG AGCTGACTCA CAAGATTTCA TGCACCGCGT ATTTGCGGAG CTGATTGCAG TGAGACGCGA CGCCATGCTT TTATCGCTAC CCGCGGCTAT GTTGGAACCG TACTTGGTGA ACAAGAGCGC GTTGACGTCT GCGCAACAAG GCGGCGCGCT CACGCCGGAT GAGGCGAGAC ATCTCGATCT 
CTTGGCGCAG TTGTACGCCG CCCGAAGTCT GTTTGGATTG GCGGCACAGG TCGATTGCTC GTTAGCGGAG CGTCGATGCG CGAACGACGA AACGTTCAGT TTGGACCAGC GCATGGCTTT GTTTGAGCGC GCGCTCATGC ACGCCAGAAA GTCGGTCGAC GGGGGATTGA CCAATGGTTT GGACACGTCG TTTTGCGAAA ACGTCGACAG CAAAATCAAA TTGCTCGACA TGCAGCGGCG AGTGCTGGGC GTGTGCATAG AGCGCTCCCG TCAAGCACGC GCCACCGGAT CGTCGAACGC TCCGGAGGAG GCGTTTGTGT ACGAATTGGA GCGCGAGTTG AAGCAACTGA GCGACTTGTA CAACGACTTC GCCAAGCCGT GTGAGTTGTG GGACATCTGC CTCGAAATGG TGCACTTCTC GCAATACCAC GATCCGGATG GCGAGATTGT GTGTGACTTA TGGGATAAGC TCCTCTTACA AGCCGCGTCT CGCGCGCCGA GCGCCGCGAC GTGCCTGCGG GAAGCCTGCC TCGTGGTTCG CGCGTTGGGC GTGAAGTTGT TCCCATCGGA CGTCGCGTTT CCGGTCATTC ACGTGGCTTT GCGTTTGGAG CTCATGGCGG CAGGACTGTG GGGTGTTCCC GATGTCGCAG TCGAAGCGCA CGTCGACGAT GAGTATGACA CGAGCGAAGT CGCGGACGCG CTCGTCGTCG CGTGCAAAGG ACTCGCAGAG CCCGTGCAAC GAGCGTACGA CCGCTTGCTC GCGACCCCGG CCCAACGCAT GCACGATAAG CGTCTGTCGA AAGTGGATGC GTTACAAACC CCGCGTTTGC GCCTCCGCTT GCTCAGATCG GTGTACTTTG TGCTCCAGCT TTGGGACCAG TCGCTCGTGC CCAAGGCGAC GGCGTACGGC GAACCGAGAG CGTACGCCAC CGGAGGCCAC GTGCGCGCGG CGATCGGCGA CTTGTGCGTG AGCTACGCCT CGGAAAGCCG ACGAATGCAG TGTCCGAATG AGGTTATTCG CGCGAACGCC GAGGAACTCG CCGCCGCTTT CGACACGTTC GGCCGCCGCT TGCTTTCTTC GTAG
|
Protein sequence | MRTRDRDADA REAMETAAKT VSKLAADAEQ ADLVELLREA PREAYSFQNA GWPSDVVGMK KEELPGVVLE RYNTRQSVCF CGVLPEISRA WASVDNALFL WRLDVADDVP VEYSGEEQAI VAVGLVKPKS GVFLEAISYV LVIATTVELV MVGVCLEDDG RELTLHPLQY SCPTDATIMN DITSTPEGRI FLAGADEALY ELVYAQSDTW HSKRCKKVRH SQNLSSLLPS VLRLKGSDAL KQVVVDAKRG ILYTRSEQGV VVVYDVGAAA KDAPRKVAEV KNVAQLAAQA RGQGSLFASA TSSVKKGAKL VHVALVHPEE SSVVTLVAIC ADGRRIYLTA LPPSRGYSYG VASGTGSSRQ GPSRLSVVEQ RDPPPQGGNQ RGMTTAQALL NTTSRALEIE AGFYSSGVLL LSDATPNDSD ARLILSNRDL ALPPHLQLPP PTPPPSSGSG TRGLREVVTL EQLEGRCASS LGSLGEIPMP KSVQDAIDPP YPTGTLPEAR VKSTGLLSEL VTQYMCPRRT FVLMTNAGLV RFEKARPIDT LRSVLEKNIP EQIEEFFKSY GPIEAAAMCV ALSVSGSESN AVILAAKRAF DDPRLTGEPS IVEDSYAQNQ ENNGGSFNMG RAIVQPVLTF SSAQRGLYLF TARIMSSTWE RAIIVPVRAP VQTNLNGNSR PLSPAMKIAN KALGAVQAAA RYMSEEPALR CSLDPTLLKN LHDRLMPLVT FLKQRRPRIS SGATMSQTKR RRVRSSGSEL TALQEEERSL SALSALVSRT AQALSLIRII ITDERFSRVA DMLPSAIRKE LSQLTLKKLV STTHGARLAG ALIEAMMSHI MSHARHSAEE LAAELQKGCP DFFGADSRTF YHARDLLQLA RDARARKENV LRDQYVNDAI ALFMKVPTSG DLSAVCAELV DLRAFHGVTA VPLAAAAALE ARAEEARFTM HSQPNVDMVD LQSCFEVTCT TIRALATGRA DADAEPGSLS RVAAEELPED IRERGLVKIL EQLQRVSGAD SQDFMHRVFA ELIAVRRDAM LLSLPAAMLE PYLVNKSALT SAQQGGALTP DEARHLDLLA QLYAARSLFG LAAQVDCSLA ERRCANDETF SLDQRMALFE RALMHARKSV DGGLTNGLDT SFCENVDSKI KLLDMQRRVL GVCIERSRQA RATGSSNAPE EAFVYELERE LKQLSDLYND FAKPCELWDI CLEMVHFSQY HDPDGEIVCD LWDKLLLQAA SRAPSAATCL REACLVVRAL GVKLFPSDVA FPVIHVALRL ELMAAGLWGV PDVAVEAHVD DEYDTSEVAD ALVVACKGLA EPVQRAYDRL LATPAQRMHD KRLSKVDALQ TPRLRLRLLR SVYFVLQLWD QSLVPKATAY GEPRAYATGG HVRAAIGDLC VSYASESRRM QCPNEVIRAN AEELAAAFDT FGRRLLSS
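The sequences above are printed in space-separated blocks of 10 residues, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, along with a simple GC fraction and a start/stop codon check. The helper names are hypothetical, the stop codons are those of the standard genetic code (an assumption, since the Translation table field is empty), and nothing here claims to reproduce how the database computed its own GC content value.

```python
# Stop codons of the standard genetic code (ASSUMED; the record does not
# specify a translation table).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def has_start_and_stop(raw: str) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    seq = normalize_sequence(raw)
    return seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two blocks of the gene sequence above, used only as a small demo.
    demo = "ATGCGGACGC GCGATCGGGA"
    print(normalize_sequence(demo))
    print(round(gc_fraction(demo), 2))
```

To analyze the full gene, paste the entire "Gene sequence" cell into `gc_fraction` or `has_start_and_stop`. Because the gene is spliced, the genomic sequence contains introns and cannot be translated directly, codon by codon, into the listed protein.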
|
| |