Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49385 |
Symbol | |
ID | 5001302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 463300 |
End bp | 466125 |
Gene Length | 2826 bp |
Protein Length | 936 aa |
Translation table | |
GC content | 55% |
IMG OID | 640416723 |
Product | predicted protein |
Protein accession | XP_001417252 |
Protein GI | 145345513 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0497486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGG TGAATCAGGT CGTCGAGCGC GGGTGCTCGA TGCTCGTGCA TTTCGATCGC GCGACGAGCG CGGTGGAGCT GAAGGAGGCG CTCGAGACGG GGAACGCGGA AGAGAAGGCG GACGCGATGA AGAAGGTGAT CTCGCTGCTG CTGAGCGGGG AGGCGATACC GCAGGTGTTC ATCACGATCG TGCGGTACGT GCTGCCGAGC GACGATCACA CGGTGCAAAA GCTTTTGTTG CTGTACATGG AGATGATTGA GAAATGCGGG GCGGATGGGA AGATTTTACC GGAGATGATT TTGTTGTGTC AAAACTTGCG AAACAATTTG CAACACCCGA ATGAGTTTTT GAGAGGGTGC ACGCTGCGGT TCTTGTGCCG AATCACGGAG CCGGACCTGT TGGAGCCGCT GATACCTTCG ATCGTGCAAA ATTTGGAGCA CAGGCACTCG TACGTGCGGA GAAACGCGGT GATGGCGATC AATAAGATTT ACGACTTGCC CGACGGCGAG CACTTGATTC CCGACGCGCC GGAAATCATC GAGTCATTTT TGATGAGCGG GGAAAACGAT TTGGGGACGC GTAGGAACGC GTTCTTGATG TTGTACACGC ACGCCCAAGA GCGCGCGGTG AACTATTTGA TGAACAATCT GGAATCCGTG TCAAACTGGG GGGACATTTT GCAAACCGTC GTGCTCGACC TGATTCGCAA GGTTTGCCGC TCGGATCCGA CGCAAAAGGG CAAGTACATC AAGGTCATCT TGATGCTGCT CGGTACGAAC AACGCATCGG TGGTGTATGA GTGTGCGAAC ACCCTCGTGG CGCTTTCGAA CGCGCCGACC GCGATCAAGG CGGCGGCCAA CTGCTACTGC CAACTCCTCG TGAACCAAAG CGACAACAAC GTCAAGCTCA TCGTGCTCGA TCGCTTGACT GACTTGAAGA AGGACAACAA AGAATTGTTG CAAGCGATGA TCATGGATAT CTTACGCGCG ATCTCCTCGC CGAACATCGA CATCAAGCGC AAGACGCTCG ATTTAGTGCT CGACTTGATC ACGCCTCGCA ACATCGACGA CGTCACGAGC ATGTTAAAGA CGGAGGTGAT CAAGTCTCAA TCTGAGAACA CCAGTGAGAC TGGAGAGTAC CGACAGTTGC TCGTCCAAGC GATTCACAAA TGTGCGTTGA AGTTCCCCGA AGTCGCGGGG TCGGTAATTT ACTTGCTCAT GGATTTCCTG AGCGACGCGA ACAGCGGGAG CTCGGCGGAC GTCGCATACT TTGTTCGTGA GATCGCCTTC ACGAACAAGT CGCTGCGTCC GGGGATTATC GAGCACCTGT TAGATTTGTT CTCCACCATT CGTAGCAGCC GAGTGTGCGC GACAGCGCTG TGGATCATCG GTGAGTTTAG CACGACGCAG GCGGAGCAAG AGGCGGCGCT CGAAGTCATT CGTATGAGCC TCGGTCCGGC ACCGCTCGTC GATGGCCCGG ACGGTGAAGA AGAGGACGAA GACACCACGG AAACGACGAC GCGCCCGGCC GTGTTGGCGG ATGGAACGTA TGCGACGCAA GCGGCGTACT CGACTTCGGC GGCGATTTCT CAAGTGCCGA ACTTGCGCGA AATGTTGCTG AAGGGTGACT CCTTCCTTTC GGCGGTGATT GCGAGCACGT TGACCAAGCT TGCGCTCAGA GTGATAGGAT CTGGTTCGGT TCCTCAAGCG CAAAAGAACG CGACACAAGC CGAGTGCATG TTATACATTG TGAGCATGTT GCGCTTGGGA ACGAGTGGCA AAGTGCCGAT CGAAATGGAT AGAGACTCTA AGGCTCGGTT GGAGTTATGC TTCCACGTCA TCGGTCATCC GGAAGAGGCG GATACCGACG TCTGGTTGAA ATCGTGCGGA GAATCTTTCG CTTTGATGAT TGAAGAAAAG CTTAGACGCG AATCTCAGGC GAGCGCGAAC TCCGACGCCG CACCAGTCGC GCAGGCTGAC GATTTGATTG ATTTCCATCA TCTCAAGTCG CGCAAGGGTA TGACGCAACT TGAGATTGAA GATGCCGTCG CGACCGATCT CGCCCGAGCC ACTGGTTTCA TGGACTCGGT CAAGAAGAAC GGACGTAGCC TCGACCGCGT GATGCAACTC ACCGGTTTGA GCGACACAGT CTACGCGGAG ACGTACGTCA CCGTGCACCA GTACGACATC ACGCTTGACG TGACGATGAT CAATAGAACG GACGAGCCGC TGCAAAACGT CATGTTGGAA CTTTCCACCA TGGGTGATTT GAAGCTTGTT GAGCGTCCTC AACCATTTTC GTTACCACCG TTCGGTTCTC ACAACCTGAG AGCGAGCATC AAGGTGAGCT CGACCGAAAC GGGAGTCATC TTCGGCAACA TCGTGTACGA GACCGCTCGC TCCGATCGTA ATGTCATCGT GTTGAACGAC GTGCACATTG ATATCATGGA TTACATCATC CCCGCGACGT GCAGCGACAC GGTGTTTAGA AGCATGTGGG CTGAATTCGA GTGGGAGAAC AAGGTTGCCG TGAGCACAAA CATCACCGAC GTTCGCAAGT ATTTGGATCA TATCGTGACT AGCACGAATA TGAAGTGCCT CACTCCGCCG AGCGCGCTCG ATGGCGAATG CGGGTTTCTG GCCGCCAACT TGTACGCCAA GTCCGTGTTC GGCGAAGACG CGCTGGTAAA CGTTTCGATC GAGAGCAACG ACGGTGAGAT CAGTGGCTTC ATCCGAATCC GCTCGAAGAC GCAAGGCATC GCGCTTTCGC TCGGCGATAA GATCACGCTG AAGCAATCGA TCGAAATCTA GACGTGTTTA ATCAAA
|
Protein sequence | MATVNQVVER GCSMLVHFDR ATSAVELKEA LETGNAEEKA DAMKKVISLL LSGEAIPQVF ITIVRYVLPS DDHTVQKLLL LYMEMIEKCG ADGKILPEMI LLCQNLRNNL QHPNEFLRGC TLRFLCRITE PDLLEPLIPS IVQNLEHRHS YVRRNAVMAI NKIYDLPDGE HLIPDAPEII ESFLMSGEND LGTRRNAFLM LYTHAQERAV NYLMNNLESV SNWGDILQTV VLDLIRKVCR SDPTQKGKYI KVILMLLGTN NASVVYECAN TLVALSNAPT AIKAAANCYC QLLVNQSDNN VKLIVLDRLT DLKKDNKELL QAMIMDILRA ISSPNIDIKR KTLDLVLDLI TPRNIDDVTS MLKTEVIKSQ SENTSETGEY RQLLVQAIHK CALKFPEVAG SVIYLLMDFL SDANSGSSAD VAYFVREIAF TNKSLRPGII EHLLDLFSTI RSSRVCATAL WIIGEFSTTQ AEQEAALEVI RMSLGPAPLV DGPDGEEEDE DTTETTTRPA VLADGTYATQ AAYSTSAAIS QVPNLREMLL KGDSFLSAVI ASTLTKLALR VIGSGSVPQA QKNATQAECM LYIVSMLRLG TSGKVPIEMD RDSKARLELC FHVIGHPEEA DTDVWLKSCG ESFALMIEEK LRRESQASAN SDAAPVAQAD DLIDFHHLKS RKGMTQLEIE DAVATDLARA TGFMDSVKKN GRSLDRVMQL TGLSDTVYAE TYVTVHQYDI TLDVTMINRT DEPLQNVMLE LSTMGDLKLV ERPQPFSLPP FGSHNLRASI KVSSTETGVI FGNIVYETAR SDRNVIVLND VHIDIMDYII PATCSDTVFR SMWAEFEWEN KVAVSTNITD VRKYLDHIVT STNMKCLTPP SALDGECGFL AANLYAKSVF GEDALVNVSI ESNDGEISGF IRIRSKTQGI ALSLGDKITL KQSIEI
|
| |