Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42302 |
Symbol | |
ID | 5006499 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | + |
Start bp | 115215 |
End bp | 117218 |
Gene Length | 2004 bp |
Protein Length | 622 aa |
Translation table | |
GC content | 58% |
IMG OID | 640421920 |
Product | predicted protein |
Protein accession | XP_001422394 |
Protein GI | 145356347 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.0110936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00027641 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGGGAC TCGGATCGCT GAGCTCGGCG GACAAGACGA AGGCGTCGAC GATGCGGTTG TTTCAAAAAT CGCTGCGAGA CATGATCACG GGCATTCGCA GTCACAAGGA TTCGCAGAAG GAGTTTATAA ATAAATGCCT GAGCGACATA CGCGCGGAGG TGGTGAGCTC GGACATCAGG ACGAAGGCGG TGGCGATCGA AAAGGCGACG TACCTGCACT CGCTCGGGTA TTCGATGCAC TGGGCGTCGT TTCACGTGGT GGAATTGATG TCGACGACGA ACGTAAAGTA TAAGAGAATC GGATACTTGG CGGCGTGTCA AAGTTTTGGG GACGACACGG ACGTGGTGCT GTTGATTCCT AATTTGTTGA AGAAGGATTT GGCGTCGCCG AATCCCGCGG AGGCGGCGCT GGCGATTTCG TGCCTGGGGA ACATCGTGAC GCCGGAGTTG TCGCAGACGC TCGTGGCGGA CGTGTACTCG CTGTTGAACA ATCATAAGCC GGATTTGCGA CGACGAGCGT GCTTGTGCCT GTACAAGTGC TTCTTGCGAT ATCCCGAGGC GCTGCGACCG TCGTTCGCGC GATTGACAGA ATGTCTGGAC GACGACGATC AATCCGTGGT GCAGGCGGCG GTGACGGTGC TTTCCGAGCT CGCCATGCAC AACCCGAAGA CGTACCTACC GCTCGCGCCC AAGTTTTATA AACTCTTGAC GTCGAGCTCG TCGAATTGGA TGACGATCAA GCTCGTGAAG GTTTTCGGCG CGCTCACGCC GCTCGAGCCG CGACTGGCGA AGAAGCTCGC GGGGCCGATC TCGGAGATTC TCGAAACCAC CAACGCCAAG TCGTTGATGT ACGAGTGCGT GCGCACGGTG GTGATGGGGA TGACGAGTCA AGAAAAGGTC GTCCGCCAAG CGGTGGACAA GCTCAAGGAT ATGTTGGAGG ATCACGATCC AAACATCAAA TTTTTGGCGT TGCACGCGTT GACGTTTTTG CTCGATTCGC ACCCGCGCAT CGTCGCCGAG CACAAGGGGA ACATCTTCGA GTGCCTCGAC CACGAAGATT CAAACATTCA ATACTGTGCG TTGAAAATTG TGTGCGGTTT GGTGACGAAG CGCACGCTCA TCGACACCAC GGCGCACTTG ATGAACGCCA TGGGCAAGGC GGATCAACGC TTCAGGGACG AACTCGTGTT GAGCGTGATT CACATCTGCA TGAACGAGCG CTACGCCTTG GTGACGGATT TTGTGTGGTA TCTTTCCGTC CTCGCGGATC TCATTCGTGT CCCGTGCTCG TCTCACGGCG CGCTCATCGG TGAACAAATC ATCGACGTGT GCTTGCGCGT CGAGGTGATT CGCGAAGCCG CGGTGGGCAT CTTGGGCCCG CTCTTGCTCG ATGACTCATT GCTCGAGCAA TCCAACGTCA ACAAGACCGT GCCCGGAGCG CTCAAGGCCG TGGCTTGGGT CGTGGGCGAG TATGCGCACT ACGTTGTCGA TCACGAAGAA ATTCTCGACG CGTTGCTGAG CCCGCAAGTC AAGCAACTCC CGGGCGACGC GCAAGCGGTC TACTTGCAAA CCATCTTCAA GGTTTACGCC AGTGCGGTGC TGATGTACTC TCAGGGCATT CGCCCCACGG GCGCCTTGGG TGCGCTCCCG GCACCGGCGA CGGAGCCTTT GATCGAGCTC GCCGAAAACG GCGGTGAAGA CGGCGGCGCG GCGAGCGCGC CGACGCCGGG CAAGTTGGTT CCCGAATCCG AGGACCTCGT CGTGTTGCGT CAAAAAGTCT CGCGCGCGAT CGAGCCGTTC ACTACGAGTT TTAATCTCGA AGTTCGCGAG CGAAGCTGCC AACTGCAGCA AATGCTCGTC ATCGTCGGCT CGTCCGAGGA CAAGGAACCG GGCTCGGGAA TCGCCATAAT CGAAGCTTTC GCCACGGTTT TGAGCGAAGA GATCCAACCG GTGAGTGTCA AGGCGCAGCG TAAGATCGAC ATCCCCGCCG AGCTCGGCGC GGAC
|
Protein sequence | MPGLGSLSSA DKTKASTMRL FQKSLRDMIT GIRSHKDSQK EFINKCLSDI RAEVVSSDIR TKAVAIEKAT YLHSLGYSMH WASFHVVELM STTNVKYKRI GYLAACQSFG DDTDVVLLIP NLLKKDLASP NPAEAALAIS CLGNIVTPEL SQTLVADVYS LLNNHKPDLR RRACLCLYKC FLRYPEALRP SFARLTECLD DDDQSVVQAA VTVLSELAMH NPKTYLPLAP KFYKLLTSSS SNWMTIKLVK VFGALTPLEP RLAKKLAGPI SEILETTNAK SLMYECVRTV VMGMTSQEKV VRQAVDKLKD MLEDHDPNIK FLALHALTFL LDSHPRIVAE HKGNIFECLD HEDSNIQYCA LKIVCGLVTK RTLIDTTAHL MNAMGKADQR FRDELVLSVI HICMNERYAL VTDFVWYLSV LADLIRVPCS SHGALIGEQI IDVCLRVEVI REAAVGILGP LLLDDSLLEQ SNVNKTVPGA LKAVAWVVGE YAHYVVDHEE ILDALLSPQV KQLPGDAQAV YLQTIFKVYA SALVPESEDL VVLRQKVSRA IEPFTTSFNL EVRERSCQLQ QMLVIVGSSE DKEPGSGIAI IEAFATVLSE EIQPVSVKAQ RKIDIPAELG AD
|
| |