Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39338 |
Symbol | |
ID | 5004851 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 95499 |
End bp | 97505 |
Gene Length | 2007 bp |
Protein Length | 630 aa |
Translation table | |
GC content | 52% |
IMG OID | 640420272 |
Product | predicted protein |
Protein accession | XP_001420751 |
Protein GI | 145352857 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.109278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0495163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCAG CGCCCCCTCC GGCGGTGCAG AAGAACAAAT CCAAGGAATT TTACGATTTG GTTCGACGTA TAGGTGAGAG AACGCGACGG AGATGTCGCG AAAGCGAGTG AGATTGTCCC GCTGCGTGCG ATTGTCCCGC GACAATGCAT CGACGACTGA CGCGCGCGCG CGCGCGAAAT ATCCAAACAG GGGAGTGTAA GAGCAAGACA GATGAAGACG TCATCATGCA GCGCGAGTCG ATGTACCTTC GAGCGCTGCT ACAGCAGCCC AAGATTGATA AAATGAAGAT CAAGGAAGTC ATGCTGCGGT TGATGTATCT GGAAATGCTC GGTCACGACG CGTCGTTCGG ACACATACAC GCGGTGAAAG CGTGCGTGGA GAGCGACATC GCGATAAAAC GAGCGGGGTA CCTGGCGACG ACGTCGTTTT TAAACGAAGA TCACGATTTG ATCATTTTAA TCGTGAACAC GGTGCAGCAA GATTTAAAGA GCGATGATTA TTTGGTCGTG TGCGCGGCGT TGACGGCCAT CATGCGGTTG GTGAACGAGG ATACGGTGCC GGCGGTGTTG CCGCAAGTTA CATCATTGCT CATGCATCCC GTGGCCCACG TGCGGAAGAA GGCGGTGATG GCACTCATGC GATTTTATCA AAAGAGTCCG CAGAGTGTGA GTCATTTACA CGGCAAGTTT CGAGAGATGA TTTGCGATAA GGATCCAAGC GTCATGTCCG CGGCTGTGTG TGCTTTACAC GAACTGGTGG CGCACGATCC CGAACCACAC AAGAATTTGT CGTCGAGCTT CGTGAGCGTG CTCAAGCAAG TCATCGATCG AAGATTACCA AAGTCGTACG AATACCACAG GACGCCGGCT CCGTTTGTGC AAATCAAGTT ATTGAAGATA TTAGCAATCT TAGGCGCTCA TGACAAGACC ACGAGCAGCG AGATGTATAA TGTTTTGGAA GACACGCTCG CGCGGGCGAC AGACTCTAAG AACCAAATAG GTAACGCTCT GGTGTACGAA TCGGTGAGGA CAATCACAAG CATCTATCCA AACCCGCAAT TGTTGGCGCA GTGCGCGATG GTGATATCTC GGTTCATCAA GAGCTCAAAC AACAATTTGA AATATGCTGG CTTGAATACA CTGGCATGTA TAGTAAACGT CAATCCGCAG TACGCGGCAG AGCATCAGAT GGCGGTCGTG GACTGCTTGG AAGACTCCGA CGAGACGTTG CGCAAAAAGA CGCTCGACTT GCTCTACAAG ATGACAAAAC CAAACAACGT GGAGGTGATC GTCGAGCGTA TGTTGGCCTT TTTGAAACGG GACGGCGACA AATATAGCGA TCAGTACGTG CGAGAGGAGA CGGCTTCACG TGTCGCAGAA CTCGCGGAGA GATATGCCCC CGACGCAAAG TGGTACGTGG AAGTCATGAC GGAACTCTTT GAGACGGCGG GCGACGTGGT AAAGCCATCC ATCGGTCAGG GTTTAATGCG TCTATTAGCT GAAGGCACGG GAGATGATGC TATCGATGAT CTTTCGCGCA AATCTGCCGT TAATGCGTAC GTGAATTTGC TTCACAAGCC AAAACTTCCT CTCGTCTTGT TGAAGACGAT GGTTTGGGTC CTCGGCGAGC TCGGGGAACT GAGCGGTCGG AACGCCGAGA CGCTGATGGA CATGCTCGTT GAAGTCACGG AGAAGCAAAT TCATGGCCCC GCAGTTGAGA CTTTAGTTTT GAGCGCCATA GCGAAGATAG CACGTCGCGC CAGTGGTGGG TTGAGCCCAA ACGCGCGCGC ATTCGTCGAG CAAAACGCGA AGAGCAAATT CGTAGAGAAG CAGCAACGTG CGCTCGAAGT CGATGTGCTC GTGGGTGAGG AGACGCAGAT ACTTTCGGGT GTCATCGCAC CTTCCGCAGT AGATGTCAAC GTGGATGCAT CGCTGAGTAT GCTGAATCAA TACGTCTCAA ATGCGCTCGC AAACGGTGCA AAGCCGTACC AGGAAAAGGC GCAACGA
|
Protein sequence | MSSAPPPAVQ KNKSKEFYDL VRRIGECKSK TDEDVIMQRE SMYLRALLQQ PKIDKMKIKE VMLRLMYLEM LGHDASFGHI HAVKACVESD IAIKRAGYLA TTSFLNEDHD LIILIVNTVQ QDLKSDDYLV VCAALTAIMR LVNEDTVPAV LPQVTSLLMH PVAHVRKKAV MALMRFYQKS PQSVSHLHGK FREMICDKDP SVMSAAVCAL HELVAHDPEP HKNLSSSFVS VLKQVIDRRL PKSYEYHRTP APFVQIKLLK ILAILGAHDK TTSSEMYNVL EDTLARATDS KNQIGNALVY ESVRTITSIY PNPQLLAQCA MVISRFIKSS NNNLKYAGLN TLACIVNVNP QYAAEHQMAV VDCLEDSDET LRKKTLDLLY KMTKPNNVEV IVERMLAFLK RDGDKYSDQY VREETASRVA ELAERYAPDA KWYVEVMTEL FETAGDVVKP SIGQGLMRLL AEGTGDDAID DLSRKSAVNA YVNLLHKPKL PLVLLKTMVW VLGELGELSG RNAETLMDML VEVTEKQIHG PAVETLVLSA IAKIARRASG GLSPNARAFV EQNAKSKFVE KQQRALEVDV LVGEETQILS GVIAPSAVDV NVDASLSMLN QYVSNALANG AKPYQEKAQR
|
| |