Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33739 |
Symbol | |
ID | 5006388 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009372 |
Strand | + |
Start bp | 66007 |
End bp | 68574 |
Gene Length | 2568 bp |
Protein Length | 331 aa |
Translation table | |
GC content | 50% |
IMG OID | 640421809 |
Product | predicted protein |
Protein accession | XP_001422294 |
Protein GI | 145356136 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02167] bacterial surface protein 26-residue repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.39925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000589113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGATT GGGACACGGG CAATGTCACG AACATGGGTA GTGCATTTTC GTCTAAATAC TACTTCAATG CATCCATCGG GAACTGGAAC ACGTCGCGAG TGACGTATAT GGGTTCCATG TTTCAGTCTG CGTATTCGTT TAATCAGGAC ATCGGGAACT GGAACACGTC GCAGGTGACG AACATGTATT CCATGTTTCA GTCTGCGAAT TCGTTTAATC AGGACATCGG GAACTGGAAC ACGGCGCGAG TGACGTATAT GGGTTCCATG TTTCAGTATG CATCTTCGTT TGATCAGGAC ATCGGGGGCT GGAACACGTC GCAGGTGACG AACATGTATT CCATGTTTCA GTATGCGTCT TCGTTTGATC AGGATATCGG GAACTGGAAC ACGTCGCGAG TGACGTATAT GTATTCCATG TTTCAGGGCG CATCTGCGTT TGATCAGGAC ATCGGGGGCT GGAACACGTC GCGAGTGACC AATATGGGTT CCATGTTTCA GTATGCATCT TCGTTTGATC AGGACATCAC AAACTGGAAC ACGTCGCGAG TGACCAATAT GGGTTCCATG TTTCAGTATG CATCTTCGTT TGATCAGGAC ATCACAAACT GGAACACGGC GCGAGTAACG TACATGACCA ACATTTTCCA AAACGCGACG GCGTTTCAGG CGAAATATAC ATGTTCGACC GTTCAATCTG GACCCATCGC ATCGTGTTCG CAGTGCGTTG CAAACTGTGC CGCTTGTAGT AACACTGCGT CGGGTGTTTG TAGTGTTTGC ATGACTGGGT ACACATTAGA CGGAGGAATG TGTGCGGCAC CCGCTGCGGC GCTCACCGAC GCAACGTTTT CTACTGCGAT CGCTAGTTGC CTCGAGGAGG CAGCGTCGGA TGGCCTGTGT ACTTCTTACG GCTTTGCAAG TGGGTTCGGA GCGATGCCTG ATTGGGACAC GGGCAATGTC ACGAACATGG GTAGTGCATT TTCGTCTAAA TACTACTTCA ATGCATCCAT CGGGAACTGG AACACGTCGC GAGTGACGTA TATGGGTTCC ATGTTTCAGT CTGCGTATTC GTTTAATCAG GACATCGGGA ACTGGAACAC GTCGCAGGTG ACGAACATGT ATTCCATGTT TCAGTCTGCG AATTCGTTTA ATCAGGACAT CGGGAACTGG AACACGGCGC GAGTGACGTA TATGGGTTCC ATGTTTCAGT ATGCATCTTC GTTTGATCAG GACATCGGGG GCTGGAACAC GTCGCAGGTG ACGAACATGT ATTCCATGTT TCAGTATGCG TCTTCGTTTG ATCAGGATAT CGGGAACTGG AACACGTCGC GAGTGACGTA TATGTATTCC ATGTTTCAGG GCGCATCTGC GTTTGATCAG GACATCGGGG GCTGGAACAC GTCGCGAGTG ACCAATATGG GTTCCATGTT TCAGTATGCA TCTTCGTTTG ATCAGGACAT CACAAACTGG AACACGTCGC GAGTGACCAA TATGGGTTCC ATGTTTCAGT ATGCATCTTC GTTTGATCAG GACATCACAA ACTGGAACAC GGCGCGAGTA ACGTACATGA CCAACATTTT CCAAAACGCG ACGGCGTTTC AGGCGAAATA TACATGTTCG ACCGTTCAAT CTGGACCCAT CGCATCGTGT CGATCGGGGT CACCATTTCA GAATTCGACC GAGTTGAAGC TTGCGGTCGA CAGCTGTTTA CTCGCGGATC CAACAGGAAA TTGTGATTGC AGAAGTTCTC TGGTCGACTG CCGGGCGGCG AGCGGCGATT CAATATCGAA ATGGGATACA CGCTTCGTAA CGAATATGAG CTCCTTGTTT GCAGGATCAG ATCAATTTAA CGCAGATTTA GGTGCTTGGA ACACAAGTGC CGTTACTTCG ATGTCGCACA TGTTTCATGG AGCCGCTGCG TTCAACAAAG ACATTAGTGG TACGTATAAT GCGAATATCT TCGCGTGGGA CGTGAGCGGC GTGACGGACA TGACTCGAAT GTTCGAAGGC GCTAATAGTT TCGCTCAGGA CATCAGCTCG TGGAACGTGG TGAACGTCGC GAGCATGGAC AGTATGTTTT CGGGCGCAAA CGCGTTCAAC GCGCCCATCG GCAGTTGGAA CACAGACGGC GTGACAAACA TGTTTCAGAT GTTCCACTCA GCACATGTGT TCGATCAACC AATCGGAAGC TGGAACACGG AACGGGTGAC GAGCATGCAG TGCATGTTCA TGTATGCGCG TGAATTCAAT CAGGATATCA GCTCCTGGAA CACGGTCAAC GTGGTAAATA TGTGGGATAT GTTTAATAGT GCGGATGCGT TCAACCAGAA CATTGGGTTA TGGAACACAG AGGAGGTGAC GGATATGAGA TACATGTTCT ATAACGCGGG TGCGTTCAAC CAGCCCATAG GGTCGTGGAA CACCGCCAAT GTCACGACCA TGCGATCTAT GTTTCAGGGC GCATCTGCGT TTAATCAGGA CATCGGAAAC TGGAACACGA CTTCTGTCAC GGACATGTAC AGCATGTTTG GTAGCGCGAC GGCGTTCAAT CAGAACATCA CGGGATGG
|
Protein sequence | MPDWDTGNVT NMGSAFSSKY YFNASIGNWN TSRVTYMGSM FQSAYSFNQD IGNWNTSQVT NMYSMFQSAN SFNQDIGNWN TARVTYMGSM FQYASSFDQD IGGWNTSQVT NMYSMFQYAS SFDQDIGNWN TSRVTYMYSM FQGASAFDQD IGGWNTSRVT NMGSMFQYAS SFDQDITNWN TSRVTNMYSM FQYASSFDQD IGNWNTSRVT YMYSMFQGAS AFDQDIGGWN TSRVTNMGSM FQYASSFDQD ITNWNTSRVT NMGSIYMFYN AGAFNQPIGS WNTANVTTMR SMFQGASAFN QDIGNWNTTS VTDMYSMFGS ATAFNQNITG W
|
| |