Gene Information |
Locus tag | OSTLU_50892 |
Symbol | |
ID | 5004291 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 568220 |
End bp | 569958 |
Gene Length | 1739 bp |
Protein Length | 502 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419712 |
Product | predicted protein |
Protein accession | XP_001420552 |
Protein GI | 145352431 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CGACGACGAC GACGACGACG ACGCGCGCGA TGGCGTCCGC GAGCGCGAGG GATGCGGTGA AACAGTTCGT GACGTACTTT TATCGACACA TTCGCGAGAA GAACGGTGCG CGCGCGACGC GACGCGACGC GACGCGACGC GACGCGAGAG GTTTTGCGCG CGCGCGCGAT CGAACGCGCG GCGCGGGCGC GGAGGAGGCG TCGGGACGGC GCGGTCGCCG AGGAGGCGCG CGGTGCGCGA TGCGCGCGAT GACTGACGAC GTCGCCGACG GCGAGGCGGA TCCGCGCAGT GTACGAGACG CTGACGATGT ACGAGCGCTC GTTTCCGGCG ATCGGCGAAC GCTTCTTCAA GAAGACGGCG TGGCCGACGC CCGATGCGAT CGCGACGTAC GCGGAGAACG ATCCGGTGTT TAAGATGCTG TACGCGGAGC TTTATTACCG ACACCTGCAC CAAGCGACGA CGCCGAGCGC GAACGAACGG AAGGGGTCGT GGGAAAACTA TTGCGAACTC TTCGGCGCGT TGTTGCACGG CGACGCCAAC ATGCAGTTGC CGAACGTGTG GTTGTACGAG ATGGTGGATG AGTTCGTGTA TCAGTTTCAG TCGTTCCGAC AGTTTAAAGG CAGCTCGAAG CGCACGAGCG AGGAGTTACA GGCGATCAAG GCGTTGGACG CGCAAGTTTG GGATCGCACG AGCATGTTGA ATTTTTTAAA GGCGCTGGTG GATAAGAGCG ACATCGTGCG CGTGCTCGAG CGCGAGCGCG CGGGAGAGAT TTCGTTTGCG GCGAACGAAG GGTACGCGCT TGATTCGAGT AACGTTTTGC CCACGCTCGG GTACTGCGCG CAGATTGGTA TCTGCCGCTT GCACGTTTTG ACGGGCGATT ACGAAGGAGC GCTCGCCGCG CTCGACGCCG TGGATTTGGA CAAGGATGGG TTGTTCAAGA AGATTCCGGG CGCCTACGTC GCGACTTCGT ATCACGTCGG GTTCGCCTAC TTTATGCTCG GCCGATACAC GGACGCCATT CGTCACTTTA ACGAAAGCAT CCAGTACGTC GAACGGCTGA GGTTTGGCGC CGCACGTCCG CACGCGCTGC CGCTCTTGTT GAAGAAGCAA GAACAAATGT ACGCGTTGAT CGCTATCACC ATGGCGCTCG TTCCGGGCCA GCAGTACTTA CTCGATGGGT CCGTTTCTTT GGGTTTGCAC CAAAAGTATA GCGAAAAGGT TAACCGGATG ACGAGCGGCG AAGTCGCCGT CTTCGACGAC CTCTTTTCGT ACGCGTGCCC AAAGTTTGTC GCGAGCGGCG ACGGAGACAA CTCCGAGGCG CACAAGACGC AGCTCAAGGC GTTCTTGGAC TCCGTCGCTA CGCACGCCGT AATTCCCAAG CTTCGTGGAT ACTTGCGAAT GTACACGTCG ATCAAGCTCG ACAAACTCGC GGCGCTCGTC GAGACGCCCG TGGAGAGTCT GAAGGCGAGC TTGAAGGCGT TCGTACAAGG GTACACCGTG AAGGAGTGGA CGGGTGGAGC GAGCGCGTTG GATGGTGAGG AGGTTTACTG CGGTGACATG AGTGTCATCT TAGACGGGGA TGTGGTCAGA GTGGAAGAGA CAAAGAAGGC GAAGAGCGCT CGTGAATTCT TCGCGAGAAC GAATCAAAAG TTGGCCGCCT CGCTCGAGGA CTTGGCGCAG GCCAAGCCGC TGCTCATCAA GCCGTCGGCG ATCGTGGGTT AATCAATAAA CCAGTAGAC
Protein sequence | MASASARDAV KQFVTYFYRH IREKNVYETL TMYERSFPAI GERFFKKTAW PTPDAIATYA ENDPVFKMLY AELYYRHLHQ ATTPSANERK GSWENYCELF GALLHGDANM QLPNVWLYEM VDEFVYQFQS FRQFKGSSKR TSEELQAIKA LDAQVWDRTS MLNFLKALVD KSDIVRVLER ERAGEISFAA NEGYALDSSN VLPTLGYCAQ IGICRLHVLT GDYEGALAAL DAVDLDKDGL FKKIPGAYVA TSYHVGFAYF MLGRYTDAIR HFNESIQYVE RLRFGAARPH ALPLLLKKQE QMYALIAITM ALVPGQQYLL DGSVSLGLHQ KYSEKVNRMT SGEVAVFDDL FSYACPKFVA SGDGDNSEAH KTQLKAFLDS VATHAVIPKL RGYLRMYTSI KLDKLAALVE TPVESLKASL KAFVQGYTVK EWTGGASALD GEEVYCGDMS VILDGDVVRV EETKKAKSAR EFFARTNQKL AASLEDLAQA KPLLIKPSAI VG