Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26403 |
Symbol | |
ID | 5004331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 162436 |
End bp | 164391 |
Gene Length | 1956 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419752 |
Product | predicted protein |
Protein accession | XP_001420263 |
Protein GI | 145351826 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3000] Sterol desaturase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00293247 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00469523 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGACGA AACCGGGCGC GTTGTATGAC TTCCCGTGGG CGGAGTGGGG ATCGATGAAG TATGCGGTGT TTTTGCCGTT CGTGGCGACG GTAGCGCTCG GAAAAGACGA TGGGGATTCG TTTTGTTGGC ACTTGTTGGC GATCGCGGCG CTGCGGTACG CGAGCGCGCA ACTTTGGATA AGCTTGTCGC GAGTGCACGC GTGGACGAGG AAAACGCGGA TACAAGCCAG AGGGATCGAC TTTAAACAGG TCGATAGGGA GGATAATTGG GATGATTACA TATTGCTCCA AACGCTGGTG ATCGCGCTCG TGCATTGGAT GCCGGGATTG GGGTTTAATA ATTTCCCGGC GACGAATGAA AAGACGGCGG TGCAGCTGTT GTTGTTGCAC GCGGGGCCGA CTGAGTTCAT TTATTATTGG CTGCATCGCG CGTTGCATCA TCACAAATTG TACAGTGCGT ATCACTCGCA TCATCACGCG AGCTTTGTCA CCGAACCGAT CACCGGGAGC GTGCATCCGT TCATGGAGCA CTTGATGTAC ACCGCGAACT TCGCCATTCC GCTCATCGGG ACGTGGGCGC TCGGCGGCGG CTCCATCGCG ATGTTTTACA TGTACTTGCT CGGCTTTGAC ATGCTCAACG CCATCGGGCA CTGTAACTTT GAGTTCATCC CGCGTTGGTT CATGAGACTG CCGCTGATGA AGTACTTGAT TTACACGCCG AGCTATCATT CGTTGCACCA CTCGCGCGTG CACACAAATT TTTGCTTGTT CATGCCGCTG TACGACCACG TGTACGGCAC AGCGGATGTA ACGAGCGACG AGTTGTACGA GAAGGCGATC AATGGGCGGG CGGTACCGGT CACGGCGCCT GATGTGGTGT TTATGGCCCA CGGAACTGAG TTATTGAGCG TGTTCCACTT GCCTTTCATG TTGCGCAGCT TTTCGTCTCG CCCGTTTGTC TCGCAGTGGT GGCTGAAACC GTTTTGGCCG CTGTGCGTGC CTTTTGTGCT TGTGTTGCGA ATGTTCGGCA AATCTTTCGT AGCCGATCGA CACCGGCTCA AGACGTTGAA CTGTGAGACG TGGGTGACGC CCGCGTGGGG GTTTCAATTC TTCATCAAGA GTGAATTTAA TCACATCAAC AGGAAGATTG AGGAGGCGAT TTTGGACGCC GATCGCGCCG GAGTGAAAGT CGTCGGCTTG GGAGCGCTGA ATAAGAACGA GGCGCTAAAC GGGGGAGGCG CACTGTTCGT CAACAAGCAT GGGAAGTCTC TGAAGACGAG AGTCGTGCAC GGGAACACGC TCACGGCGGC GGCGATTTTA CAGAAAATCC CGAGCGAGTG CAAGGAAATT TTCCTCACGG GGGCGACGTC CAAACTTGGA CGAGCGATCG CTCTTTATTG CGTGGAGAGA GGCATGCGTG TGGTGATGTA CACAACGAGC GAAGAGAGGT TCGAAAAGAT CAGAAATGAA GCGGCGAAGA AGGATCAGCA TCTCCTCGTG CAATCTACGT CCCTGAGCGA CGGGGCAAAG ATAAAGGACT GGGTCATCGG CAAGCACTGC CCGGAGAAGG ATCAAAATAT GGCGCCGAGG GGCGCGATAT TCCACCAGTT TGTCGTGCCG CCAATCCCCG AAACGCGAAA GGATTGCGTG TACACCGACC TGCCCGCGTT CAAGCTTCCT AAAGAGGCAA AAGACTTCCG GTCATGCGAG ATGACGATGA AGCGTGGTCA CATTCACGCG TGCCACGCGG GCGCCCTCGT GCACTCGCTA GAAGGCTGGG ATCACCACGA AGTCGGTGCC ATCGACCACA CGCGCATAGA TACTACTTGG GAAGCGGCGC TGAAGCACGG ATTCGCCTTG GCTTAAGCTT TCACGCTCTT GCATAATTTT TAGGATGACT TCGCCGAGGC TCTCGACATT ACAGAGCCGA TTCGGGGTTG TCCAGTCGCT CGCTTC
|
Protein sequence | MATKPGALYD FPWAEWGSMK YAVFLPFVAT VALGKDDGDS FCWHLLAIAA LRYASAQLWI SLSRVHAWTR KTRIQARGID FKQVDREDNW DDYILLQTLV IALVHWMPGL GFNNFPATNE KTAVQLLLLH AGPTEFIYYW LHRALHHHKL YSAYHSHHHA SFVTEPITGS VHPFMEHLMY TANFAIPLIG TWALGGGSIA MFYMYLLGFD MLNAIGHCNF EFIPRWFMRL PLMKYLIYTP SYHSLHHSRV HTNFCLFMPL YDHVYGTADV TSDELYEKAI NGRAVPVTAP DVVFMAHGTE LLSVFHLPFM LRSFSSRPFV SQWWLKPFWP LCVPFVLVLR MFGKSFVADR HRLKTLNCET WVTPAWGFQF FIKSEFNHIN RKIEEAILDA DRAGVKVVGL GALNKNEALN GGGALFVNKH GKSLKTRVVH GNTLTAAAIL QKIPSECKEI FLTGATSKLG RAIALYCVER GMRVVMYTTS EERFEKIRNE AAKKDQHLLV QSTSLSDGAK IKDWVIGKHC PEKDQNMAPR GAIFHQFVVP PIPETRKDCV YTDLPAFKLP KEAKDFRSCE MTMKRGHIHA CHAGALVHSL EGWDHHEVGA IDHTRIDTTW EAALKHGFAL A
|
| |