Gene Information |
Locus tag | OSTLU_30641 |
Symbol | |
ID | 5000571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 511280 |
End bp | 513131 |
Gene Length | 1852 bp |
Protein Length | 614 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415992 |
Product | predicted protein |
Protein accession | XP_001416969 |
Protein GI | 145344914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0313408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.102836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGTTTT TTCGAAAGAA GACGTCGACG CGAGCGTCTT CGGACGAGGG GAGGGAGAAC GCGAGCGCGG AGGGGAGACG GTCGCTTCGG GACGGCGCGG ACGGGACGAC GTCGCCGAAG GGCGCGCGGA GCGGGGAGCT GAGCGAGAAC GGGTCGACTC GGGCGGTGGG GACCGAGCGG ATGTCGCAGT CGAGCGAATT GGATGATTAT GAGTCTCGCG ATACGCTGAC GGAGTTCGCG ACGTCGAGCG CGAGCCCGCG GGTACCGCCG GGGCGAAAGA GTCTTCGAAG ACGGTCGCTC GAACTGTACA GGCAAGCCGC CGCCGGCGCG ACGATGACAA ACTCGAAGAA AGCGTTGGTG GATGCGGCGA TCGATGAGAT CATCGCGGAC GAGGAAACCG GTGAATTCGT TCCGGGGACG TCGACGGCTG CGGAGAACTC GGTGTATCGC ACGATGTTTC ACACCGTTCG CCGTCTCGTG GACCTCGAAG CTTCCAAGCG CGCTCACGGC TCAGATAGCG ATCCATTAGA TACGTTACAG GCGATCGCCA ACGCGCTGAG TTTAGCGAAC GCGTTGATTT TACAGATTCA TGGAAGTTTG GCGAGCAAAC AGGCGGCGGT GGATCTGCTC GATCCAGTCG CGAAAGGCTT CGTCGAACAG TTTTTCACGG CGGACAACTC CTCGCTCATG CTCGCGCAAA ACATGGCGCA GTTGGCGATC GAGATGCGAG GACCAAGAAA TCACATCCAG AGCCAAAATC ATTCTGCCGC TTCTTTACCT TCGCGGGCGA GCAGTGAAGG CGTGCGAGAT GAGGACAGTC CGGGAGGGTC CATGAATACA ATTTTCGCCG ATGTAGGCAG GAATCCGATG TACACGAGTT GGACGTCGTT CGACATCTAC AAATTCGCCA GCGAGTGCGA GCGCGGACGT AAAGGTCCGC TTCAAACGCT TAGTATGGGT TTGTTTGACC GTTTCAATTT GTTTCATGCT TTACCTTTGG ATCGCAATTC TGTCTCAAAC TTTATTGCAG ACATTGAGAG GAAATATCGC GACAACGATT ATCACAATCG CGTTCACGCC ACCGACGTGA CTCAAGCCGC GGCTTATCTC ATCGAGACAA GTTTAGAGTC CCAGATTGAG CCTATACACA CTTTTGCAAT GCTTGTTGCC GCCATGTCTC ACGATGTGGG ACATCCTGGA GTCAACAACA CATTTCTCGT CAACTGTAAA TCCGCGGAGG CGGAGCGTTG GAACGATGTG AGCGTCAACG AGAATGGTCA CTTGTTCACA GCTTTTTCGT TGCTCAAGAA GCACGCGGTG CTAGCGAAAT TTACAGATTC CGAGCAGTCG GACTTGAAGA AGTGGTTACA AAAGATGATC ATGTACACCG ACATGGAGTT CCATGGCGAG CTGACGCAGC GAATGCTGAA GGAAATCGAG GACGAACAAG ATGAGGAAAC GAATAGTATC AAACCGATTA AACAGTGGCA AAATATTTGG GTACCGTTAG CATTCGCGCT ACATTGCGCA GATATTAGTA ATCCCGCCCG CCCGTACGAG CTAGCGTTGG CTTGGGCGCA AGCCGTCACG GCTGAGTTCT ACAAGCAGGG AGATCGCCAA CGCAAACTCG GGATGCGAGT AGAACCCTTC ATGGACAGAA GTCTGGCGGG ACCGGCTAGC ACACAATCCA ACCAGCTAGG TTTTATCAAG TTGGTTGTGA AGCCTTCACT TTGTGTTCTC GAGGCGTTCA TGCCCGCGGC TTCGCGACAT CTCTTGGATA CTTTGGAAGA GAACATCGCG 
GCGTACGGCA ACGACGTCGC CTCCGCGGCT CACGCAGGGA GTTAAAGGTG TA
|
Protein sequence | MGFFRKKTST RASSDEGREN ASAEGRRSLR DGADGTTSPK GARSGELSEN GSTRAVGTER MSQSSELDDY ESRDTLTEFA TSSASPRVPP GRKSLRRRSL ELYRQAAAGA TMTNSKKALV DAAIDEIIAD EETGEFVPGT STAAENSVYR TMFHTVRRLV DLEASKRAHG SDSDPLDTLQ AIANALSLAN ALILQIHGSL ASKQAAVDLL DPVAKGFVEQ FFTADNSSLM LAQNMAQLAI EMRGPRNHIQ SQNHSAASLP SRASSEGVRD EDSPGGSMNT IFADVGRNPM YTSWTSFDIY KFASECERGR KGPLQTLSMG LFDRFNLFHA LPLDRNSVSN FIADIERKYR DNDYHNRVHA TDVTQAAAYL IETSLESQIE PIHTFAMLVA AMSHDVGHPG VNNTFLVNCK SAEAERWNDV SVNENGHLFT AFSLLKKHAV LAKFTDSEQS DLKKWLQKMI MYTDMEFHGE LTQRMLKEIE DEQDEETNSI KPIKQWQNIW VPLAFALHCA DISNPARPYE LALAWAQAVT AEFYKQGDRQ RKLGMRVEPF MDRSLAGPAS TQSNQLGFIK LVVKPSLCVL EAFMPAASRH LLDTLEENIA AYGNDVASAA HAGS
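The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the source record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"). All variable names are hypothetical. The `tail` string is the last 52 bases of the gene sequence, copied from the record.

```python
# Values copied from the record; variable names are illustrative.
start_bp, end_bp = 511280, 513131
gene_length_bp = 1852
protein_length_aa = 614

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Bases implied by the protein (3 per residue) plus one stop codon.
cds_with_stop = protein_length_aa * 3 + 3

# Bases in the gene span beyond the CDS and its stop codon.
extra = gene_length_bp - cds_with_stop

# Last 52 bases of the gene sequence as listed in the record (spaces removed).
tail = "GCGTACGGCAACGACGTCGCCTCCGCGGCTCACGCAGGGAGTTAAAGGTGTA"

# If the CDS starts at base 1, the stop codon sits just before the extra bases.
stop_codon = tail[-(extra + 3):-extra]

print(span, cds_with_stop, extra, stop_codon)
```

Under these assumptions the span matches the stated Gene Length, and the codon immediately after the last residue of the listed protein is a stop codon, followed by a few bases that the record still counts as part of the gene span.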
|