Gene Information |
Locus tag | OSTLU_25091 |
Symbol | |
ID | 5003865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 625476 |
End bp | 627216 |
Gene Length | 1741 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 57% |
IMG OID | 640419286 |
Product | predicted protein |
Protein accession | XP_001419860 |
Protein GI | 145350962 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.063384 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCG TGAGACTGAC GAATGCATCG GATGACGCTC GGCGACGCAG TGATGGCGAT GGAAAGGCGA CAAACGGCGG CGGCGTTTCG GACGTCGCGC GCTTGGCGCT GAGCGCGGAG GCGTCGATTC AACGAGGAAA CGGCGGTGGT GGGCGCGCGC GAAGGCAACT CGATGTTGAA TTTGGTGAGT TGGACGTCCA AGCACTTCAA GATGATCGAT ACGTCTTGCA AAACATGTAC GACGAGTTTG ATGATTCCGT CATCGGCGTC GAAGAGCAAG GCTCGTATCC ACGTCGAGGA GGGGCGTCGC GAGGGGACGG TCACGAGGAT TCTGCGTCTA AACGTTCGAG CACGACTGGC GCGAGCGTCG AATGGACGAA AGGTTACACG AAGACTCTGA AGAACTTCGA TGCGGATTCT CCTCGCGTGG GCACGCGTAG CGCTCGCGCG CCGCGCGGCG GCGTGGAGGC ATCCAAGGCT GTCGTAGCGA AGCGCGGCGT CGCGACGGCG AAGAACTCCA GTCTGGCATC TAACGCGAGT GCCATGAAGA ATGCCAGTGC GTGCCGAGAG TTCGTTGCGA GTAAGAAGTG CAACTGCAAG AAGAGCAAAT GTCTCAAGTT GTACTGCGAA TGCTTCGCCG CCGGCGCCTT TTGCAAGGAT TGCTCTTGTC AGCAGTGCCA AAACACGACG GAGAACGAGG CTATCGTGAC GAAGACGAGA CAGCAGATTG AGCAGCGCAA CCCGTATGCG TTCGAGAGCA AAATCATGGC TGATGCAGGC GACGATGCGC GTCACACCAA GGGTTGCCAT TGCAAAAAGA GCGCGTGCTT GAAGAAGTAT TGTGAGTGCT TCCAGGCGGG CGTCAAGTGC CAAGATTACT GCAAGTGCGA AGGTTGCAAG AACAACGACA ACGGTCCTTC CCCCGCGCTT CCACGAGGTG GAGCCGCGAA GGCGTCGAAA GTCTCCAAAT CTAAAGCGCG CTCGCGAGAG ACCGTCGCTG CAACGACGAT GGCGGCCAAG GAGTTGCAAG ATGACCTCAT AATAGAGGAT TTCAAGTTGA CGGGGGTGAT GGGGTCGCCG CTTCGAGCGT TTGAGCATAC CGACGACTTA TCGAGTGAAT ACTCGAACTT GAGCAAGTCT GTGGAGCAGT CACCGCTGCG AACTCTTCTA CTTTCCGAAG GCGTTCAACT CTCGCCCTTC TTCTCTACGA GCTCACTTCC CGCGACTGCG TTGTCTCCTC TTCGCGCCGG ACAGATGTCT CCGCTTCGTC CAACGCCATC CCACGGTTTC ATGTCTCCGC TCACGCCGGG CATGCGTCCG GGCAAGTACA GCATTCGGTC TTCGTCATCT CGCGGTAAAG CGCCCGTACC GCTCTTCAAC GAGGATAGCG GCCACGGTGG TGAGTTCAAG ACGCCGCGAG ACAGAAAGAC GAACACTGTC TTCGGCGGCA TGCACGACGC TGCGAAGGAC GGTCCGTTGC GAGCCACGTT CACTTCACCC ATTCCGCTCA GCACGCCTGA CGTGTACGAG TAAATCTTTA AGTATTCGCT GACGCGAGTA TCATTCTGTG CGCATCCAAG ATTGCTCCGC TCAATTTGCG TTTCGCGACG AGCGACGCCG CTCGACGAGC TCGGCGACGC TTCGCTTTCT GCTTATTCTG TTCGCGATCG ACGGCACGAT AGCTAGTATA GTCACACCGA CACACTCACA CATGTAACAT ACCCTTTTTT GATTACTTAA A
|
Protein sequence | MNRVRLTNAS DDARRRSDGD GKATNGGGVS DVARLALSAE ASIQRGNGGG GRARRQLDVE FGELDVQALQ DDRYVLQNMY DEFDDSVIGV EEQGSYPRRG GASRGDGHED SASKRSSTTG ASVEWTKGYT KTLKNFDADS PRVGTRSARA PRGGVEASKA VVAKRGVATA KNSSLASNAS AMKNASACRE FVASKKCNCK KSKCLKLYCE CFAAGAFCKD CSCQQCQNTT ENEAIVTKTR QQIEQRNPYA FESKIMADAG DDARHTKGCH CKKSACLKKY CECFQAGVKC QDYCKCEGCK NNDNGPSPAL PRGGAAKASK VSKSKARSRE TVAATTMAAK ELQDDLIIED FKLTGVMGSP LRAFEHTDDL SSEYSNLSKS VEQSPLRTLL LSEGVQLSPF FSTSSLPATA LSPLRAGQMS PLRPTPSHGF MSPLTPGMRP GKYSIRSSSS RGKAPVPLFN EDSGHGGEFK TPRDRKTNTV FGGMHDAAKD GPLRATFTSP IPLSTPDVYE