Gene Information |
Locus tag | OSTLU_33216 |
Symbol | |
ID | 5003202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 573961 |
End bp | 575900 |
Gene Length | 1940 bp |
Protein Length | 513 aa |
Translation table | |
GC content | 65% |
IMG OID | 640418623 |
Product | predicted protein |
Protein accession | XP_001419205 |
Protein GI | 145349574 |
COG category | [S] Function unknown |
COG ID | [COG1806] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
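The coordinate and length fields above are linked by simple arithmetic, which the following minimal sketch checks. The variable names are illustrative. Treating the coordinates as 1-based and inclusive, and counting one stop codon after the protein, are assumptions rather than anything the record states.

```python
# Values copied from the Gene Information table above.
start_bp = 573961
end_bp = 575900
protein_length_aa = 513

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: three coding bases per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the protein does not account for.
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, noncoding_bp)  # 1940 1542 398
```

Under these assumptions the computed span matches the listed gene length of 1940 bp, and about 398 bp of the listed gene sequence lie outside the protein-coding region.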
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0330963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.247759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGCGCG CGCCGAGCGG CGATCGAATG CGCGCGCGGG CGACGCGGAC GCGCGCGTGG ACGCCGCGCG CGCGACCGCG AGGCGATCGA CGCGGTCGAC GCGCGCCGGG GCGCGGTCCG ACGGTCGACG CGCCGAACGG GGTGCCTCTA ACCGTCGCGA CGTCGACGCG CGACGACGCG CGGCGCGACG ACGACGCGGC GACGAGGGCG AGCGACGCGC CGAGCGACGC GGAGACGATG TCTCGAAACG CGAGCGACGC GCACGCGCGG ATGGCGCGCG AGCGGGCGCG GATTCGAGCG AAATTGGATA AGACGAAGGT GTCGCGGCTG TGGCGGGCGC CGACGGGGGA GACGCTGGAT CGCGTGACGG GGCTGGAGGT GACGGTGAGC GGTGAAGCGC TGCGGCGCGT GATGCCAGGG AATGGATACG TCGAGAATAG CACGCTCGCG GCGGAGCTGC GGGCGGCGGA GGCCAAGGCG GGGGCGCTGC GGGACGGGTT GCGACGCGCC GAGGCGGATC TCGCCGAGCT GAGGCGGGCG AGCGCGAACG GAGCGACGAC GTGGCCGTTG AGGCAGCCGG CGGTGCAAGC GAGCGATCCA GACGAGGTTC GACGCGCGGG TGCGCGACGA GGGACGGCGG CGCTGCGGAA GTTGTCTCGC GCGCGGTCGT TGTCTAGTAA GAAGCGAGGA GGTTCGTCGA CGGAGACGAC GACGGAGACG ACGACGGAGA CGACGACGTC CACCTCGACC GCATCGGTGC GCGAGGTGGT GAGCAAGTTC GTGCGCGATA CGCCGATACC AGAGGGTATG ACCGCGGCGC CGAGTGATAG CAAATTGCCG AAACCCATAT TCGTCGTGAG CGATTCGACG GGCGCCACGG CGACGCAGGC GGTGCAAGCT GCGCTCGGCC AATTTGAAAA GTGCATGGAT ATTTCCTATC CGACGAATTT AGAAGTTTTC AGATTCATCA ACGACAATAA AGAGCTCGCG ACCATCGTCG CTCAGGCGAA AGAAGACGGG GCGATGATCG TGTACACGCT GGCGGAGGTT GAGATGTCAG AAAGCATGGC GCGGATGGCC GCCGCGGCGG GCGTCGACGC CGTCAACGTC TGGGGCACAC TTTTGCGTCA AATGGAGGGG CATTTGGAGA TGCCTTCGAT GAATCTACCG ATGATCAAGC GACCGATTCG TTCGAGCGGA TCGACGAACT CGATGTCGAG CGCGAGCGTG TCGGCGGCGT TGTTGTCGAG CGATTACTAC CGTATGATCG AAGCCGTGGA ATACACTCGA CAGTGCGACG ACGGCGCGTC GAGCGCGAAG TGGAAAGAGG CCGATATCCT CATTCTAGGC ATCAGTCGCA CCGGAAAAAC TCCGCTGAGC ATCTTTCTCG GACAGCGCGG ATACAAAGTC GCGAATTTAC CCTTGGTCCC CGTCAACGGC GAATTACGAA TCCCGTCCTA CATCGCCGAC GTCGATCCAA ACCGCATCTT CGGCCTCAAA ATCTCCCCCG ACGTTCTCCA CGCCATCCGC GCGCACCGCC TCAGGACGAT GGGGGTCACC GAGGCCGAAT CCCAGGCCAG GAGCGACGCC ACGGGCGTCT CCAATCGCGC GGCGTCGCGC CGCTCCTCCT ACTCGGACCT CTCCGTGGTT CGCCAAGAGC TCGACCTCGC CGAAGCCTTA TTCCGGCGAA ACCCGACGTG GAAAGTCTTG GACGTCACGC ACAAGGGAGT TGAAGAGACC GCGGCGCGCA TCATGGCCGT GATGACCGAG CGTTTCGGCC GCGCCCACGT TCTCTCGCGC 
TTCTCCGCGT GAGCGTCTCG ATCCTTCCCC GCGCCGTGCG CACACAGCCC GCACGCACGC ACATCGAACA AATAATCCTC ACTGTATGAC CAAATGTCCG AGTTTCTTTA TTTATCCCGT CACGTCACCG TCAACGCGAC
|
Protein sequence | MARERARIRA KLDKTKVSRL WRAPTGETLD RVTGLEVTVS GEALRRVMPG NGYVENSTLA AELRAAEAKA GALRDGLRRA EADLAELRRA SANGATTWPL RQPAVQASDP DEVRRAGARR GTAALRKLSR ARSLSSKKRG GSSTETTTET TTETTTSTST ASVREVVSKF VRDTPIPEGM TAAPSDSKLP KPIFVVSDST GATATQAVQA ALGQFEKCMD ISYPTNLEVF RFINDNKELA TIVAQAKEDG AMIVYTLAEV EMSESMARMA AAAGVDAVNV WGTLLRQMEG HLEMPSMNLP MIKRPIRSSG STNSMSSASV SAALLSSDYY RMIEAVEYTR QCDDGASSAK WKEADILILG ISRTGKTPLS IFLGQRGYKV ANLPLVPVNG ELRIPSYIAD VDPNRIFGLK ISPDVLHAIR AHRLRTMGVT EAESQARSDA TGVSNRAASR RSSYSDLSVV RQELDLAEAL FRRNPTWKVL DVTHKGVEET AARIMAVMTE RFGRAHVLSR FSA
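The sequences above are displayed in space-separated groups of ten, so any calculation on them has to strip the whitespace first. A minimal sketch of a GC-fraction calculation under that convention follows. The function name `gc_content` is hypothetical, and the record does not state how its own GC content figure was derived.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used for display grouping.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base groups of the gene sequence shown above.
print(gc_content("CGCGCGCGCG CGCCGAGCGG"))  # 0.95
```

Passing the full gene sequence as one string, with its grouping spaces and line break left in place, works the same way.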