Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_23972 |
Symbol | |
ID | 4999811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 1035865 |
End bp | 1037760 |
Gene Length | 1896 bp |
Protein Length | 629 aa |
Translation table | |
GC content | 61% |
IMG OID | 640415232 |
Product | predicted protein |
Protein accession | XP_001415671 |
Protein GI | 145341138 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGA TGGCGACCCG CGACGCGCGC GCGACGGCGC GACCGACGCG CGACGCGATG CTCTGGACGA TCGAAGAGCT CGCGAGCGCG GGCGTCGACG CGACGACGCT GCGAAACCTG CTGCCGGTGG GACTCGCGGG CGAGGGCGCG CGGGCGGCGC GCGCGCGGAA GCGCGTGCGA CATTTTTTTC TGCTGAGCGC GTCGATCGCG GCGATCGAAG GGGCGGAGGG AGAGGAGACG GAGAAGAGTC TGGAGGCGCA TCGAGACGCG ATGGCGCGCG CGAGAGACGC GTGCTTTCAG GAGGACGACG GCGACGAGGA GGATTTGATC GGCGAGGATA CGGAGGCGTT TCGGGCGTAC GTGAAGGCGG CGGAACGAGC GATCGAGGTG CTGGGGATGC GGGAGGCGTG CGAAGGGCTC GATGAGCGCC GTTCGGCGCG CGACGAGACG CCGCGAGACG CGGTGCGACG GGTGGTCGAG GAGTGGAAGG ACGCTTTGCT CGGCGTCCAC GGTCGGGACG TGGACGAGCA CTTGGATCGC ATCGAGGACG CGTGGACGCG CAAGGCTGCG GATCGTTATC ACAGATTGTG CGAGGTTGAG GAGTGGGACG AGCCGTCGCG CGCGTTGGTG AAAAAGGCGC TCGAGCTCTT GCGCGCCGAG CTTTGCGGGA AAGCGACGAC GTTGACGACG GTGCAGGAGC ACTTGGCGAA CTCGTATTAC GTGCCTTCGC GCGCAAAGCC AGGGTTCACG CTGGGTAAGC GTGATGATGG CGTGCGCGAG TGGGTTGACG TCGGCGCGCG TGGCGTGAAG CGCCGAGCCG AGACGAACGC GGACGCTTTC GTCTCGTCGC CATCGACCAA GCGAGTTCAT TCAGAAGAAA CATCTCCGGC CAAACCTCGT TCGTCTCCGG GACGATTAGG CGCCTTGCTG TCGAAGACTA TTTCGTGGGT GCGCAAATCG GTGGACGCGC CAGCGTTCAT CAAATCACCC TTCGGCAAAT CACCGCTCGG CAAGTCATCG CGGATCACGG CGTCGCAAAC CATCGACGAC GTAGAAGAAA GACGTACAGA CGAAGTAGAA ACTCTTGAAG ACGACACGCC CCCCAGCGAG CACGAATCAG AAGAAGAAAT AATGCCGACG CAAGTGCCTA CGGGCTCGTA TACTGACGAA GATGACGAAG ACGAAGACGA AGATGAAGAC AAAGACAAAG ACCCGGAGCC ATCTCCGACG CAAGTGCATA CGGGCTCGTA TACTGACGAA GATGACGAAG ACGACGACGA AGATGAAGAT GAAGACCCGG AGCCATCTCC GGCGCCGACG GAGCCGTCTC CGGCGCCGAC GCAGACAATA CAACCGGCTC CGCTCAAACG GCAAGGAAAA GTGTATCCGA AGGCACAACG ACTGGTGTCG GTGAGAGCAG CACACGCGCG CTCACCTTTA ACCGGCTCCG ACGACGAGTA CGACGAAATC GAAATCGAGG CGACGCCTGG CAATTACCTC GTGCCCCGCG TCAATCGACT CCAGCCAGTG AAGACCAGTG TCAAGCAATC CCCAACTCGT AAGAGAAAAT ACGAGAGACA AACAACAAGA CGCGCGCCTG GACGCCCGAA GAACTGGACG CCCGAGGAAG AGACCGCCCT GATCGAGGGC GTGGAAAAGT TTGGCAGTGG CAAGTGGAAA ACGATTTTAG CAGACGACGC GCGCGGTAAG AACGTTTTCG CCGCCAACGC CCGGACAAAC GTCGATTTGG CGAAAAAATG GTACCATCTA CGCCCATCTC ATTTGAGCAA CATGTGGCGA CAGCACGAGC AAGATCAAGA AATAGTGGCA CGCCAAGAGA AACCTAAGTT GGATTACATT ATCGACGCAA TCCTAGAGGG CAGTCATTGA CTTCGA
|
Protein sequence | MATMATRDAR ATARPTRDAM LWTIEELASA GVDATTLRNL LPVGLAGEGA RAARARKRVR HFFLLSASIA AIEGAEGEET EKSLEAHRDA MARARDACFQ EDDGDEEDLI GEDTEAFRAY VKAAERAIEV LGMREACEGL DERRSARDET PRDAVRRVVE EWKDALLGVH GRDVDEHLDR IEDAWTRKAA DRYHRLCEVE EWDEPSRALV KKALELLRAE LCGKATTLTT VQEHLANSYY VPSRAKPGFT LGKRDDGVRE WVDVGARGVK RRAETNADAF VSSPSTKRVH SEETSPAKPR SSPGRLGALL SKTISWVRKS VDAPAFIKSP FGKSPLGKSS RITASQTIDD VEERRTDEVE TLEDDTPPSE HESEEEIMPT QVPTGSYTDE DDEDEDEDED KDKDPEPSPT QVHTGSYTDE DDEDDDEDED EDPEPSPAPT EPSPAPTQTI QPAPLKRQGK VYPKAQRLVS VRAAHARSPL TGSDDEYDEI EIEATPGNYL VPRVNRLQPV KTSVKQSPTR KRKYERQTTR RAPGRPKNWT PEEETALIEG VEKFGSGKWK TILADDARGK NVFAANARTN VDLAKKWYHL RPSHLSNMWR QHEQDQEIVA RQEKPKLDYI IDAILEGSH
|
| |