Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27537 |
Symbol | |
ID | 5005452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 460732 |
End bp | 462815 |
Gene Length | 2084 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 52% |
IMG OID | 640420873 |
Product | predicted protein |
Protein accession | XP_001421295 |
Protein GI | 145354022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00178801 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.844643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACACA TCGTGAATAC GATCGTACCC CAAGCACCCC CAACCGGTCC CAACGCAGAA GGGTCTATCG AATTGCTGGA TGAACAACTG GCATCAATTC TCAAGGACGT CGACGTCCGA GCGCGGGCGA ACGCGTTAGA TGCCCCAAAT TCTGCGGCAG TACATATCGT GATGAAAGTC CACGCGTTCG ATACAATCAT ATGTCGACTT GACGAACATT TGCTCGAATT CGCGGCGGCA GAACTTGTGA GTGAGCTTGC CGAGCTGGGT CGAAGACACG CTGAATTACG ACATACGCGA AGTTGTGTCG CGATAGGTTT CGGTGACGCA GAGCTTCGAG CTACTTTCGA TGCGCTACGA TCGTCTATCG AGACACTTCG CCATGCCGTT GAGAACGACG ACTTCGAAGC TGTATTCACC ACTTTGAATG GTGGGAAGGA GATCATTGAT TCTCACTTAC GATGGTGTCG GGATCAATGT GAATTGTTCA AGAGCGCCGA GGACGAGCAC CCCGATTTCC TCGAGGACGA GCCTCCACTG ACGACCAAGT TTTTCAAGGC GACAGTCAAG GTGATACAAT CCCATGAAAA GCAACTGTCT AAATTCAAAT CGGAGTGTAT TCGTAACTGG CCCCCGCTTT TCAAGAACTT TGGCCGCGTC AAGTACATAC AAGGCTTTGC TGAGCAACTT CAGAATCAAC TCCATCTCGA TCTCAATGAA AGCACGCCGG AGGAGGCCAA GGCTGTTTTC GATCAGGTGT TGGCCTTCAC AGATGGCTTC GCGGTTTGGT GCGGCGATTT AGTGGCACAG GTCGAAGGCT CAGATGTCTT CATCTTGTTT GAGCCAATCG CCCGCGCGAT GATGGAAAGC TTGAAAGCCA TCAGAACAGA CGCTGAGTCG CAAAGTGCAC TCGCGAGGCA AGCATTGAGT ATGATAGATT ATGCGGTGAC GCAACGGGCG TTGGCCGCAA CGACGCGATC GGGCTTGCAG ACAGCACTCG CAGATGGTAC GTCTGCGCTT GAAGACCAGC TCGCCGCAGA AATTGCACAC ATGGACGCTG CGAAGATTTC TCGAGAGACT ACTTTCGAGA CCGACAAGAT GCGCATCGAG GAAGCTTTGA CTACATCAGA GCAAGAGTCG GACGGTAGGC GAGCGGTTTT AGAGGATGAC ATTCGATCGC AACAGGACAC TATCGCGCGG CTACAAAACT TGAAACGGGA AGAGGCCGCG GCATCGAGCT CTATAAACCC GTTGAAATGG GAGCGTTTCA GTCGAAGCAC TGAGCTCGCC GAAGCTGTAA GTGCACTCCG TATGAGCAAA CAAAACTTGA AGAATGAAGA GCTTGCAAAT GAGAAGGCCC GCGGGGATGC CCGTAAGGCT GTTCATTCGT TGGAACGGGG CCTGAAAAAG GACCTGGTCG AATGTGACGT CCGAGCAAAA AAAATCCGAG ACGATGTGGC AAAGGAGCTG AAGAAACAAA CTCTGGATGC TCAGAGGAAG ACACCCGAAG ACGCTAAAGC CCATAAGGCT AGTCTGGCTT CCAGGCGCGC GGTGTTCCAG CAACAACGCG CCGCCGCCAA ACTCATCGCG AGCGAAAGTA GAGAAGCTAC GCGAATCCAA GCACAGCTTG CATACTACAT GCGAAATTTA AAGGCGCGCG AATCGTGATA TAGCGGTCGT CTACGAGGAA TACACGCATA GTCTAAACAT GACCATCCTG TTGGAGAAGG AGCTGAGTTT CCGCTTCGTT ACGGATGAGA TTGTCGCGAG GAGATGAGCC TTCAAATTCG GGGCGATGGC TCACAAATCA GGCAAGGGAA GACAGGTGCA CGTGCAAACT CTCAGCGGAA AGAGTTCATC GTCTCGAAGA TCTCGGTGTT TGGTGAGACA AACTCGACAC ACAGCAGGAG CACTTCACTG CCTGAGTGTA CAGAAGATGT AACTAGCCTG AAGCGTTCGA CCGTTGAGAC CTCAAGTTTT GGTGCCCTCG CTTAAGATTT CTTCAGAATA GAATCTCTGA AAGCAAGGGA TGCTCAAAGC AGCAAAAATG ACTTCCTACT ACGGGCCACA CAGGACGCCG AGCC
|
Protein sequence | MAHIVNTIVP QAPPTGPNAE GSIELLDEQL ASILKDVDVR ARANALDAPN SAAVHIVMKV HAFDTIICRL DEHLLEFAAA ELVSELAELG RRHAELRHTR SCVAIGFGDA ELRATFDALR SSIETLRHAV ENDDFEAVFT TLNGGKEIID SHLRWCRDQC ELFKSAEDEH PDFLEDEPPL TTKFFKATVK VIQSHEKQLS KFKSECIRNW PPLFKNFGRV KYIQGFAEQL QNQLHLDLNE STPEEAKAVF DQVLAFTDGF AVWCGDLVAQ VEGSDVFILF EPIARAMMES LKAIRTDAES QSALARQALS MIDYAVTQRA LAATTRSGLQ TALADGTSAL EDQLAAEIAH MDAAKISRET TFETDKMRIE EALTTSEQES DGRRAVLEDD IRSQQDTIAR LQNLKREEAA ASSSINPLKW ERFSRSTELA EAVSALRMSK QNLKNEELAN EKARGDARKA VHSLERGLKK DLVECDVRAK KIRDDVAKEL KKQTLDAQRK TPEDAKAHKA SLASRRAVFQ QQRAAAKLIA SESREATRIQ AQLAYYMRNL KARES
|
| |