Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_377 |
Symbol | |
ID | 5005028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 152473 |
End bp | 154683 |
Gene Length | 2211 bp |
Protein Length | 737 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420449 |
Product | predicted protein |
Protein accession | XP_001420916 |
Protein GI | 145353215 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.103149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000398719 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAGGCGC GATTGGCGGA GTTGGAGAAC ACGGCGACGG GCGCGAACGC GCAGACGCAC AGGATATCGC AGCAAGAGTA CGTGACGCGG TTGCACAGGC TGAACGAGGA CATCGCGAAC GCGTGGTTGG CGGAGGATCG GGTGAACGCG CTGAAATTGT GCGTCAAGGT GGCGAAACTG TTGGGCGACA CAAAGGTCGG GAAGTTTTAT CCAGTGCTGT TCGTGCTGGT GACGGAGGTG ATCGAGACCG TGGGTCGGTT GGTGTACGAT CGGATATTGC GGAAATCCGA GGAGTCGACG TCGGAGGGGG GGACGAAGGC GTTGCCGGAG GACTTTAAGG CGTCGCGCGT GAGAACGGTG GCGAAGAATA CGTGCAGAAA TTGGTTTTAC AAAATCGCCA CCATCCGGGA CATCGTGCCG CGGATTTACA TGGAGCTCGC GCTATTTAAG TGCTATAGAT TCATTCAAGA CGAGCCGCCG ACGGTGCAAA TACGACGACT GATGAAGATG TCTCGCGGAG TGGCGGATCC GCTCGCGGCG GCGTACGTTC GCATGTACAT CGCAAAGTGC GCCCTGGCGT ACGGCTGCGA GACCGAAGAC AGCGCGATGA CGTTGGAAAT ATTGAAAGAG TTCATGCCGT CCTACGTGAA CGTGCTCGAC GAGACCGCTG ACGAGGACCC AGCGACAGCT TACATTTTCC GGCTCGGCTT GCGCCGAACC GAGTACTCGG AACTCATGGA TCCCGCCATG GAGTGGCTAA TCGAGTGTTG CGCGACGAAC CCCAACCCGT CGCTGCTGCA TAAAGTGCTG TATATGGGCG GTGAAACTCC GCCAGTGCCG TTCCTTCGCG CCGTGTTTCG CTCCTTGTCG CCGACGGTCG TTCGCGAAAA CGCCCTCAAG CTCATCGCGC TCGTGGGCGC GACGTCCACG GATGAAGATT CACTCGAGCA TAGGGACGCC ATGGCTGATT GCTATCGAAC CCTCGTCGAT AAGTTTGATG TCATCGCGCC AAACGAAGGC GATCGGTTGG CGATATTAGG CGACGTTTGG CGCGTCGTGC AGAAATGGAC TCACGTGGAA CCGTATTTGC GCGTCGCCGA GCGGTTTTTG CTCTACGTCA TCAAGTATCT CGCGCAAGGT GAACTTGAGA CGTTGATGAA GGACGTTGCG AGACACGTGC ACGCACACTT AGCCCAAGCT AAAGAGCGCG AGCCGACGAA ATCGGCCGAA TTACCCGCCG AGGCGATGCG CTGCGTTGAG CGGGCGTTAC GCGTCGTCAC CGATCAATTT CGCGACGTCG ATTACCTCGT CTCCTTGAAG TGGTACGTGT ACCTCGGCGA AATATTAGGC GGTGAAGCCA AGGTGAAATT TAGCGCCGCT CTTCTGAAGT GTGGGTCGAG CGCTGGATCC ATCTCGGATC CCATGTGTTT GCACACCCTG CTAGAAGCCG CGAGAACGGT ACACGACGAT ATCGACGGCA TGAGCTCGGA AGAATCGCGC GCCGAAGCTG AAAGCTTGGT CGTGGACTTC ATCAACAACG TCAGCTTTGC CGGTGACTTC GAGGCGCACT TGAACTTCCT TGTCACCGCT CGAAGCGCGT TCGCAAACTT GGACATGGCG CAAGAAATTT TAGTATACCA CGCGATTTCT CTGATGACGA GTGTGTACGA GCGCGTGGGC GTCGAACACA CGAACAAGAC GAAAGCCTTC GTCAACGCCT GCACGGCGTA CTGTCAAATC ACGATACCCA GTGTGCGTGG AGTCAATGTT CGCCTACAAC TGTTCATGCT CACCGCGCAA GCGGCGATCG TGCACGGATT GATTCAGCAA GTGGATGGTC TGATTCGCAG CGCGGTGACG GATGCGCAAG AGAGCGGCGG CGAAACCGCC GTGGGCGGAT GGATCGATCT TCCCACCGAT CGCGCCGGCG CTGAGATGCT AGATTTCGTT CGGCGATGCA GTGCTCTTCT CGTGGTACAG CCAGGGAATT TAGAAAAAGG AGCATTCTTG GTCTTCCGCG GGTTGATGAA AGTCGTGGAG GATTTCGATT GGGAGCCGTC GAGCGCCGAC GAAGTTCGCG CGTACGTCTC GTTGATCCCC ATGATCACTG CCATGGCGCA AGAGACGATA CCCTTTAAAA TCGACGGCTT ACAGTCCAAC GACGTGCTAT TCGCCGGAGA GGAGGCGTAC GTGGAAGAAG CCACCGAGCT C
|
Protein sequence | VEARLAELEN TATGANAQTH RISQQEYVTR LHRLNEDIAN AWLAEDRVNA LKLCVKVAKL LGDTKVGKFY PVLFVLVTEV IETVGRLVYD RILRKSEEST SEGGTKALPE DFKASRVRTV AKNTCRNWFY KIATIRDIVP RIYMELALFK CYRFIQDEPP TVQIRRLMKM SRGVADPLAA AYVRMYIAKC ALAYGCETED SAMTLEILKE FMPSYVNVLD ETADEDPATA YIFRLGLRRT EYSELMDPAM EWLIECCATN PNPSLLHKVL YMGGETPPVP FLRAVFRSLS PTVVRENALK LIALVGATST DEDSLEHRDA MADCYRTLVD KFDVIAPNEG DRLAILGDVW RVVQKWTHVE PYLRVAERFL LYVIKYLAQG ELETLMKDVA RHVHAHLAQA KEREPTKSAE LPAEAMRCVE RALRVVTDQF RDVDYLVSLK WYVYLGEILG GEAKVKFSAA LLKCGSSAGS ISDPMCLHTL LEAARTVHDD IDGMSSEESR AEAESLVVDF INNVSFAGDF EAHLNFLVTA RSAFANLDMA QEILVYHAIS LMTSVYERVG VEHTNKTKAF VNACTAYCQI TIPSVRGVNV RLQLFMLTAQ AAIVHGLIQQ VDGLIRSAVT DAQESGGETA VGGWIDLPTD RAGAEMLDFV RRCSALLVVQ PGNLEKGAFL VFRGLMKVVE DFDWEPSSAD EVRAYVSLIP MITAMAQETI PFKIDGLQSN DVLFAGEEAY VEEATEL
|
| |