Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_94665 |
Symbol | |
ID | 5003727 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 148229 |
End bp | 150194 |
Gene Length | 1966 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 53% |
IMG OID | 640419148 |
Product | predicted protein |
Protein accession | XP_001419503 |
Protein GI | 145350201 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAGA GCGATGGATG CGCGACGTCG CGAGACTTTG CGAGCGTGAA GATGGAATAT TTTAAGACGC GCGCGCGCGA GGCGCTGGGG AACGGGAACT ATAAGGAGGC GTACGCGACG TACACGAAGT GTCTGGAGGA ACTCGCGCCG AGGGACGCGG GCGAACGCGC GAAACTGCTG TGTAACCGTT CGATGGCGTA CGCCAAGTCT GCAAACTTTA AAGCGGCGCT GGAGGACGCG TCTGCGGCGG TGTCGTTGAT GCCGACGTTC GCCAAGGGGT GGTGGCGCAA GGCGACCGCG CACGTGGGGC TGCGACAGTT CCCGGACGCG CTCGCCGCGT ACAGGCAATC GTTCGAATGT CAGGACGAAG GCGACGCCGG CGTGATCGAC GAGCACGTGA AAGCGATGAA TAAAACGATC ACTTCATTTA CGCGCGAGCA ACTTGCGAAC TGGATTCTTG AAACTCTGCA AGACATGGAG AATCGCGAGC TCATCGAGCA CGCGCACCTC GAAAACGTCA CGTCGTTGGA GATGGCAGAG GGAATGTTTT GTCAAATCAA AGGCATAAAT GAAGGAAGCA AGCCGCGCGG CGACTACTAT CGGTTCGTAC AACATTGGAA CGTACACGGT ATGAGCGTGC CGATGGCTTA CACCCAGCGT GCGAGCATGT ACCGCCACGC GTTGTGCTTT ATGCAAGCTC GCGCCGACGC GGCGGCGGCG TTGACCTTGT TACAAGACGA CGCATCGCTT TCTGACAAGG AAACGGAGAT GACCTTCACG TACAAAGTCG ATGACTTCTC GCCTTTCAAG GAAGATACAG TCATCACCAA AGCTTGGGCG TGGTACGAGA TGGGCAAGGC TTTTGAAGGG CGCGAAAATG GTGACACGCA AGCGGCGGCG AAGTGCTTCT CGGCGATAAC GCACTTGGAC ACCAAGTATC CGATGTTCAC GAACGCCTTC AAGAATGTGT GCAGTCGTAT GACTGATGTC GAGGCTGGGA AAATATTGAG CGACATCAAC GATCAGTACG GCGTGGCGGA GTATGGCGTG CGCACCGTTC CCGACGCTTT GGCAACTTAC GTGGTCACCG TCAGCCTTCT CTTCAAAACT GGCAAGCTCG TCGGCTTCAA CTCTAAAGTG CGAGACTCGT TTCGAGAAAA CGTCGCGAAG GGTGCGAATG TCATCAAAGA AAAGGTACTC ATAGAAAGCG TGCGCTCAAG GTTTCGTGGA GCGCCTGGTG TCGCGCTTTC GTATCGAATT CTCGCCGGCG AAGATAAATT AAACTCCGAG GTACGTGTTT GATATCCGTC GTTCGGCGGC TGGTATTCAC GATGTTAGCG CAGAAAATGC TCGAGAAAAT TCAAGCACAC GATTTGATGT TACTGGGTGG CGCAGAAACG GAGGCTGCGA TGGGTACGGC GAGCGACGTG CAAGGTGAGT TAATTAAATG GAATAAATTG ATGAGGAATT AAATAATTCA TCACGTAGGC GAGCTCAAGG AACTGAAGTC GGAGGACTTG GTAGATCGCA TCAACAACTT ACAAGGAATT ATCTTATCGA CATTAAAAAG TGGCAAGGAG ATCGTAGTCG CGAAGCGACC ACCAACCGAC ATGGAGCTTC CTTACAGAAC GTACAAATTA GTTTACAACG ACGGCACCCC GGTCGAACGA GTGAACAAAA CGGGCTATCA AATGTCACAA GTGCACTACT CGGCGCAAGG AATGCACAAA CGCGAAGTAT GGGCTGAGAT GATGGTGCGA ACGACCGCAA TTAATTATTT ACAATTAAAT TAAATTTGAC ACAGGACAAG TCTTGCCGAT GGCACCAGAG CTCATCTGAG ATAGCCGTTC AAGCGTTACA AGTGCCGAAG GACGCCAAGA AGCAAGATTT GGATGTCTGC ATCACGCCGT CGACCGTCAG AGTGTCATGT CGAAACACTG GACGCACGTA CCTAGAGGTG ATTTAA
|
Protein sequence | MEESDGCATS RDFASVKMEY FKTRAREALG NGNYKEAYAT YTKCLEELAP RDAGERAKLL CNRSMAYAKS ANFKAALEDA SAAVSLMPTF AKGWWRKATA HVGLRQFPDA LAAYRQSFEC QDEGDAGVID EHVKAMNKTI TSFTREQLAN WILETLQDME NRELIEHAHL ENVTSLEMAE GMFCQIKGIN EGSKPRGDYY RFVQHWNVHG MSVPMAYTQR ASMYRHALCF MQARADAAAA LTLLQDDASL SDKETEMTFT YKVDDFSPFK EDTVITKAWA WYEMGKAFEG RENGDTQAAA KCFSAITHLD TKYPMFTNAF KNVCSRMTDV EAGKILSDIN DQYGVAEYGV RTVPDALATY VVTVSLLFKT GKLVGFNSKV RDSFRENVAK GANVIKEKKM LEKIQAHDLM LLGGAETEAA MGTASDVQGE LKELKSEDLV DRINNLQGII LSTLKSGKEI VVAKRPPTDM ELPYRTYKLV YNDGTPVERV NKTGYQMSQV HYSAQGMHKR EVWAEMMDKS CRWHQSSSEI AVQALQVPKD AKKQDLDVCI TPSTVRVSCR NTGRTYLEVI
|
| |