Gene Information |
Locus tag | OSTLU_5662 |
Symbol | |
ID | 5003217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 426882 |
End bp | 427796 |
Gene Length | 915 bp |
Protein Length | 305 aa |
Translation table | |
GC content | 52% |
IMG OID | 640418638 |
Product | predicted protein |
Protein accession | XP_001419159 |
Protein GI | 145349477 |
COG category | [L] Replication, recombination and repair; [R] General function prediction only |
COG ID | [COG0494] NTP pyrophosphohydrolases including oxidative damage repair enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.059754 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GAGATTTACC TGGAGCTCGC CGCGCGGTTC GTGTTGAACG CGCCGCCGAG CGAAATCGAG GATTTCAATC GATTGATGTT TCTCATAGAG CAGGCGCACT GGTATTACAT CGACTTTACG TGCGAACAAG ACGCGTCTTT GAGCGCGAGC ATGGGTTTAA ACGAGTTCGC CAAGGGGATG ATCAGCAGCG TGGAGACGCT GAAGTCGCGA ATCCCTGGAT TTAAGAATAA TTTCGAGAAG TTTAAGTCGT ATAAGTTTGC AATCCCGACG TGCGGCGCGG TGCTGTTGAA TCCGACTATG GATAAATGTT TGATGGTTAG AGGGTGGGGC AGTCAGAATA AGTCGTTAGG ATTTCCCAAA GGGAAGATGG ATGCTAATGA GACCGAGGCA GAGTGCGCGG CGAGAGAGGT TGAGGAGGAG ATCGGGGTGG ACATTCGACA GTTCATAGTG GAGGAGGACA AGGTGGTGTT TATGCGAAAG CGGAAACCAT CCGACAAGTT GGGTCAAAAG AACACGCTGT ACTTGATTCA GGGGATCTCC GAAGAGACAA AATTCTTGAC GCACACGCGA AAAGAAATAT CGGACATCGT GTGGAACCCT CTCTGGATCT TTGATCAGCC CGAAGAAGCG TTGAAAAAGT TCAAAAACAA GTATGGGCAA ATCTACCCGG CTTTACGGGA CATCATGGCG TGGGTGAAGC AGAACAAGAA AAAGCACCCG AAGCCGCGAA CGAATCAAGT AGCGGCGCCG CAATCGCCCG CCGCGGGACG CGCGGCGCCG ATGTCTCTAG AAGATCTCGA AAACGAGCTC ATGTCTGGAT ACGACGACGA ACCCGAGGAC GACGAGCCCG CCAAGCCGCA CAAGGCGTTC GAGGCGTTGA CGAATTTCAA GTTCAACAAA GCACGTATCC TTCAG
|
Protein sequence | EIYLELAARF VLNAPPSEIE DFNRLMFLIE QAHWYYIDFT CEQDASLSAS MGLNEFAKGM ISSVETLKSR IPGFKNNFEK FKSYKFAIPT CGAVLLNPTM DKCLMVRGWG SQNKSLGFPK GKMDANETEA ECAAREVEEE IGVDIRQFIV EEDKVVFMRK RKPSDKLGQK NTLYLIQGIS EETKFLTHTR KEISDIVWNP LWIFDQPEEA LKKFKNKYGQ IYPALRDIMA WVKQNKKKHP KPRTNQVAAP QSPAAGRAAP MSLEDLENEL MSGYDDEPED DEPAKPHKAF EALTNFKFNK ARILQ
|
| |
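The length fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 915 bp), and the helper names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(spaced: str) -> str:
    """Remove the whitespace that separates the 10-residue display blocks."""
    return "".join(spaced.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Coordinates taken from the record above.
length = gene_length(426882, 427796)
codons = length // 3  # number of complete codons in the CDS

print(length, codons)  # 915 305
```

Passing the full "Gene sequence" field to `gc_percent` gives a value to compare with the reported GC content; the database may round differently, so an exact match is not guaranteed.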