Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29544 |
Symbol | VEF3501 |
ID | 5006799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 420166 |
End bp | 421581 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422220 |
Product | predicted protein |
Protein accession | XP_001422742 |
Protein GI | 145357063 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00819423 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGCCGA GCGCGTACAC GCCGTCGACG TCGAAAACGT TCGACGGTCG CGCGCTGCGA CCGGACTACA AGTCGGCGAG CGCGGGCGCG GACGACGTGG ACGAATACCC CGTGAGCTCG CCCGTGGAGA GCGATTGTTT CGGGGACTCG GAGAGCGAGC CGGACGCGCG TTTCGCGACG TACACGGGCG CGATCAGCGT CTACGGCGAG TTTCGCGATC GACGGTCGCC GAGGCAGGTG TTCTTGCGAC GGAATCTGAG TTATCACGTC GAACGCGCGA GCGCGGGGTC GAGGGATGAG CGAGGAAGCG CGCGGGCGCG CGGCGCGGCG GCGCGCGAAC GCGAACTCGC GGAAAAATAC GAGACGACGT ACGAGTACGA TCGATGCACG ACGAGAGTCA GAGTGCGAGG ATACGGGTGT AATATGTGTA AATGCGTGTG CGTCGGCTTG CGGGGGTTGA TGACGCACCT GAGGGCGTCG CATGATTTGT TTAGGTACAG CGCGAGACGG GAGGGACGGC GAGCCATCGT GCGAATTTAT CCAAAGGGGG AGAATTTTAC CGCGGATAGG AGCTTCGTGT TGCGGTCGCA GACGGATATT CAGACGGCAA ATGATAAGGA GTTTTCGTTT TATCGAGGGA AACGGTCGAA GAGGGAGGTA TTTAGAGAGA GAGTGACCAA GGCGGAGCTC GACGCGTTGT ACGAGCGAGA CGTGCGAGCG GCGCCGCCGC CGCGGGTGGT TTGGCGCGCG GTGCCGAATG ATGATCACTA CGATCGTGTA AAATTAGAGA AGCGCAAACG CGCGCAAGAA GAAGAACGGA GAAGAATCGC GGCGTTACCT TTCAAGCCGA TGGTATGCAA GAAGAACGCG AATGGGGCGG CGAGCGCCGC CGCGACGAAA CCAAAACCAA AACCGCCAAA GAAACCACTT GGACCGTTTT ACAATTCGAG GTCATTCGTC GAGATGTCAG AGATTCCAGA GCAAGATTCT GACGATGAGA ACTTGATTCC CGTAGAAATC ATGGAAAGCA AACGCTTCAT GGAAGAGTTC GTGGATTTTA GCACCGAGGA ACTGTCATTC ATGGAGGCTT GGAACGACGT CGCGATGAAA TTTCGCTGCG TCGCGGATTA CGAAGCCCCC TCGCTATGCG AAGCGTTCGT GCGCGTGCAT GGCGATAAAC TGAAAGTTAG CGACGAGTTC TTCAAAATGT TCGTGCTCAC GCTCTTCGGG ATGTACGAAA ACGGCATTCT CAACCGCCGC GCGGTGGCGG AAGCCTTGGG AAAATGCAAG TCGCTCTCCG GCGCCTCCGC CTTCGATCGA GTCTCCACCT CTGTCACAGA CCGCGAGCAC TGTTCCGTCG CCCTGTTCGA AAAGTTTCCC AAGTTTGTCA CGACCCTGAA AAATCTTAGC TACTAG
|
Protein sequence | MAPSAYTPST SKTFDGRALR PDYKSASAGA DDVDEYPVSS PVESDCFGDS ESEPDARFAT YTGAISVYGE FRDRRSPRQV FLRRNLSYHV ERASAGSRDE RGSARARGAA ARERELAEKY ETTYEYDRCT TRVRVRGYGC NMCKCVCVGL RGLMTHLRAS HDLFRYSARR EGRRAIVRIY PKGENFTADR SFVLRSQTDI QTANDKEFSF YRGKRSKREV FRERVTKAEL DALYERDVRA APPPRVVWRA VPNDDHYDRV KLEKRKRAQE EERRRIAALP FKPMVCKKNA NGAASAAATK PKPKPPKKPL GPFYNSRSFV EMSEIPEQDS DDENLIPVEI MESKRFMEEF VDFSTEELSF MEAWNDVAMK FRCVADYEAP SLCEAFVRVH GDKLKVSDEF FKMFVLTLFG MYENGILNRR AVAEALGKCK SLSGASAFDR VSTSVTDREH CSVALFEKFP KFVTTLKNLS Y
|
| |