Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39472 |
Symbol | |
ID | 5004914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 296605 |
End bp | 298542 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | |
GC content | 61% |
IMG OID | 640420335 |
Product | predicted protein |
Protein accession | XP_001420815 |
Protein GI | 145352989 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000845209 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.555705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCG CGACCGCGCG GATAACGAGC GCGACGAGGA AAGACGCGCG CGGGCGACGA ACGGTGACGC GCGCGAGCGC GCTGGACGAC GCGAGCGAAA CGCACGACGT CGTGGTGATC GGAAGCGGGA TCGGTGGGTT GTCGTGCGCC GCGCTGTTGG CGAAGTACGG ATACGAGGTG AAGGTGTTTG AGAGTCACTA TCTCGCGGGC GGGTGCTGTC ACATGTTCGA TCATCGAGCG CCGGATGGGG CGTTGTATAA GTTTGAGGTC GGACCGAGCA TTTGGGAGGG GTTGGATCGA CCGACGGGGA ACCCGCTGAG GATGGTGCTG GACGCGCTCG GGGAGACGGT GCCGGTGAAG ACGTACGACG GGATTTCGAT GTGGACGCCG GAGGGGCACT GGAGGTTTCA AACCGGGGAC GATGACGCCC CGGGTGGGTT CTGCGATTTG TTGCGCGAGA AGGCGACGGA TCCGGAGTTG GCGATCAAGG AGTGGAAGGC GCTGAAGGGG CGGTTGGAAC CGTTGTACGA CGCGCTGGAC GCGTGTCCGC TCACGGCGCT TCGCCAGGAC GCCGGGTTGT TGGTGTCCAC GGTGATTGCG ATTCCGTTTT ACCTCACCCA TCCGAACGTG ATGTTGGATA TTCCTTATAT TTTGGACTCA TTTCATAAGT TATCGCGACA GTACGTCACC GAACCGTTCT TGAAGCAGTG GATCGACATG CTGGCGTTTT TCAGCGGGTT CCCGGCGGAG GGCACGATGG GGGCGACAAT GATTTATAGT ATTCCAGGAT TCCATCGTCC CGGGGCGTCG TTGTGCGCGC CCGAGGGCGG TACGCAGGCA GTTGTGGATA AGCTTCAATA CTGCTTGGAA AAGTATGGTG GTGAATTGCA GCTCAAGTCG CACGTGGAGG AAATCATCGT CGAAGACGGA GAAGCCAAGG GTGTGCGTCT TCGAAACGGT AAAGTCATCA AGGCGAACGT CGCGGTCGTT TCGAACGCGA CTATCTGGGA CACTGTGCCG ATGTTGCCCC AGACAGACGA ATTAATAGCA CAAGGTCTCG ACAGGGCGGT GGAGTGGAAG GACGAAATGT CGGAAATCCC CGCCCTCGGA AGCATCATGC ACTTGTTCCT CGGCATCGAC GCCACGGGCT TACCCGATCT GGATCCGTCG CATCTGTGCG TTTTGGACTG GGACCGACCT CTCGGTGATC CCCAAAACGT CATCACAATC TTCATTCCCA CCGTGCTCGA TCCTGAAGTC GCACCCGAAG GTAAGCACAT CATTCACGTG TACACAGCCG GAAGCGAACC GTACGACATT TGGGAAGGCA AGGATCGAGG GAGTCAAGAG TACAAGGATT TCAAGCGCGA ACGCGCTGAA ATCCTGTGGA ATGCCATCGA ACGCATCATA CCGGACATTC GTAGCCGCGT CGAAGTTGAA GTGTACGCGT CTCCGCAGAC GCATCAACGG TTCCTCCGAC GGCACCGCGG CACGTACGGC CCGGCGCTCC CAGCCGGCGG CAGCCTCTTC GGCTTCTTAC CTCTGCCCGA AGTTCCGCAA CCGGGCGTAT TATCCCCTAT CCCCAAGTTA TTACGTTGCG GCGACTCCGT GTTCCCGGGC GTCGGCGTTC CCGCGGTCGC CGCCTCTGGC GCCATCGCCG CGAGCACGCT CGCCCCGCTC CCCAAGCACC TCGGTCTCAT GTGGGACGTT TCCCAAACTC AAAACACCTT TTGGCGCGAG TGCGGAGGCA AGGAAAAGTG GCTCGCGTCC GCGCCCGCGC CTTTCCGACC CGCCGCGGGC GGCGGCGCCG AATTCATGAC CCCAGACCAA TACTACAAGC CCAAGCCCGA GGCCGAAAAG TCCTTCGCCG AAATCGGCAA GCGCGGCGTC GTCGCTCAGT CCCGCGAGCG CGGCGTTGCG GATCGCGAGT CGCGGTGA
|
Protein sequence | MQRATARITS ATRKDARGRR TVTRASALDD ASETHDVVVI GSGIGGLSCA ALLAKYGYEV KVFESHYLAG GCCHMFDHRA PDGALYKFEV GPSIWEGLDR PTGNPLRMVL DALGETVPVK TYDGISMWTP EGHWRFQTGD DDAPGGFCDL LREKATDPEL AIKEWKALKG RLEPLYDALD ACPLTALRQD AGLLVSTVIA IPFYLTHPNV MLDIPYILDS FHKLSRQYVT EPFLKQWIDM LAFFSGFPAE GTMGATMIYS IPGFHRPGAS LCAPEGGTQA VVDKLQYCLE KYGGELQLKS HVEEIIVEDG EAKGVRLRNG KVIKANVAVV SNATIWDTVP MLPQTDELIA QGLDRAVEWK DEMSEIPALG SIMHLFLGID ATGLPDLDPS HLCVLDWDRP LGDPQNVITI FIPTVLDPEV APEGKHIIHV YTAGSEPYDI WEGKDRGSQE YKDFKRERAE ILWNAIERII PDIRSRVEVE VYASPQTHQR FLRRHRGTYG PALPAGGSLF GFLPLPEVPQ PGVLSPIPKL LRCGDSVFPG VGVPAVAASG AIAASTLAPL PKHLGLMWDV SQTQNTFWRE CGGKEKWLAS APAPFRPAAG GGAEFMTPDQ YYKPKPEAEK SFAEIGKRGV VAQSRERGVA DRESR
|
| |