Gene Information |
Locus tag | OSTLU_43564 |
Symbol | |
ID | 5006739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 196543 |
End bp | 198207 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 64% |
IMG OID | 640422160 |
Product | predicted protein |
Protein accession | XP_001422516 |
Protein GI | 145356601 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.079373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0235879 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGC GCGCCGCGCT GTCGACGCGG CCGGCGCCGG CGCGCGCGCG CGCGCGTCCC GCGCGGCGCG CGACGATCGC GCGCGCGCGC GACGCGGACG TCGTCGTCGT CGGCGCCGGT CTCGGCGGCC TGAGCGCGGC GGCGCTGCTC GCGTCGCGGG GACTGCGCGT GACCGTGCTC GAGGCGCACG TCGTCGCGGG CGGCGCCGCG CACGCGTGGA CGCGCGGTGG ATACACGTTC GAATCCGGAC CGTCGCTGTA CTCGGGGCTG GAGGCGCGAC CGACGACGAA CCCGATGGGA CAGGTGCTGC ACGCGATCGA CGAACGCGTG GCGTGCGCGC GGTACAACAC GTGGACGTGT CACCTGCCGG AGGGAACGTT CGTGACGGAG GTCGGGAACG ATCAGTTCTT GGAGGTGCTG CGAGAGTACG TGGATGAGGA GGCGTGCGAG GAGTGGACGC GGTTGAAGGC GCTGATGGAA CCGCTGGCGA CGGCGAGCGC GTCGCTGCCG CCGGCGGCGG TGCGAAGCGA CGTCGGAGGG GCGTTCACGC TGGCGCGATT CGTGCCGGGA TTGGTGCGAT CGGCGCCGTA CATCGGGAAG ATTATGGCGC CGTATAGCAA GTTTATGGCG GAGAACGACA TCAAGCACCC GTTCATTCGT AATTACATGG AGATGCTCTG TTTTTTGCTC AGCGGGGCGC CCGCGAGCGG AACGATGGCG GCGGAGATCG GATACATGTT TGACGATTGG TACAAGCCAA ACTCGATGCT CGAGTTTCCG ATGGGCGGGA GCGGAGCGAT CGTGGACGCG CTCGTGCGAG GGCTGAAGAA GTTTGGCGGC GAGCTAGAGC TCGGGACACA CGTCGAGGAA ATCATCGTCG AAGGCGAAGG CAAGGAGAAG CGCGCGACGG GCGTGCGGAC GAGAGACGGA AAGGTTTACA AGGCTAACAA GGCTGTGATA TCGAACGCAA CGCTCTGGGA CACGGTACCG ATGCTACCAC CGGATGCGAT GCCTCGAGAG TGGATAGAAG AAGCGACATC GACGGCGATG TGCGAGTCGT TCATGCACTT GCATCTCGGC ATCGACGCCG CCGGCCTTCC CGATGACTTA GAGATTCATC ACATTTACGT CGAAGATTGG GATCGCGGCG TCACGGCGGA GCAAAACATG GTGCTCGTGA GCATTCCTTC CGTACTCGAC CCCTCGATGG CGCCCGAGGG CAAGCACGTG ATTCACGCGT ACACGCCCGG AAACGAGCCC CTCGATATTT GGGACGGGCT CGACCGTAAC TCGGACGAGT ACAAAAAGTT GAAGGAGGAA CGCTCGCAAG TGCTTTGGAA AGCTGTAGAA AAGATCATCC CAGACGTGCG CAAACGCTGC GAAATCACGA TGGTCGGCAC GCCCGTGACG CAAAAGAGAT TCTTGCGAAG GGCGAAGGGC ACGTACGGGG GCACGGGTTG GATATCCGAG GATCAAGACA GCATTCCGAT CACGAGCGCG TCGACGCCGC TTCCAGGCCT GCTTGTCGTC GGCGACTCTA ACTTTCCCGG CCCTGGCGTC CCGGCCGTAG CCGCGGGCGG TTGGAGCGCG GCGAATGAAT TGATTTCGCC GCTTCAAACC GCCGCTTTGC TCGACAAAGT GTGTCCTCCG GGGACGTCAA AGTGA
|
Protein sequence | MATRAALSTR PAPARARARP ARRATIARAR DADVVVVGAG LGGLSAAALL ASRGLRVTVL EAHVVAGGAA HAWTRGGYTF ESGPSLYSGL EARPTTNPMG QVLHAIDERV ACARYNTWTC HLPEGTFVTE VGNDQFLEVL REYVDEEACE EWTRLKALME PLATASASLP PAAVRSDVGG AFTLARFVPG LVRSAPYIGK IMAPYSKFMA ENDIKHPFIR NYMEMLCFLL SGAPASGTMA AEIGYMFDDW YKPNSMLEFP MGGSGAIVDA LVRGLKKFGG ELELGTHVEE IIVEGEGKEK RATGVRTRDG KVYKANKAVI SNATLWDTVP MLPPDAMPRE WIEEATSTAM CESFMHLHLG IDAAGLPDDL EIHHIYVEDW DRGVTAEQNM VLVSIPSVLD PSMAPEGKHV IHAYTPGNEP LDIWDGLDRN SDEYKKLKEE RSQVLWKAVE KIIPDVRKRC EITMVGTPVT QKRFLRRAKG TYGGTGWISE DQDSIPITSA STPLPGLLVV GDSNFPGPGV PAVAAGGWSA ANELISPLQT AALLDKVCPP GTSK
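
The coordinate, length, and GC fields in this record are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage, ignoring the spaces that group bases into blocks of ten."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record above.
length = gene_length(196543, 198207)
print(length)                  # 1665, matching "Gene Length"
print(protein_length(length))  # 554, matching "Protein Length"
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the listed GC content, which the record appears to round to a whole percent.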