Gene Information |
Locus tag | OSTLU_33533 |
Symbol | |
ID | 5003573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 369803 |
End bp | 371437 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418994 |
Product | predicted protein |
Protein accession | XP_001419574 |
Protein GI | 145350354 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
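The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon. Both assumptions fit the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS of cds_bp nucleotides.

    Assumes the CDS is a whole number of codons; when includes_stop is True,
    the final codon is a stop and is not counted as a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record: Start bp 369803, End bp 371437.
length = gene_length(369803, 371437)
print(length)                  # 1635, matching "Gene Length"
print(protein_length(length))  # 544, matching "Protein Length"
```

Because the record lists the gene as not spliced, the genomic span and the coding length coincide. For a spliced gene, the exon lengths would have to be summed instead.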
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.175528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGGCGC CGGATTGGAT GACCCAGTTG AACCGACTGT GGGGTGGGGC GTCGGAGATT CCCGTGGCGG ACGCGAAACT GGAGGATATC ACGGGATTGC TCGGAGGCGG GTTGTTTCAG CCGCTGTTCA AGTGGATGCG AGAGAGCGGG CCGGTGTATT TGCTTCCCAC GGGACCCATC ACGTCGTACG TCGTCGTGAG CGATCCCGAC TGCATTAAAC AGGTGTTATT CAACTACGGA AGCCGGTACA TCAAGGGCAC AATCGCGGAG GCGGGTGAAT TTTTGTTTGG ATTGGGCGTC GCGTTGCAAG AGTTGGAGCC GTGGAAGATT CGACGGAAGG CGGTGGCGCC CTCTTTGCAT CGCAAATACG TCGAAGCGAT GGTGGATAGA TGCTTTGGTC CGTGCGCCGA TCGGATGGTG AGTATTTTGG AGGGCGAAGC TGGCGCTGGC GGCGTCGGTG GCGTGAACAT GGAGAGTCGA TTTAGCAAGA CGGCGCTCGA CATCATCGGC ATCTCCGTGT TCAATTACGA CTTTGAGGCG CTCACGACGG CGGCGCCGGT GATTCAAGCC ACGTATACTG CGCTCAAGGA GGTCGAGACA CGATCGATGG ATCTTTTGCC CACGTGGCGC TTGCCTGAGA AGTTCTTGCG CGTCGTTAGC CCGCGCCAAC GCGACGCGCA AGACGCCGTC ACCGTCATTC GCGACGTCAC CCAGCGACTC GTGGACGACT GCAAGCGCAT GGTCGAGGAG GAAGAAAAGG TCGGTGGCGC TGAAGAGTGG GCTCGCGATT ACTTGAACGA GTCAAATCCT TCGGTGTTGC GCTATCTCAT CGCTGCGCGC GAAGAAGTGT CTTCGACGCA GCTCAGAGAC GATTTGCTCT CGCTTCTCGT CGCCGGTCAC GAAACCACCG CATCGGTGCT CACGTGGGGC ACGTACGAGC TTCTCAAGCC TGAAAACGCG GAGCAGCTTC GCTTACTTCG AGCCGAGCTC GATGAAGTGC TAGGCACGCG TCCGTTCCCG ACGTTTGCGG ATTTACCAAA GATGCCCTAC CTCGAGCGAT GCTTTCACGA GTCCATGCGG CTTTATCCGC AACCTCCCGT GTACACGCGA CGCGCCGTCG TGGAGGATGT CTTGCCCAAC GGCATGACGA TACCAAAGAA CCAAGATTTA TTGGTATCGA TTTACAACCT CCACCGTTCG CCCACGAGTT GGGGGCCGAC ATCGCAAGAG TTCGAGCCCA TGCGCTTCGG ACCGCTGGCG AACGGGCAAC CGAACGAGTT AAACACGGAC TACCGCTACG TGCCGTTTAG CGCCGGACCG AGGCGATGCC CCGGGGATAA GTTTGCCGTC TACGAGGGCA TCGTCATTTG GGCCACGATG TTCAGGCGAT TAGACTTGGA GTTAAAGGCT GGTCACGACG TTGTCATGAC GTCTGGGGCG ACGATTCACA CGAAGAGCGG TTTGTTAGCC ACCGTCAAAG CGCGCGCCAT GCGAGAAGTC GCCGAGGCGG ACCGTGTCGA CTGGGCTAAC TTGAAGCCTG CGAAGGATAT CGGCGAGGAG TGGATGGAAA AGGCACTCTT CAATAGCGAA GCGACCGGTG CGGTGAGCGC GGGTAAATGC CCCATGGGTC ATTAA
Protein sequence | MAAPDWMTQL NRLWGGASEI PVADAKLEDI TGLLGGGLFQ PLFKWMRESG PVYLLPTGPI TSYVVVSDPD CIKQVLFNYG SRYIKGTIAE AGEFLFGLGV ALQELEPWKI RRKAVAPSLH RKYVEAMVDR CFGPCADRMV SILEGEAGAG GVGGVNMESR FSKTALDIIG ISVFNYDFEA LTTAAPVIQA TYTALKEVET RSMDLLPTWR LPEKFLRVVS PRQRDAQDAV TVIRDVTQRL VDDCKRMVEE EEKVGGAEEW ARDYLNESNP SVLRYLIAAR EEVSSTQLRD DLLSLLVAGH ETTASVLTWG TYELLKPENA EQLRLLRAEL DEVLGTRPFP TFADLPKMPY LERCFHESMR LYPQPPVYTR RAVVEDVLPN GMTIPKNQDL LVSIYNLHRS PTSWGPTSQE FEPMRFGPLA NGQPNELNTD YRYVPFSAGP RRCPGDKFAV YEGIVIWATM FRRLDLELKA GHDVVMTSGA TIHTKSGLLA TVKARAMREV AEADRVDWAN LKPAKDIGEE WMEKALFNSE ATGAVSAGKC PMGH
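Both sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that step plus a GC-content calculation, shown on the first three blocks of the gene sequence only. The helper names `unblock` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def unblock(blocked: str) -> str:
    """Join a sequence displayed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = unblock("ATGGCGGCGC CGGATTGGAT GACCCAGTTG")
print(len(prefix))                     # number of bases after removing spaces
print(round(gc_fraction(prefix), 3))   # GC fraction of this 30-base prefix
```

Applying the same two functions to the full gene sequence should give a value comparable to the 58% in the "GC content" field, assuming that field was computed over the gene sequence itself.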