Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18007 |
Symbol | |
ID | 5005541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 125527 |
End bp | 127209 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420962 |
Product | predicted protein |
Protein accession | XP_001421212 |
Protein GI | 145353848 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00198856 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.151799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGA CGCCGTCGAC GCTTCGAACG CCGTCCCGCG CGCGCGCGCC CGGTGCGGCG AACAGTCGGG GCTCGACCGC GCGCGCGCGC GCCATCCAAC CGCCGAAAGC GCCCGATGCG TACCGCGGCG CGCTGTTCCC GGGCGTCGAG GTCCCCGATA ATGATCTCGC GCGATCGTTC AGCGCGCTGT TTCCGTGGGG AAACGGGGCG CGGGTGACGG AGAAGGTGCT GGGGGATTTG CTGAAGCCCG AGGTGCGCGC GGCGCCGCTC TTCGTGCCGC TGTACGATTA CTACCGCGAA TACGGCGGCG TGTACAACTT GGGCGCCGGA CCGAAGTGGT TCGTCGTGGT GAGCGATCCG GTGGCGGTGC GGACGATGTT CAAGGACCAG GCGGATAGCT TTTCGAAGGG GATACTGACG GATATCATGG AGCCGATTAT GGGCGACGGG TTGATTCCGG CGAATAAGGA GATTTGGGCC AAGCGTCGAC CGGTGATCGG GGCAGGATTC CACGGGGCGT GGTTGAAACA CATGTGCAAC TTATTTGGAG CGTCGGCGAT GCGGTTGGCC GATAAGTTGG ACACGTTCGT GGAGTCGGAG AAGACGGTGG AGCTCGAGAG CGAACTGTAC GCGATGGCGC TCGACGTCAT CGGCAAGGCG GTGTTTAATT ATGAATTCGG GGCGTTGAAG CAAGAGACAC CGATCATCAA GGCAGTCTAT CGCGTTTTGC GGGAAAGTGA GCACCGTTCG ACGTTTCCGT TGCAGTATTG GCAGATTCCT GGGGCGATGG AGCTCGTGCC GAGACAAAAG CAATTCAAGG AGGATATGAA AATGGTCAAC GACGAGCTGT CAGTGTTGAT TAATAACGCT ATCGCGTCGC GAAACGAAAC TGGCTTGGAG GAAATGGAGC GCAGAGACTA TTCTAACGTC GAAGACGCGA GTCTGTTGCG GTTTTTGGTG GACATTCGTG GCGACGAGGC GACGAGCACG CAGTTGAGAG ACGACTTGAT GACGATGCTC ATCGCCGGCC ACGAAACCAC CGCCGCGGTG CTCACGTGGA CGCTTTATTT ACTTGCGCAG CACCCGGAAA TCGCGGATGA TGCGGTGGCT GAAATTAACG CCTGTGTTGA AAACGCTGAC GGCATCCCCA CGCCCGAGGA GGTGCGCAAG CTGGAAAAGG TTCGCATGAT TCTCGCCGAG GGTATGCGCC TGTATCCGGC GCCGCCGATT TTGATTCGTC GTGCGATTAA GGATGTCACG CTCCCGCGCG GAGGTAACGG GAAGGAAATC ACCTTGAAAG CCGGCACCGA CTGTTTTATC GCGGTTTGGA ACCTCCATCG GTCACCTGAT TTGTGGGAGG ATCCAGAAAA ATTTGACCCG AGCCGGTTTT CGCGAAGGTT TGAAAATCCC GCCATTGAGG GCTGGGGCGG CTTGAATCCC GAACTAATGA CTGGTTTATA CCCGAACGAG CAGTGCACAG ACTTTAGTTA CGTGCCTTTC GGTGGCGGAC AGCGTCGCTG CGCCGGTGAT CAGTTTGCGA TGCTCGAGGC AGTGACGGCG TTGAGCGTCT TGTTGAAAAA GTTTAAATTT GAGCTCGCGT GCGAGCCGGG CGAGGTCGAG ATGATCACCG GGGCGACGAT TCACACGAAG AAAGGGTTAC CTATGAAGCT TAAGAGACGA TAG
|
Protein sequence | MRATPSTLRT PSRARAPGAA NSRGSTARAR AIQPPKAPDA YRGALFPGVE VPDNDLARSF SALFPWGNGA RVTEKVLGDL LKPEVRAAPL FVPLYDYYRE YGGVYNLGAG PKWFVVVSDP VAVRTMFKDQ ADSFSKGILT DIMEPIMGDG LIPANKEIWA KRRPVIGAGF HGAWLKHMCN LFGASAMRLA DKLDTFVESE KTVELESELY AMALDVIGKA VFNYEFGALK QETPIIKAVY RVLRESEHRS TFPLQYWQIP GAMELVPRQK QFKEDMKMVN DELSVLINNA IASRNETGLE EMERRDYSNV EDASLLRFLV DIRGDEATST QLRDDLMTML IAGHETTAAV LTWTLYLLAQ HPEIADDAVA EINACVENAD GIPTPEEVRK LEKVRMILAE GMRLYPAPPI LIRRAIKDVT LPRGGNGKEI TLKAGTDCFI AVWNLHRSPD LWEDPEKFDP SRFSRRFENP AIEGWGGLNP ELMTGLYPNE QCTDFSYVPF GGGQRRCAGD QFAMLEAVTA LSVLLKKFKF ELACEPGEVE MITGATIHTK KGLPMKLKRR
|
| |