Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43793 |
Symbol | |
ID | 5006579 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 207042 |
End bp | 209411 |
Gene Length | 2370 bp |
Protein Length | 763 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422000 |
Product | predicted protein |
Protein accession | XP_001422521 |
Protein GI | 145356611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.848996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0246757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCA CAGACGCGTT CGAGCACCAA AACTTTAACG CCGTGGATTT CATCAATCGA GTGCTCCCGG ACGAACGCGC GCTCGCGGGC GTCGATAAGA TGATCGCGAA GCTCCGCGCG CGCGTGAAAC TGGTGGACGC GGAAATATTG GGCGCGCTGC GAGCGCAACA CGGGAGCGAA GCGCGAGCGA GGGACGATTT CGAGGTGATC GTGAGCGGGA TCGACGCGCT CGCGGAACGC GCGACGGAGA CGGAGAGAAA GGCGGCGGCG ACGGAGGCGA ACGTGCGAGA GATATGCGCG GATATAGTGC GATTAGATAG GGCGAAGAAT CACTTGACGA ATTCCATCAC GACGCTGCGA CGGTTGTCGA TGTTCGTGAG CGGGATGGAA CAGTTGGAGT TGTTCGCGTT GCGGAGGCAG TACGGGGACG CGGCGAATTT GTTGCAGGCG GCGTCGCAGT TGGCGACGCA CTTCGAGGGG TACTCGCAGA TTCCGAAGAT TGCGGAGTTG CAGGAAAAGT ATCGCGGGGT GAAGAATCAG TTGCGCGCGG CGGTGTTTGA TGATTTTCAC ACGACGTGGC TGCCGCACGT GATGGACGGC GACGCCGCGG CGCAGAAGAA ATTACGCGAC GCGTGCCTCG TCGTGAACGC GCTCGAACCG AGCGTGCGCG AAGAGTTAGT CGGCAACCTC ACAAACAGAG AGCTGACCAA CTACGCGTCA GTTTTCAGCG CGCACGAGAG TGGAGATTTC CTCGGTCGCA TCGCGAGACG GTATGATTGG ATCACGCGAC AGTTACAATC GAAGGAGTCC ATGTGGGCGG TATTTCCAGC GCATTGGCGC GTGCCACAAC TTTTAAGCGT GTCTCTTTGC AAGCTCACGC GCGCACAACT CGCGGAGGCG CTTGATGCCC GCGGGCCGCA CGACGTACAA AAGCTCTTGC ACGCGATGCA CGTCACCATA GAATTTGAGA TGGAGTTAGA CGAGCGTTTC GGCACTGGCG CGGGTGTGGA AGACGACGAG CTCGAAGGTG ACAGTGCGTC GGCTTCGATG CTCCGGCAGA AACTCGAGCG CGCTGAGCGC GAGAAACAGA CAGAAAACTT GCGCGGAGGG CGCGTCCTGC CCATGGATTC GGCCGCGGAA GCCGCGGCGA CGTTCATGTT TCGGGGGAGC GTTGGATCGT GCTTCGAAGA TCATCTCGCC GATTACGTCG CACTCGAGCG ACGTCAATTA TTTGAGCAGA TCAACGAAAG TATTCGAAAC GAAACTTGGC AGGGTGACGA AACAAACCCA CGAATTTTGG CGAGCGCGAC GAGCGTGTTT TTGAACATAA AGAAAGTGTT CAAGCGATGC TCCAATTTGA CGCGCGGTAA GACGCTCTTC GCCGTGCACC AGGTGTTCGT ACAAGTTCTC ATCGCGTACG CCAAGGCTTT GAACGAACGC ATCGACGTCG CAGCGTTGAA CGCGACGGAC GCTCGCCGTC CCGAGGCGCA GCGAGCGGCG GAAATCAAGT GCATATGTCT CATCGTCAAT ACGGCTGAGT ATTGCAACGA AACCGTCGGT CCACTCGGTG ATTCTATGGT CAAATCGTTG GAAGACAATT TCAAGGAGAA AGTCGACATG ATGGACGTCG AAGATGCGTT CAGCACGACA CTGTCTGAGG CACTGAACAA ACTCATCGGC GTGGTGGAGG CGAAATCAAA CCTCGTCTCT GGGATGCTTC GCGTGAACTG GGGCGCGCTC GACGTCGTCG GCGACCAGAG CGAGTACGTA GACACGTTCG AACGCGCTAT TGCGCACGCG ATGCCTGTGC TTCGCGCTTC AGTGAGCGAC ATCCATCACA CATTCTTTTG CGAGAAACTG GCGTCGTCAA TCGCGCCGAA ATTGTACATC GCAGTGTTCA AGTGCAAGCG CTTTTCGGAA ATCGGCGGTC AGCAACTTTT GCTCGACATG CACGCGGTGA AAGCAATTTT ACTGTCCTTG CCCGCCATAG CCGCTGCCGG TACGGACGTC ACCGCCGAAC CATCGGCGCC GCCGATGAGT TACGCGAAGA TGATCGCTCG CGAGATGGGC AAAGTCGAAG CGCTCGTGAA AACTATCCTC TCCCCGAACG ACGGATTGGC GGAAACGTTC AAAGCCCTCC TTCCCATGAC AGCCAACGCC ACGGATTTCA AAGCTATTTG CCTGCTCAAG GGCATGAAAC CAAACGAAAT CTCCGAGCCC CCGTTCGGGC TCTTCGCCTC GGTCGGCGCT CCCGCCAGCT CCAAGCCGCT CGAGGATTTA CCCAACGTCC CGAACAGACC CAAGGCGCCG CGCATGGACA ACGTCACCGC AAAAATGTCT GGCATGTTCA AGCAGGGCAC CAAACAATAG
|
Protein sequence | MSSTDAFEHQ NFNAVDFINR VLPDERALAG VDKMIAKLRA RVKLVDAEIL GALRAQHGSE ARARDDFEVI VSGIDALAER ATETERKAAA TEANVREICA DIVRLDRAKN HLTNSITTLR RLSMFVSGME QLELFALRRQ YGDAANLLQA ASQLATHFEG YSQIPKIAEL QEKYRGVKNQ LRAAVFDDFH TTWLPHVMDG DAAAQKKLRD ACLVVNALEP SVREELVGNL TNRELTNYAS VFSAHESGDF LGRIARRYDW ITRQLQSKES MWAVFPAHWR VPQLLSVSLC KLTRAQLAEA LDARGPHDVQ KLLHAMHVTI EFEMELDERF GTGAGVEDDE LEGDSASASM LRQKLERAER EKQTENLRGG RVLPMDSAAE AAATFMFRGS VGSCFEDHLA DYVALERRQL FEQINESIRN ETWQGDETNP RILASATSVF VQVLIAYAKA LNERIDVAAL NATDARRPEA QRAAEIKCIC LIVNTAEYCN ETVGPLGDSM VKSLEDNFKE KVDMMDVEDA FSTTLSEALN KLIGVVEAKS NLVSGMLRVN WGALDVVGDQ SEYVDTFERA IAHAMPVLRA SVSDIHHTFF CEKLASSIAP KLYIAVFKCK RFSEIGGQQL LLDMHAVKAI LLSLPAIAAA GTDVTAEPSA PPMSYAKMIA REMGKVEALV KTILSPNDGL AETFKALLPM TANATDFKAI CLLKGMKPNE ISEPPFGLFA SVGAPASSKP LEDLPNVPNR PKAPRMDNVT AKMSGMFKQG TKQ
|
| |