Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28831 |
Symbol | |
ID | 4999896 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 374644 |
End bp | 376608 |
Gene Length | 1965 bp |
Protein Length | 584 aa |
Translation table | |
GC content | 59% |
IMG OID | 640415317 |
Product | predicted protein |
Protein accession | XP_001415474 |
Protein GI | 145340734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00444962 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCGA GCCGCGGGGA CATCAAGCCC GAGCTCGACG ACGATAATCC GTGCGTGCGA CGACGCGAAC GCGACGACGC GCGCGAACCG ACGACGAAAG AGACGCCGCC GCCGCCGCTG ACGACGGACG ACTGACGACG CGTTTCGACG GACGCAGCGT CGTCGAGACG ATCCCGGTGT ACTTTAATCG CCCGCCGGAC GATGGCGGAG AGATGGTTCT GATTCAGGTG CGAAAACCGC GCGGAAAGCG CGCGCGAGAC GCGCGCGAGG CGAGCGAGAG AGCGATCGAT GGGGACTGAC GAGACGCCGC GACGGCGACG AGGAACGCAG TACCCGCTGC GACCGCCCGA TCGGCCGTAC GACGTGGCGA ACGTGGAGAG CGTGCGATAT AAACCCGATG CGGCGAAACT AGAGATGACG ATGCCGATCG AGGAGAGCGA AAGAAATCGA GACGAGGACG CGGCGGAGCA CACGCGAATT TCTTCGCTGC TGTTGACGAG CTCGAGCGCG AAGGCGGAGG CGCACGGAGA CGGAGTGGTC GTGGGCACGA TACATAACGG GGCTATGTAT TTGACGCCGA TCGAGGCGGT TTATCAGATG CGACCGAGTT TACAGCATTT GGACGCGGCG GATAGCGCGA GGCAACCGCA CGACGCGGCG CGAGAGGAGC GAGAGTTGCA AGAGGAGGAG GAACGCATGC TGCTGCCTCT TCAGGTGCAA GTGCGACGAC GAGAGACGGC GAAACAGACG GAAATGCGAG TGCAGTCGCA CGCGTTTTTG CGGCAGCGTG AGCACGAGGA GGCATGGATT CCGTTGCAGC CCTCCATGCC GGGAGATGCG GATACAGAGT TCGTCAAGGC GTACGTGACG ACGACGCACG GCGAGTCGTG CGGACAGGCG GTGACGGCGA GAGAATACCT CGACGTCATG TGCCCTGTGA GCGGAAGCAA ACGCGTCGCT CCCGAGGACG AAAATGCCAA CACTTTGGGC GATGGAACTG GGTTGAGCAA GTCTCAGCTC GCGTCTCTGC CGCTCGATAG ACGTATTCGA GCGTTGTTCG CCAAGGGTCA AAAGTCGTGC ATGAAATTTA GCAGGATTCG TCAATTCGTG ACGGAACCCA TAGAGCCTGA ACAATTGATC GCGGTGATTC AAGACAGCGC ACACTTGGTG CAAGGTAATT GGGTGGCCAA GTCCACGCTG CGGTGCGGTG GGAACGTGAC CTGGGAAAAC ATGCGAGACT CGGCGCTCTT TCAGTTCGCG CGTTCGCGCA ACGTCAAACC AGAGTTGGTA TCCAACTGTT TGAAGCGTAA CAGCACAAAA TCGGATCCAA CGTTGGCAAA GATGCGTCGC GAAGTTCTCG CAGAATTCTC TCGTCCTCGA GGTTACGGTA GCGCAGCAGA TGGTTTCGAG TTCAACGAAG CCACCGACGA GGCGTTCGAA GCAGAATTCC CCGACGTCGT CTCGCGAGAG ATGGAATCGT GGCTCGCCTT GGCGCCATCC CTCGACATCA CGTTGCTCAC CAACGAATTG ACGACGCCGA GCGTACCGCC ACACGTCTCA CAAATTCGCC TCACCACCGT TCGCTCGCTC GTGTACGCGA AGTTTGAGAA GAGGACCCAC CTGAGCTTGG CGGACATTCG CGCGTTTTTG CTGACGCAAG CCCCCGACGT CGCGGCGTGC GCATACTTTC GACAAGCCGA ACTTCTGGGG ATGTTCGACG GTGATATCGC GTGCATCGAG GGTATTTGCG TTCTGGTATC CGTGAACGAC CCCGCGATCG ACCCTCTTCG CAGGAAAATC TTAGGTTTGC TCTACAAGCA GAGCGGCGTT GTCAAGCGTT CCGAAATCAT GGACGCCCTC GGCGACGGCG CGCCCAGTCA GCACACCTAC ACCAAAGTCC TCTCAGACTT GTGCTCATCG AAAGGAAGCA TTTGGACCAT TAAGGCGGCG GAGGAGATGA GGTAA
|
Protein sequence | MRASRGDIKP ELDDDNPVVE TIPVYFNRPP DDGGEMVLIQ YPLRPPDRPY DVANVESVRY KPDAAKLEMT MPIEESERNR DEDAAEHTRI SSLLLTSSSA KAEAHGDGVV VGTIHNGAMY LTPIEAVYQM RPSLQHLDAA DSARQPHDAA REERELQEEE ERMLLPLQVQ VRRRETAKQT EMRVQSHAFL RQREHEEAWI PLQPSMPGDA DTEFVKAYVT TTHGESCGQA VTAREYLDVM CPVSGSKRVA PEDENANTLG DGTGLSKSQL ASLPLDRRIR ALFAKGQKSC MKFSRIRQFV TEPIEPEQLI AVIQDSAHLV QGNWVAKSTL RCGGNVTWEN MRDSALFQFA RSRNVKPELV SNCLKRNSTK SDPTLAKMRR EVLAEFSRPR GYGSAADGFE FNEATDEAFE AEFPDVVSRE MESWLALAPS LDITLLTNEL TTPSVPPHVS QIRLTTVRSL VYAKFEKRTH LSLADIRAFL LTQAPDVAAC AYFRQAELLG MFDGDIACIE GICVLVSVND PAIDPLRRKI LGLLYKQSGV VKRSEIMDAL GDGAPSQHTY TKVLSDLCSS KGSIWTIKAA EEMR
|
| |