Gene Information |
Locus tag | OSTLU_27761 |
Symbol | |
ID | 5005621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 135155 |
End bp | 138052 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | |
GC content | 63% |
IMG OID | 640421042 |
Product | predicted protein |
Protein accession | XP_001421719 |
Protein GI | 145354914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
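The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 135155, End bp 138052.
length = gene_length(135155, 138052)
print(length)                  # 2898
print(protein_length(length))  # 965
```

Both results match the Gene Length (2898 bp) and Protein Length (965 aa) fields in the record.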
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0965558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.498472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCCC GCGCCAAGCG GGTCGACGCG TCGACGCGCG CGGCGCCGGA CGCGCGAGAA ATCGGGGCGT TCAAGACGAA CGATGGTTTA AAGTCGGGCG TCTTCGTGCT GAAATCCGCG CGCGCGGCGC CGCGAGGCGA CGAGGCGTCG CGACTGAACG CGTTCGAACC GGTGTTGCGG GATGAAACGC GCGGCGCGCG AGAGCGGCGC GCGGCGGCGT GCGCGCGGGC GTGGGAGGCG CTGGATGGAC ACTGTCAACG CGTCGCCGAC GAGGCGAACG CGGAGGCGTT CGCGAGGGTG AGGAGGTTCG TCCGCGAGCA GGGCGCGACG CGGGTCGAAC GCGCGCGAGA AACGGAGGAA AAAAATAACG GCGGCGATCG ACGCGTGCCG GTGGGCGTGG TGCTGGCGGG AGGGGTGAAC AGCGACGATC ACGAGGAGAC GTTCGCGGCG CTGACGAAAT CGTTGAGGAA GAGCGGAGAC GCGCACGTGG CGCTGTTGCG GTCGAGAGAT CTGAAGGCGC GCGCGGGGGC CGGGACCGGG GGGCTAGGGG TGGCGTTTGG GGTGATCATG CGGCAGTTGG ACCGCACGGG GGGGCATTGG GGGGGGAAGA GCATGCGCGC GCTGCGGCGA TGGCACGAGG AGACGTCGGG GGCGCGTCGA GACGGGCTCA TCGCGGCGTC GACGGTGCCG TCGCCGATCG GGAACGCGTT GACGGTGTAC AATGGCGGTG ATTCGCGCGC ATCGGGAGAG GGCGACGACG TTCGCGCGCG GGGGGGGAAG AAGCGGGCGG CGACAGATTC GCGCGCGACG TCGCCGGCGA GACGGTCGAA GCGGCGCGCG AGAGATGACG TGGAGCGCGC GGGCGACGAC GAAAGGACGT TATATCCAGT GATTCAAGGT CGCGATTGTC CGGTGGTTAT CGTCGTCGAG GATACGGAAA GTTTCGACGT GCGCGTGTTG GATTCGTTTA TTCGAAGCGT TTCGGAATTC GTCTCGAGCG TGCCCGTCGT CGTCCTCTTG GGCTTGGCCA CGAGTGTGAG TTCGCTTCAA GGGATGTTAC CAGCCGCCAC GGCATCGTTG ATGAACGCGC AGGCGTTCCA GCTTTGGGCG CCCGGGCAAA TGATGGAGGC TGTGCAAGAG CGCGTGCTGT TGAGCCCCGA GCGCGTGCCC GCGTTTGGAA GCGAGGTGTT GAACGTTTTG CACACGCGCT TTAAAGAGCA CGACTTTTCC CTCGCCGCCG TGCGTCGTGC GCTCCATTTG CTCACGATCA CACACTTCAT GACAGAGCCG CTGAGCGCGG TGTTGCCGCT GCTCGCTAAA GATTCGAGCG AGCCTTGCGC CGACGGCGAC GAGAGCGACG ACGAGTACGG CGACGCCATC GACCAATTCG TTCGAGATCT CGATCCGAAC GCGGTGGAGT ACGCGCGCAT CAAGCACGGA TTCGGTGCAA AGTCGATGCG AGACGGGGAC GGCGACTCCG CGTCTTCAAC GACGCACACG AAGAGCGAAC TGAGCGCGGC GCTGAAGGAT GTGTACCGCT TGCGGCGACG GTGGGCACTC TCGTTGCGTT GCATTCAAGT CGCGTGCGCG GCGACGGAGA AAAAGTTCAA AAAGAGCGTG ACGACGATGG CAGACTTACT CCTCGACGCG TCGCGAAAAG GATACTTTGG TGGTGCTGGA GAATTCAACC CCAAGAGTCA AGGCGGGACG TTGCTCAACG TCGCGTTCTC GCGCATTCGG GACGAGTTCA CGACGGCGGA AATCGGTGAT TTGGTCAAGT CTATGCTGAA ATTCTTGGAC 
GATGATGAGG TCATGCGGTC GGCAGAAGGT TCGGAGCTGC GCTTTTTACT GTCAAACATA GAGGACGGCT CGTTCGACGC GGAGGACGCT CGACGGCGAG AGGCGGCAAT TTTGGCATCC ACGCCGACGA ACGAAGGTGG CGACGAACGG CGCGGTCTCG TCATCGCTGC GTCGAATGCC AATGCAACGG CTGAAAAACT GGCGCACGAC GAGACGAACG CGGAAGATCA GCGACTAGCG ATGGAGGCAA TCTTAAACGC ACGGAAACGG CGTGGAAACG GCGCGGCGTT TGCGACGAGC CTCACGCCGG CGCCGAGGAC TTCAACGCGC GATGAAGACG CCGCGGCGAA CGATGGTTTG GTACTTCCTA AGTCTCCACC GCGCGGTCGA AGCGCACCGG CGAGGAGTGA AGCCAACGCC CTAGCTTTGG TGACGACGAC GGTGGCAACG GCGGCGAAGC CAGAACGACC TCGACTCGAC TCGGCGAAAT CCGCGGCGGC GAATGAATTT TGCGACATTC TTCGCGCCAT CGCGCGCAAG TACGGGAGTC ACCCACCGGA ATCGATCGAT GCGAGCGGAA TATTCGTCGT CACCGACGTC GAATGCGTTC GCGCCGCGCT TCAGGCGTCG CCTCGTTTGT CGTTGGAGAA CACGCTCACG GATCCGAGCG AGCTGCTGCG ATGCGCGTGC TGTCGTCACG ATGACTACGC CTTGGCGAAC GGTCGACGCG TGCCGGAAAC GTTGCCTGAC ACCGCCGCGG CGTATCGCCT ACTCGCGCGA TTCGGCGAAC GAGCGCCCAT ATACGACTGG TTTCAAAGTT TTTGCGAGAG TAAAGCGGGG TCGGAGATGA AACGCGGCGA TGCAGCGGCG GCGGCTCGCG GTAAAGGCAC GTTCGGTTTG CCGCGAGAAA AATTATGGCA GCTTCAGGCG CGATTTACTC GCGCCGTGGC CGAGTTGGAG TTCTTGGGCA TCGCGCGTCC GTACCAAAAA GGTCGCAAGG GCGTCGAGTA CATGGTACGC ACGGCGTTCC CTCTCGACAA GCTCGCGAGA GATTCCAGGA GCAACGTCGT CGAGCGCTTA GCCATCCAGC CGGCTTGA
|
Protein sequence | MTSRAKRVDA STRAAPDARE IGAFKTNDGL KSGVFVLKSA RAAPRGDEAS RLNAFEPVLR DETRGARERR AAACARAWEA LDGHCQRVAD EANAEAFARV RRFVREQGAT RVERARETEE KNNGGDRRVP VGVVLAGGVN SDDHEETFAA LTKSLRKSGD AHVALLRSRD LKARAGAGTG GLGVAFGVIM RQLDRTGGHW GGKSMRALRR WHEETSGARR DGLIAASTVP SPIGNALTVY NGGDSRASGE GDDVRARGGK KRAATDSRAT SPARRSKRRA RDDVERAGDD ERTLYPVIQG RDCPVVIVVE DTESFDVRVL DSFIRSVSEF VSSVPVVVLL GLATSVSSLQ GMLPAATASL MNAQAFQLWA PGQMMEAVQE RVLLSPERVP AFGSEVLNVL HTRFKEHDFS LAAVRRALHL LTITHFMTEP LSAVLPLLAK DSSEPCADGD ESDDEYGDAI DQFVRDLDPN AVEYARIKHG FGAKSMRDGD GDSASSTTHT KSELSAALKD VYRLRRRWAL SLRCIQVACA ATEKKFKKSV TTMADLLLDA SRKGYFGGAG EFNPKSQGGT LLNVAFSRIR DEFTTAEIGD LVKSMLKFLD DDEVMRSAEG SELRFLLSNI EDGSFDAEDA RRREAAILAS TPTNEGGDER RGLVIAASNA NATAEKLAHD ETNAEDQRLA MEAILNARKR RGNGAAFATS LTPAPRTSTR DEDAAANDGL VLPKSPPRGR SAPARSEANA LALVTTTVAT AAKPERPRLD SAKSAAANEF CDILRAIARK YGSHPPESID ASGIFVVTDV ECVRAALQAS PRLSLENTLT DPSELLRCAC CRHDDYALAN GRRVPETLPD TAAAYRLLAR FGERAPIYDW FQSFCESKAG SEMKRGDAAA AARGKGTFGL PREKLWQLQA RFTRAVAELE FLGIARPYQK GRKGVEYMVR TAFPLDKLAR DSRSNVVERL AIQPA
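The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of stripping that whitespace and computing a GC percentage, as one plausible way to relate a sequence to a GC content field. The function names are hypothetical, and the record does not state exactly how its own GC content value was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustration on the first three blocks of the gene sequence only;
# paste the full sequence in place of this string to check the whole gene.
first_blocks = "ATGACGTCCC GCGCCAAGCG GGTCGACGCG"
print(len(clean_sequence(first_blocks)))
print(round(gc_content(first_blocks), 1))
```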