Gene Information |
Locus tag | OSTLU_19010 |
Symbol | |
ID | 5006746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 225999 |
End bp | 228383 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | |
GC content | 66% |
IMG OID | 640422167 |
Product | predicted protein |
Protein accession | XP_001422527 |
Protein GI | 145356623 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
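The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. Both are assumptions, not stated in the record, although the reported values are consistent with them. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(225999, 228383)
print(length_bp)                  # 2385
print(protein_length(length_bp))  # 794
```

Both results match the Gene Length and Protein Length fields of this record.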
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.386268 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGCG CGCGCGAGCC CGCGGGCGAC GCGGGCGGTC GCGCGCCGTC GCTCGACGCG CTCGACGCGC CCGATCGCGC GCGCGACGAC GCGCGCGACG GCGCGAACGA CGTCGAATGG CGAACGGTCA ACGGACGCAA AAATGCGAAA TCGTCGCGCG AGCGTCTCGC GCGGTCGTCG TCGTCGCAGT CGCGCGCGGA GGAGATCGCG GACGCGAGCG TCGCGAGGGT GAACGCGTAC GAGGCGATCG CGCCGCCGGG AACGCCGACG CGAAGCGAGA GGGAAGGGGA GGGCGAGGGA ACGGAGGGAC GAGACGCGAG ACGACGCGCG AAGCGACGCG CGAGACGAGG CACGCGACGG ACGAGCGCGG CGCTGCGCGT GCTGGGAGGC GCGAGCACGG GAGCGAGCGA GGATGAGTAC TACGGAGAGG GCGCGGACGG CGGTTGGGGA ATCGGAGGGA TCGCGGCGCG AGCGGCGGCG TTGGCGGCGG CGACGGCGAG GGCGAGATTC GATACCGCGC GCGAAGCGAC GACGACGCCG GATTCGGACG CGGCGGGGCG AGGGAAGGAA GAGGGCGAGG TGGTTCCGTT GATGATGCGG ACGTTTCCGC CGATCGGGGG GTCGAGACCG CCGTTGCCGG GACGAGCGAC GGTGAGGGCG CCGATGTCGC CGCCGAGGCC GAGACCGCCG AAGACGCCGA AGACGCCGAA ATCGCCGACG ACACTCGAGC GCACGAAAAG TGGGCGTCAC GGAGGGGACG GGGAGGAGAG AGAGCATCAA AACGCGCTGG ATTGGTTCTT TCATCAGATG GATCTCGCCT TTGCGGGCGT CATGCTCGCC GTGTACGCGT TTTACGACAC GGTGATGAAA TTCATAGGTA TCAAGGTTTT GCGCGTATCT CAGAGTTCTC GGAGCGTGCG ACAGACGGAG GCTCGAGTGG CCGAAGAGCG CGCGGCCGTG GAGGAGTTTG AGGAGATTTT GTCGGCGCGG CAAACCGCGA CCGAGGCGTC GTCGAGGTCG CCGTCGGCGA GCGACGTTGA CGGCGCAAAC CGCACGAGTC GACCTCGCAC GCCTCCGAGC GCGTCGACGA GCGCGACGAC GTCAAAACAC GCGACGATAC CTTCGGTTGA GCAGCGCGTG GTGGAAGAAC GCGAGGAAGG TGAACTGACG CCGTCGCACG CTTTACGGCG ACGACTAAGC TCGAGTCTCG GCGGCGCGTG GGGTGGCGGT GGCAACACGC CACCTTCGTT CCGAGTTGCG CGCGTCGACG AAGACGCGAC GCACGCGCAC AACGAGCTCG TCTCGTGCGT GGGCTCGCGC GGCGATGAAT ACATAACTGG AGGGTGGGAC GGCACGCTGC GGACGTGGAA GTGGGATCCG ACGAAAGGGC TGTCCGGAGG CTTGCCCATG ACCGGACAGC ACAACGACAA CGTCGAGTTC CTGAGCGTCG ACGCGAGAGA AGACCACGAG CGACTAGCGA TTTCGGGTGG GCGCGATTGT ACGGTGCGCA TTTGGGACGT CGCGAAGCGA TCGCAGCGAA GTCGTATTTA CGCGTTTGAA AACATCGCGA GCGGGTGCGT CGACTGGGAG TCGCAAACAG TCGCCGTGGG CTCACGAGGA GGCGCGGTGA TGTTGTGGGA CGCCGAAAAG GGATCCAAAA AGTGCACGCT TCGCGGGCAC GATGGTGAAG TCACATCCAT GTGCACGTAC GATTGGTCCG AAGGTGGCGC CACGCTTTAC GTCTCCGGCG GCGCTGACGG CACGGTTCGC GTGTGGGACG CTCGTCAGCA TGTCGCCGTT 
GCGACGATGA CGGAGCATCG TCGACGCGTG TACGCTGTGT GTCCGGGTCC AAAGGGTATC ATCTTCGCCG GCGATTTTTC GTCGAACGTC AAGGTTCACT CTTTATCCAA CCCGGGCGCG CTACCTCGCT TGCTGCCAAA CGTGCCGAGC ATGGACGGCT GCGAAGCCCC GATCGCGGGG TTGCAATACG TGAAACTCGA CGGCATGAAC GGTGGCGGCC TGTTGCTCTC AACCGCTGCT TACTTCCCGC TCAACGAAAA CGGCGAGGAA TCCGACGACG ACGACGCTCC GCAAGGCTGC GTCCACGTTC GCGCCGTCGA CGCCACGGGC GCCGGCGTCG GCCCGGTCTC CGACCAAGAC GGCGACGGTT ATATGTACAC CCTGAAAGGC ATCGAAGGGT TGCTCACGTG CGCGTCCCTC ACCGCCACAT CCGACGGTCA TCGCATGCGT CTCGTCGTCG GCGCCGGATC TGGCGCGCTC GGCGCGTACG CCGAGGGCGG CGCGCTCAGC GGCCAAACCG CCGACGACGC CTACGCGTCC ACCATCGAAC GCGCCGACGA CTTGGGCGTG GAATCTTTCG ACTGA
|
Protein sequence | MDRAREPAGD AGGRAPSLDA LDAPDRARDD ARDGANDVEW RTVNGRKNAK SSRERLARSS SSQSRAEEIA DASVARVNAY EAIAPPGTPT RSEREGEGEG TEGRDARRRA KRRARRGTRR TSAALRVLGG ASTGASEDEY YGEGADGGWG IGGIAARAAA LAAATARARF DTAREATTTP DSDAAGRGKE EGEVVPLMMR TFPPIGGSRP PLPGRATVRA PMSPPRPRPP KTPKTPKSPT TLERTKSGRH GGDGEEREHQ NALDWFFHQM DLAFAGVMLA VYAFYDTVMK FIGIKVLRVS QSSRSVRQTE ARVAEERAAV EEFEEILSAR QTATEASSRS PSASDVDGAN RTSRPRTPPS ASTSATTSKH ATIPSVEQRV VEEREEGELT PSHALRRRLS SSLGGAWGGG GNTPPSFRVA RVDEDATHAH NELVSCVGSR GDEYITGGWD GTLRTWKWDP TKGLSGGLPM TGQHNDNVEF LSVDAREDHE RLAISGGRDC TVRIWDVAKR SQRSRIYAFE NIASGCVDWE SQTVAVGSRG GAVMLWDAEK GSKKCTLRGH DGEVTSMCTY DWSEGGATLY VSGGADGTVR VWDARQHVAV ATMTEHRRRV YAVCPGPKGI IFAGDFSSNV KVHSLSNPGA LPRLLPNVPS MDGCEAPIAG LQYVKLDGMN GGGLLLSTAA YFPLNENGEE SDDDDAPQGC VHVRAVDATG AGVGPVSDQD GDGYMYTLKG IEGLLTCASL TATSDGHRMR LVVGAGSGAL GAYAEGGALS GQTADDAYAS TIERADDLGV ESFD
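The sequences above are displayed in whitespace-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation, assuming GC content means the percentage of G and C bases over the full sequence length (the record does not define how its 66% figure was derived). The helper names are hypothetical, and the example runs on the first two blocks of the gene sequence only.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGATCGCG CGCGCGAGCC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 75.0
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the GC content field.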