Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32955 |
Symbol | |
ID | 5003346 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 135887 |
End bp | 139074 |
Gene Length | 3188 bp |
Protein Length | 1049 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418767 |
Product | predicted protein |
Protein accession | XP_001419080 |
Protein GI | 145349311 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000297669 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.336011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGTT GGGGACGTCG CTCGTGAGCG TGAGCGCGGA TGGGGAGGTT TATATATCGA GGGCGAGCGC GACGACGCGG AGCGACGGCG AGTTTGAGGG AACGCTGGAG ATCGAACGCG TCGTGACGTT GGACGACGGC GAGCGGGCGA CGTGTTGCGC GAGCGATGAC GATGAAATCG TGGTCGGGAC GGATAGGGGG CGGGTGGTGT TTTTGTGCTG GGAGGAGGGG TCGGAGGGGC GAAGCGCGAC GTGCGCGCGG AGCGAACGAG ACGGCGGCGC GACGAGCGTG GCGTGGTGCC GCGATACCGG TACGGTGGTC GTGCGGTTCG ATGCGGGAGC GGTGTGCGCG CTGGAGATTG ATGCGGCGGG CGACGTGCGG CGAAGGAACT GGTTCGATGC GTTCCCGTGT CCGGCGACGT GCGCCGCGTT TCATCGAGGC TCGCGGCGGC TCGCGCTCGG CACAGCCGAC GGCGAAATCC GCGTGTACGA CGACGCGATG ACGTCGGAGG CCGCGAAACC GCGACATGTT TTCGGGTTAA GTCCGTGGGG CTTCGGCGTG GAAGACACTG GCGCGTTAGC GCACGCTTCT TGGTCGAACG ACGGGCGAGC GCTCGCCGTC GGTTGGCGTC GGCGCGGCGT GAGCGTGTGG AGCGAATCGG GATGTTTGCT CATGTGTACT TTACATCACG GCGGCGGCGA TAGCGCGGTG GGAACATCGT CTTCGCCGCG CGCGACGTTC ACGGGTGACG AGGAAGTGCC AGAGATGGGC GCTTGTCTGG CCCCGCCCGC GTGGGGCGTC GGCGACTACG CGTTGTTCGT CCCGGTGCGG TCCGAGTCTG GTTCGAAAGT TTTAGAGTAC GCGCTCGCGA AGAGCGTGTC GAACAGTCGC GTCGCCCCTC GATCATACGA TGGCGGTTGC GATCACGATG ATGCATCACT GCTTCTTGGT GACGATCGTA TTTTCATCGT CGCCTCGAGC GCGACGAGCA CCAGAGTTCA CGCGCGACAG GAAGTGTGTC CGAGCGAGTA CGTCCAGCGT CAGTGGCCCA TGTGCGTGGC GGGAATGAGC CCGAGCGGCG ATCGAGTCGC CGTCGGAGGC GTGCGAGGAT GCGTCGTGTT TGACACGCGC GGTGAATGCT GGAGTCAGTT AGGAGACGTG GAAGAGGAGA ACTCCTTCGA GGCAATCGCG TTCGATTGGA TGCAACCGGC ACCGCCGACA TCGGGGCGGC AGCGTAGCGT GTTGCAGCCG GTACTCGCCA TCGTCGCTCG ATTAGGTAGA ACGAGGATGT TTACAAAGTC TATCAAGTTG TCGTACGGCA TTAGCTTTTA CGCCGACGGC GGAAAGGGTG ACCTTTTGAT GACGATGCCG CTACCATCCG AGGCGACGGG CGCATACGCG TGCGGAGAAT TCTTACTCGT GAGCTTCTCG AATGGCGAAA TAGCAGTGTA CGAGGTGGAA GAAATGTCAT CGAACGTGGG AGCGATTTCC GCACACCACG TTCGCGAGGA TGCGGGACAG CGACGGAAAA CAACCTTAAA CGCGGGCGGG CGCGTGCAAG GAATGTGCGC GGTGCCGCCG GCGGCGGCGC CCGAGCGAGC GCCGAGCGAG TGCGTAGTTT TGACAGAAGC TGGCGAACTC TTCGTCGTAG ACTTGACGGA TGAGTACGAT CAGGTGAAGC TCTTTGATGA CGTCGCGGAG TTCTGGGTCG TTGGAAGCGC GAACGCACCG CAAGAAATGA TGATGCAAGG GGACGAAAGC AGCGACTTTG AGAGCGACGC GAGAGATTCG CTTTCTGTGG ACGGGGGATG CGTATTTGCC TATGGTGCCG AAGGGATGCG CATATGTTAC TTTCCGAATG GCGACTTACG ACAAATCCTC ATCAACGGCG CAACGTCGTG TGACGTTGAG CGCGCGGCGA ACAATCCAGA ACTCGAGTTT GATCGAGAAT TGTATCCGAT GAGCGTGAGC CTGAATATGA ATCGCATCAT CGGAGTGACG CAAAAGTTTT CCTTCGCAGA TGCGGTAGAC ATGCCATACT TCACAATCGC ACCCAAATCG CACACAATTG TGCCTTACAT TTTGCGCAAG CTTTTGAGTT CTGGCCAGCA CGACGCCGCG TTGCGATATG CGCGTGCCGC TCGACGACAG ACGCCGCACT TCATGCATGC GCTCGAATGG TTGCTCTTCA CAGCGCTGGA ACGCTCGAAT CGCGAAATCA CGTCACAAAC AGTACTCAAG CAATCGATTG CACTGCTGTC GGAGTTGCCA AATTATCTTG ACGTCATCGT GAGCGTGGCG CGGAAGACAG ACAACACGCG ATGGGAATCG CTCTTCAAAT ATGCCGGTAA ACCAAGCGAG CTTTGTGTCA AGGCGTTGAA ATTAAAGCGC ATTCGTATCG CGGCGTGCTA CATTCTCGTC GTCGATAAAC TTGAGGGCGA AACAATGGGA CGTGAAATCG CCGTGCGCGT GATGAGAGCC GCGCTGGAAG CCCGCGAGTA CAAGCTCGTC GAAGACCTCA TCAAGTTTTT ACTGCAGCCA GCAGACGAAG CGGCGAAGGA AAATCAAAAG CCTGGAATCT TCAAGCGCGT GCTTGAAGTC ATCGCTCCGC CGCCAAACAG CGTAATTGCG TTAGGCGGGC GCGCAGATCG TGAGCTTGCG CTTGGTGAAC CAGAACAACT GTTATTGAAG TCGCACGTCG ACTCTCTGGG CCGCGAACGC GATGTCGCGG CGATGGGAGC TTTTATGAGC GAAACGTCCT TTGACGGCGT CGCTTACTTG AAACACGAGA CGGACGAGAA TGGTGAAGCG TACATCTCCG ATTTCGCGGG CAGTATCGAG TTAGCAGCGC GGCGATTACG AGAAGGAAAA TTACGACGAG CGGCGTCGAG TCAAAGTTCT CGCACAGAGA GTTTATTTCT CGTCGATCCC ACGCGCGCCG TCGGATCGAA AGTCGAAAGC GACGCCACTT ACGTCACCTC GCTTCTCGCC ACCGCGCGCG AGGCGGGTTG TACCGACTGG TCTTTACTAC TCGCTACTCT TTTAGGACGC GCAGACGTCC TAAACGAGTT TTTCACGAAC GAACCGGCGC TTCGAGAACC TTGGATGAAC ATCGCAAAGC GCGTCGCGAC AAACACGAGC GACGCGACGT TGAAGAATCA CCTCACGGCG CTCGTCTCGG ATATTTGA
|
Protein sequence | MNGWGRRSAS ATTRSDGEFE GTLEIERVVT LDDGERATCC ASDDDEIVVG TDRGRVVFLC WEEGSEGRSA TCARSERDGG ATSVAWCRDT GTVVVRFDAG AVCALEIDAA GDVRRRNWFD AFPCPATCAA FHRGSRRLAL GTADGEIRVY DDAMTSEAAK PRHVFGLSPW GFGVEDTGAL AHASWSNDGR ALAVGWRRRG VSVWSESGCL LMCTLHHGGG DSAVGTSSSP RATFTGDEEV PEMGACLAPP AWGVGDYALF VPVRSESGSK VLEYALAKSV SNSRVAPRSY DGGCDHDDAS LLLGDDRIFI VASSATSTRV HARQEVCPSE YVQRQWPMCV AGMSPSGDRV AVGGVRGCVV FDTRGECWSQ LGDVEEENSF EAIAFDWMQP APPTSGRQRS VLQPVLAIVA RLGRTRMFTK SIKLSYGISF YADGGKGDLL MTMPLPSEAT GAYACGEFLL VSFSNGEIAV YEVEEMSSNV GAISAHHVRE DAGQRRKTTL NAGGRVQGMC AVPPAAAPER APSECVVLTE AGELFVVDLT DEYDQVKLFD DVAEFWVVGS ANAPQEMMMQ GDESSDFESD ARDSLSVDGG CVFAYGAEGM RICYFPNGDL RQILINGATS CDVERAANNP ELEFDRELYP MSVSLNMNRI IGVTQKFSFA DAVDMPYFTI APKSHTIVPY ILRKLLSSGQ HDAALRYARA ARRQTPHFMH ALEWLLFTAL ERSNREITSQ TVLKQSIALL SELPNYLDVI VSVARKTDNT RWESLFKYAG KPSELCVKAL KLKRIRIAAC YILVVDKLEG ETMGREIAVR VMRAALEARE YKLVEDLIKF LLQPADEAAK ENQKPGIFKR VLEVIAPPPN SVIALGGRAD RELALGEPEQ LLLKSHVDSL GRERDVAAMG AFMSETSFDG VAYLKHETDE NGEAYISDFA GSIELAARRL REGKLRRAAS SQSSRTESLF LVDPTRAVGS KVESDATYVT SLLATAREAG CTDWSLLLAT LLGRADVLNE FFTNEPALRE PWMNIAKRVA TNTSDATLKN HLTALVSDI
|
| |