Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_93105 |
Symbol | |
ID | 5003339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 111467 |
End bp | 115087 |
Gene Length | 3621 bp |
Protein Length | 1206 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418760 |
Product | predicted protein |
Protein accession | XP_001419076 |
Protein GI | 145349303 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0761562 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGTGC GATTGTGCGA GCGGGACGGG GACGACGGAC GAGCGCGAGC GATGGATGAG ATGTTGGCGG CGTTGGAGGC GAGCGCGGGC GAAGACGCGG CGGCGATTCA GAGGTTGGCG TTCGCGCTCG AGGGAAGCGG TGGTGGTGGG AACGAGCGCG CGAGCGGGGC GGGGGGGCTC GAGAGGATCG ATGTCGTCGC GAGCGCGGGG CGGGATTGGA GCGCGGTGCC GGATGTCGGT CGGTTGCGAA TCGCCGCGGC GACGACGCGA GAGTTCGCGA TGGAGGACGA CGGCGCGGAG GCGGCGAGAC GCGTGCCGGG ATTGCGGCAC GGTGCGGTGT TCGGACTCGA CCGGGCGGTC GTGGACGAGG GCGACGGCGA CGACGACGAC GACGACGGCG ACGACGATGA CCGTGTTGTG AGTGGTGAAA TAGATAGCGC GAAAGATGAC GACGCCGGCG GACGCGACGC CGGCGTCGCG GACAAGGCGC TCGCGTTAGA CAGAGGGTCA GGCGACTCTG CGTCGCTTTG GCTCGCCGCC GCGGCGGATA TCGAGACTAC GACGTTGCGT TCGTGTTGGG ATACGCGAAA CGCCGGTGCG CGTGCGACTT CGAGTGCGCT CGACGCGGAA ACGTTTGATC TTGCGTATCG ACGCGCGTTT CGGCCGGTGA AGGGGATTGA TACCGAGCCC GTCATCGCGA GCGAGCGTGA TTGCGCGCAC GTCGTCGCAA TGGCGCTCAG TGGGTGTCAA ACGAGCGCAG ACATACTCGT CGCGGCGGCA GCAAGCACAG GTAGTGGTGT ACGCATGCGT TTCGCGTCGA GCTCGACGCT GGCGTTTAGG AACGCTTTGA AAACTATGGC GTCGGCGGCG GAGCGCCGAT GCGAGCTCGA CTACGCCGTC AGCCGCATGA CGACGCCATA CATGCGACCG ATGGCGAGCG CGATACGCGA TATTCTTCGC GCCCACGCCG CAGCGTTGCA AGCCATGCCA GAGGCGACGG CGGAACGGCG TCTCGCCGAG CGCGATACTG TTGACGACCT GACCGAAACG CAATCGGACG TAGGCGACGT CACGTTACTC GAGGTCACCG TGCACACGAA ACGTTTGCGT CGTCAAGTGG ATATTATTTT CAACCTCATC ACCGACGAGC GGTCGTTAGA TACTTCTTTG GTGCCGCATC CGGTGTTGAT CCGTCGTCTG GAGACGTCGC TGTCGCACGT GGAGGATGAA GAAACACCGA TGATTCAGTA CTTGTATGCG CGAGCGGCAA AACCTATCAT CGACGACACC TTGTCTTGGA TATACGCTGC GGCGCCGCCG TTTTCGAAGT CAGAGTTTTT CATAGAGTGC TCGCCAACGT GGTCAAATCT TTCTACATTT CATGAAAGTC CCGTGCGAGG AGGCGCGCCG CCGTGCTGGC TCGGCGGCCG CGCGCACGAT GGCGAAGACG AATTCGCGGA GACCGATACG CTCACCATCG CGCGTGTTCG GTCGAGTGTG AACCCCGCTC TGGCGGTCGA TCGAGCGAAA GAAGTCCTCC GCATTGGTTT GCAGCTTCGC ATTTTACAGC GATTGCCGCA CACGCGCGGG TTTGCACGCG CTGTGAAAGA TGCATTCGTC AAAATTTCGG CGTTTCAAGC GCACACAGAG CGCGATGCCA GGCAGTGGCT CAATCGAGTG CGCTTGCTCG AATCGCGCGT CGTCGCGCTG GCTCGAGAAA GCACGAACAG CATGCACGAG ATGCGCGCCG CACGAGCGCG AGAGTTGGAC TCGGCGAGAG CAGCATTCGT CGCCAAGACG CGCGCCGAAA TACTGGCGCG CGAAAAAACG CGACTAAGAC GTCTTGAGGA GATCGACGAA TCGAAGCGAG TGGAGCGTAA AGGGCAGCTC GAAGAGTTGG ACGAACGTGA AAAGCAACGT CGCGACGCGC GCGCGCAAAA GATTGCCGAC GAGCGCGCAT GGCTTGAAGA TCGAGAGGAA AAGCAGCGCG CCTTTGAGCA ACAAAAAGTG GAAAAGAAGA TGGAAGAAAC GCGCTTACGC CTTGAACAAG AGGATGCGAA ATTAGCGTGG TTTCGATGGC ACGAAAGCCG TCTGGTGCTT AATGATAAGC GAAGAGCCTT CGTGCGCGCA ATGGAATCAG TGCAAGAAGA AGAAGCAGTG ACCGCGCTCG CAAACGCGAT GAGCTTAGGT TTCGCGACGT CGCCATCACC GTCGAACGAC CTCTTCATCG AGCCCTTCGA GAGTGTCACG CCAACAGTGA TGATTGAAGA AGCCGAAGAG GTGAATGTTA TCGAAGAAGA CGACATCGCA TCAGACGATT TTGCCGACGC TTCCGAAATG TTCATCGACG AATCCGCGAC GTCGTCGCCG TCGACAGTCT TATCCTTACA TTCCGCAGAT GACGATGAGC GCAGCGTGGA TATAAAAGAG AAAGACATCG TGGAAACTGT TTCGGCGCCG GACAACATGG ACGAGATGCG TGACGGCGCA AAAGATCTTG AAACGCACGA CCACACATCT CGAGCGTATT TGGGTGAAGC TTTCGGCACG CCTTTGCCAC TTTTGTTGGA ACAAGAAATG CGCGAAATCA TCACACGCCA ATCCGATGTT TTGGGACGGT ACACCACGAC AGTGCTTATG GATCATCTCG CGCTCGAACT TCATCTTGAC GCCATCGTTC GGTTCACCAT GGGAGGTGAA CTCGGCTTCG CGGACGCGCT GCTGAACAAT CTACAATCAA AATGCGTCGC CGCACGAGAA CGAATGACAC AAGGCTCCGC GCGTGTCATG CAAACTGTAT TGGAGGGCGC CATCGACGAT ACAGGTTTGC GAGATGACCA CATGAGTAAG CGTTTACGTT TATGCGAGAA TGCCGATGCG CTGGTGTCTG TTGACCCGTA CAATTTGAAT TTATCCGCCG CCGTCGAGTG TTCGTACGCT GCTCCCTGGC CCTTGAACCT TATTTTTGCG GATGCGGGAG ACGACCACCC GCTCGCGCAC ACGAAAATTG CGCTATTGCA AATACGGCAC GCGGCGAGTG CTGTCAAAGA TGTTTCGGCG TTGGTACACG CGAGCTCAAG AACGCGAGCT TTACTGGATT CGACGCAAAC AAATTTGGAT TTGCGTGCGA GGCGATTGCG CAAACTATCC TTGCTCGCGT CATCTTTTCA GCATTTCGTC GATGCTCTTC ACGGGCATGT GTTCGAAGCC GTACACGTCG GCGCGCGAGG ACGCTTGTTT GCAAGTCTTC GCGACGACGG CGGGAACATC CTCCCACGCA ACATCGACGC CCTTCGCGAC GTGTTTGATG AATTCTGCGC TCGCACGTAC GCTGCGTGTT TCTTGCGCCC CATCGACACG GCGCTCAAGG CCCTGGTGGA CGATGGTTTA CAGCTCGCCC TAGAGTTGAA GTCTCTTCTC GCGTCGAGCG ACGCCGAATC GTTACTCGAA GATGGTCTCG CGTGCGCCGC AGCGCAGCAA ATCCACTCGA AATTTCACGC GTTGATGACG CAACTGTGCT TCCGCGCGCG CATCACTTCC TCCGACACCG CGGTTGGCTT CATCCAACGC GTCGATTTCA ACGAATTCTA TCTCGGCGCG ACCATCGACA TGGAATTTTA A
|
Protein sequence | MYVRLCERDG DDGRARAMDE MLAALEASAG EDAAAIQRLA FALEGSGGGG NERASGAGGL ERIDVVASAG RDWSAVPDVG RLRIAAATTR EFAMEDDGAE AARRVPGLRH GAVFGLDRAV VDEGDGDDDD DDGDDDDRVV SGEIDSAKDD DAGGRDAGVA DKALALDRGS GDSASLWLAA AADIETTTLR SCWDTRNAGA RATSSALDAE TFDLAYRRAF RPVKGIDTEP VIASERDCAH VVAMALSGCQ TSADILVAAA ASTGSGVRMR FASSSTLAFR NALKTMASAA ERRCELDYAV SRMTTPYMRP MASAIRDILR AHAAALQAMP EATAERRLAE RDTVDDLTET QSDVGDVTLL EVTVHTKRLR RQVDIIFNLI TDERSLDTSL VPHPVLIRRL ETSLSHVEDE ETPMIQYLYA RAAKPIIDDT LSWIYAAAPP FSKSEFFIEC SPTWSNLSTF HESPVRGGAP PCWLGGRAHD GEDEFAETDT LTIARVRSSV NPALAVDRAK EVLRIGLQLR ILQRLPHTRG FARAVKDAFV KISAFQAHTE RDARQWLNRV RLLESRVVAL ARESTNSMHE MRAARARELD SARAAFVAKT RAEILAREKT RLRRLEEIDE SKRVERKGQL EELDEREKQR RDARAQKIAD ERAWLEDREE KQRAFEQQKV EKKMEETRLR LEQEDAKLAW FRWHESRLVL NDKRRAFVRA MESVQEEEAV TALANAMSLG FATSPSPSND LFIEPFESVT PTVMIEEAEE VNVIEEDDIA SDDFADASEM FIDESATSSP STVLSLHSAD DDERSVDIKE KDIVETVSAP DNMDEMRDGA KDLETHDHTS RAYLGEAFGT PLPLLLEQEM REIITRQSDV LGRYTTTVLM DHLALELHLD AIVRFTMGGE LGFADALLNN LQSKCVAARE RMTQGSARVM QTVLEGAIDD TGLRDDHMSK RLRLCENADA LVSVDPYNLN LSAAVECSYA APWPLNLIFA DAGDDHPLAH TKIALLQIRH AASAVKDVSA LVHASSRTRA LLDSTQTNLD LRARRLRKLS LLASSFQHFV DALHGHVFEA VHVGARGRLF ASLRDDGGNI LPRNIDALRD VFDEFCARTY AACFLRPIDT ALKALVDDGL QLALELKSLL ASSDAESLLE DGLACAAAQQ IHSKFHALMT QLCFRARITS SDTAVGFIQR VDFNEFYLGA TIDMEF
|
| |