Gene Information |
Locus tag | OSTLU_27740 |
Symbol | |
ID | 5005610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 100163 |
End bp | 103378 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421031 |
Product | predicted protein |
Protein accession | XP_001421709 |
Protein GI | 145354893 |
COG category | |
COG ID | |
TIGRFAM ID | |
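
The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 100163
end_bp = 103378
gene_length_bp = 3216
protein_length_aa = 1071

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.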
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00683941 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0438763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGGGT ACGCGCAGAC GGACGGCGGC GCGAGCGCGG GCGCGGAGGC GGCGGCGAGG GAAGAATTCG ACGACGGGGC GATGGAATTC ATCGCGGCGT CGAGCGCGCG AGGACGCGCG CGCGGCGAGG AGGGCGACGC GGACGCGCGA GGACGCGCGT TGGCGTGCTC GGTGAGCGGA TTCGGCGCGA GCGGGAGCGC GCGAGCGGCC AAGGTGGAAA ATTTTATCGC GGCGAGACGA CGCGCGACGA AACGGTTGGC GCCGTGGTAC GAGGGATCGC GCGCGAGGAT CCACGCGATG GCGCACTCGC GCGATGGGGA AAGCTTGTTG TGCTGCACGA GCGCGGGAGG GGTGTACGTG GTGCCGATTT TGGAGTTGAC GCGTGACGCG GAGGCGTCGC CGGCGATGCG GGTGTTGGCG TCGAGAGGGC CGAAGCCGGC GAGCGCGCTG TGGTGGTATC GCGCGCTCGA ACCCGAGCGC GAGGATCCGG TGGTGGGGAT ATGCGTCGGC GTGGACGGGG AGGTGCGGGC GTGGGACGTG AAGGAGGGGT CGCCTCTGGG CGCGTGCGTC GTCGGCGCCA AGTGCGCGAG CGCGGAGCTC GCGCGCGGCG CGACGAAGCA ATTTCTCGTC ATCAACGGTG TTCAGGGCGA AGTTTGGACG TTGATGCTGG AGAAAATGTC GCGCGTGGTG AAGACGTCGA CGAGCGAAAA AGGCAAGGTG GAGACGACGA CCAAGGCTGA ATCGTTACCG GACGCCGCGG GTTCGCACGG ATTCGCCGCG CACCTGCTCA AGGATGAGTA CGGCGTTCGC GAGGGGCGGC AGGTGACGCT GAGCGTGCAA GAGACCGGCG AGAGCGATGG ATATTCGCTC ATCGCTGCGT TGATCGATCG CCGTACGTTG GAGTTGTACG ACGTGGATCG CGCCACCGAG CCAAAATCCA CGCACGCGTT GCCGAATCAC ACCGTCGCCG TGCACGTGAC TGAAGATTTA ATATTCGCCT TGGTGCGCGA ACCGGTTTTC GGTGAGGAAG ATCTCGCAGA CTTAACCTCA TTCACCGCGT CCGTGCACGT GTTGGCGCGA AGATTCGGGA CCGACGGTTC CAAGTCGTTT ACGCTGCAGA CGTTTCACGT GCCGCGCTCC GCGGGCGTGC CAAAAAAGTT CCTCGCGGCT GAGCTCCCGG AAACGTACGT GCGAAGACGA GAGACGCTGT GCGGATGCAT GTTGTGGACA TCGTACGGGG TGTACGAAAT TAAGTCAAAA GTCGACATCG CGTCTACGCT GCGATCGTTC ATGTCGCCGA ACGCGGTCGT TGCCTCGACC CGCCGCGACG CGTGGGCGCC GATCGAACGC GACGAGTTCG GTGACAACGC TTTAAGCGTA GATCTCGACC AAGACGAACG CTTAAAACTC GTCGCGCACG TCTTGGACGA GGATCACATG CCCGTGTTCG TCGAGGCTGC GCGCCAGGAG CTCAAGCGAC AAAATTTCTC TCGAGCGCGC GATTTATTCG GGAAGACTGG AAAGCCACTC AAAGATTTCA TCTCGCTGAG CTTAGAGGCG TGGGAGGCGT CGCAAGCACT GACGAACTTC CACGGAAAAT CAGTCGAGGC GTACGGAGGA CAGGCGAACC TTTCGTGGTT GAAGACGGCC GCCGCCGCGC ACGCGCATTT GCACGCGTGG TGCGAAGCGT CGAACGCAGT CGCGTACAAC GCCGCCGCGC ACGATTTGGG AGAATCGATT CAAGCGAAGC TCCAAAAGTT GAACCTGAAA GAAGAACGCT CCCCTGAACA AGCGGTGAAG 
GATCTTTTGC GCGTCATCGG TGAAGGTATC GAAGCCACCA AAGACGCGGG CGTGCAGCGC GACGTGGCGT GCTCGGCGAG CGCCGCCGTC GTCGCGTTCG AGGCTGCGAA CGCGGTGAGC ACATCCGTGA TCTCGTCGTG TGCGGCGAAC GAGTACTCAG ATCGAATCAA AACGTTATTC ACCATGCTGT TGTCCTCATC ATCCACCGTT GAGAGCTTGC ACATGCTCGG TGGCGTCACA GCGAATTTGA TGACGCAAGC AGCCGAGCAC AGCGGCGGCA CCGTGCGTAC TTGGGAGTCC AAACCACTCG TGTTTTGGGA TCCGAATGTC ACGTACTACG TCATCGCTAC GCAGACGATG GAAAATCTGT ACGACATCGT TCATGTGCTC GGTTTGGACA GCGACACGAC GGACACGTGC GAGGCGTCGC CCTTGGAACT TGTGTACGAC ATGTTGACGA GAGATGAGCT GCGCTCGCTC GCGGATATGG CGCGCGAGGC GAAAAATCTT GGCGTCACCG GTGCAGCTGA AATCGAGCTA ACGATACTTT TGTACTTGGA CGACGAAAAC GCTCTTGCCG AGCGCATCGA AGTCATGTTG GAAGACGATC CAAAGTTGTT TGGTTGGATC GCGTCCAAGT GTCTGGACAA GCGAAAGTTT CGCATCACCG AGTTGGCAGC GTCACAGATG GAAGACTTTG CCACCGCCGC TATGTGCCAC GTGGCCGCTG TTCAAAAGCT CGCCGCTGTG GGCGAGACGT CGCCGTCAAC CTTACAGGCT GAGCTCGAGC TGGGCGTCGA GACGTACGTC GCGCGGGTGG TCGACACGCG CGCGCAAGTG AAATCAATCG AAGACGTAGC ACGATGCTGG CAAAGGAATG GATTGCCTAT CGACGAACTC GAACGTTTGT TTCTTGACGT ACTCGTCTCT CAAGGTCACG CAGAGGCTAT GCAAGTTGTA CTGCAAAGTG ATCTCGGATT TCAATTCAGC GGTCAATTCA TTCTCGCCGT CGCGACGAAA CGAGTAGCCG AGGACGAGTC AAAGTATTCT TCGAAGGATG GCGCTACAAT CGAAAGCGTT TGGTTGAAAA TCAAGCAAGA TCTAGCATCT CGCCTGGATT CCCCCGAGTT TGTTCAAACG CGAGCTTTCG ACGCGATGGA ACTGGACTCT CTGACTTCAG TCGACGGTGC GCAGTGCTGG GCATTCACGT GCGGACACCG CTACGGCTCT GAAGAGTTGC AGCGCGAAGT CAACGACGCA AAGGCGAGAT TGAAGATTCT GGATTTACCC CTGTCGTCCA TGTTACTCGA AAGCGACTAT AAATTGCAAA AGTGCGCGGT CGCGTGTCCA AACTGTGTGT CTTACGCCGT CGAGCACTAC GTCGAAGTGC GTCGCAACAC GAGGGGCGCC GCGTAG
|
Protein sequence | MFGYAQTDGG ASAGAEAAAR EEFDDGAMEF IAASSARGRA RGEEGDADAR GRALACSVSG FGASGSARAA KVENFIAARR RATKRLAPWY EGSRARIHAM AHSRDGESLL CCTSAGGVYV VPILELTRDA EASPAMRVLA SRGPKPASAL WWYRALEPER EDPVVGICVG VDGEVRAWDV KEGSPLGACV VGAKCASAEL ARGATKQFLV INGVQGEVWT LMLEKMSRVV KTSTSEKGKV ETTTKAESLP DAAGSHGFAA HLLKDEYGVR EGRQVTLSVQ ETGESDGYSL IAALIDRRTL ELYDVDRATE PKSTHALPNH TVAVHVTEDL IFALVREPVF GEEDLADLTS FTASVHVLAR RFGTDGSKSF TLQTFHVPRS AGVPKKFLAA ELPETYVRRR ETLCGCMLWT SYGVYEIKSK VDIASTLRSF MSPNAVVAST RRDAWAPIER DEFGDNALSV DLDQDERLKL VAHVLDEDHM PVFVEAARQE LKRQNFSRAR DLFGKTGKPL KDFISLSLEA WEASQALTNF HGKSVEAYGG QANLSWLKTA AAAHAHLHAW CEASNAVAYN AAAHDLGESI QAKLQKLNLK EERSPEQAVK DLLRVIGEGI EATKDAGVQR DVACSASAAV VAFEAANAVS TSVISSCAAN EYSDRIKTLF TMLLSSSSTV ESLHMLGGVT ANLMTQAAEH SGGTVRTWES KPLVFWDPNV TYYVIATQTM ENLYDIVHVL GLDSDTTDTC EASPLELVYD MLTRDELRSL ADMAREAKNL GVTGAAEIEL TILLYLDDEN ALAERIEVML EDDPKLFGWI ASKCLDKRKF RITELAASQM EDFATAAMCH VAAVQKLAAV GETSPSTLQA ELELGVETYV ARVVDTRAQV KSIEDVARCW QRNGLPIDEL ERLFLDVLVS QGHAEAMQVV LQSDLGFQFS GQFILAVATK RVAEDESKYS SKDGATIESV WLKIKQDLAS RLDSPEFVQT RAFDAMELDS LTSVDGAQCW AFTCGHRYGS EELQREVNDA KARLKILDLP LSSMLLESDY KLQKCAVACP NCVSYAVEHY VEVRRNTRGA A
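
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be cleaned and used to compute GC content follows. The function names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full record.

```python
def clean_sequence(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_percent(display_text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = clean_sequence(display_text)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used here as a short excerpt.
excerpt = "ATGTTCGGGT ACGCGCAGAC"
print(clean_sequence(excerpt))
print(gc_percent(excerpt))
```

Applying `gc_percent` to the complete gene sequence would give a value to compare with the reported GC content field.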
|
| |