Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30983 |
Symbol | |
ID | 5001145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 92864 |
End bp | 95760 |
Gene Length | 2897 bp |
Protein Length | 956 aa |
Translation table | |
GC content | 55% |
IMG OID | 640416566 |
Product | predicted protein |
Protein accession | XP_001417424 |
Protein GI | 145345874 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.484466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAATC AGGTGAATTC GTACGATGCG ACGTCGCCGG GTAGGATATT TTCCGATTTA AAGTTGACGT TCCCGTGCGC GGCGACGTGC GGCGGCGACA TTGACGTCGA GTCGACGTGC GATCACGCCG TGAGTAGTGA CAATGACGAG CTCATCTCGT GCATTCACGA CGCGTGCGGT TGTCACGAAT CGTATCAACT CATCGAGGCG TTGTTTTACA TTTGCGACGC GCCGATGAAG GCGGACGGAA AGTTTAACGG CCCCGAATAC GCCAATTTTC TGGACGAAAC CGTCTCGGCG GTCGAAGCGC ACTGTGGAAA GGCTCACAGT GTGCTCAGCA TTCTGACCCC TATCCGAGAC ACGGACGACT CTGGCAATAT TAAGGCCGCG CCCTGCTCGT TGCCAGCGGA TGATCCGGCG CCGGCGCACT GCGTCGCTCA AGCTAACGTC ATCTCTACTT GCTCATATGA GGCACCAATG TGCGGCGCCA ACTCGTCGCG CACGGGTATT CTTCGTGTAC ACGGCAGCAC GACAGAGAAA TGGAGCTGCC CGGCATACAC CTGCGATACG TGCGATGGTG ATACGTGTTC TCGCTGCACT TGGACACGTA GGATTTGCAC GAGCACGCAT GAAGCATACT CCATGCCGTG CGCGTACGAA GGAACCCGCA AAGGTGAGTG CATCGAGTGC GAGTACGGCT GGTTGATGAA CAGCGAGACG CGTGAGTGCG ACTCGTGCGA TCAATATTTC AAGATGGACA GCTCCGGGGC GTGCGTCGAG GATACTGAAG CTTTGAAGAA AGACTTCAAC ATGACGGATG AGGACTTGAA GCAAGCTGAA GAGATGTTCA AAGCGTCGAT CGATACGTCG GACCCTGAGC TCGGAGCTAA GCTTCGCGTA GCTGCACTCG GAGCATTCTG GGACGACTGG GCGTCAGAGC TGGACTCTTT GAAGAACAAG ATCGACGATG GTTTGGATAC CATAAACGAT TTCAAGAATG ATCTCATCGA AGACATACAC GGCATCGGGA ACGCGATCGC GTCTCTCGCG GGTGACCTCG CAGATCTCGG CTCAAAACTC GCCGATTTGG TTTATGACGA AGTGTCAGCC GTTGTGCCGA ATGCCAACGC CCTCATGAAC GGACTCAGTC AGGCTGCGGG ATGCAACAAC GACGACAGCG CGTCTTTGGG CGCCGATCGA AAAACAGTGA AGTCGAGGCG GCACCACGAC GTGCGCAAGC GCGTGTTGGA GGCAGTATAC GGTGTGGAAA TCGCATCGCA CTACGCCGCA CCTTCTGAAA GCGCTCCACT CGGAGGTTCG TGCAGTCACA ATCTTTGCGC AGGCGCGCTG TGCTACCAGC TCGACGTCAA GGACATCCTA AAAAAGCACC CAGATTACTC AGATGGCGTT TTCTCACTCA CCAAAACGAG CTCTGTCCCG GACGAGCCTC GGTTTGAGCG TTTCCAACCC GATTACGTCA AGGCATTTGT CGATGGCAAC ATCAAAGCAT GCGCCGGGGT AACGCATTTT GGGTTGGACA TGAGTGTGTT GAACGACGTC GCACTTGAGT TCGTCAAAGT TGTCGAGCCG ATCGTCGATG ACGCAGTCAA AGCCATCACG GGTTGGGCCG ATGGAATCAC CGACTTCGTG AACGAAATCG GCGAATCGCT CACGGACATC TTTTCGTCGG TGAAATCGAC CGTCGCAAAC ATTAACGTCC CGGGCTTTGG ACGTAAGCTC TTGTCTGGTG ACACGCAAGA ATTCGAGCAA CTAACCAAAC ACGAACGAGC GCTGGTGCTT GGACACATCG TCGAGGAGTA CATCCAACGC GTCAACTTGA TGCAGAAGAA GGCGTTGAGC GCTATTGCCG AAATTCATAA CTTGGTTTCG AATGATCCGC ACATGCACAA CATGAAACCT ATGGCGGATC CCAAATCTAG AGCGCGCCGA GACATCGCCC ACCTCGGCGG ATTCAAGCTC GACTTCAGCG TCATCGAGGA CGGTATCATA TCAGTTCTTC GAGGCGCCTT GAATACGATC TCCACTGGCG TTTCTTTCGA CGTCGATTTT GAGCAAACAT TGGAATTCGA CGCCAAAGGA GCTCTCTTTC AAGAAGGTGA TCTCCTTGAC GGCCAAGCCA ATCAGGAATT GTTCAAGATC ACACCCATCG GCCCGACTGG ATTCGTTTTG GTCGTCGGAG GCGTCGCGAA AGTTCAACTC CCATATTTCT TGCTCGCAGA AGGCAGCGGA AAGCTCGGGT ACAAGATCAA GGGTAAAAAT GTTGGTTACA CGGTGAGCAT CTCCGACGGC GTCGCCACCG TCGCGCCTAA GTCTTCGGCA AGCTTGGACA TCACGCCGAC GATCGACGGC ACGATTTCCT CGAGCCTCAA GGTCGGCGTC GTGGCAGCGC TCGAAGAATT CCACATCAAA CTCTGCTGGG GCGGCATCAT CTGCATCGGC CCCGAAGTCA CAATCAAACA AGGGATTCAA TTCGGCGCGG ATTCCGTTCT AAACGACGGG GTCGGCGCGT CCACGGACTC CTCTTGTTAC AACGGACGCA CCGGTTTGTC TCCCGTCTTT GGCGAATTCC TCACGTCCTA CCCGACAACT TCAAACCAAT GCAGCGCAAG TGCCAACGTC GTCGGCGCCG GCTTGTACGT CGAGTATCCC AAACCGTTCG TGACTACCGA TATTATCACC GAAGTACGCG GCGACACCGA GTGCGTCCAG CCTCTGTCGC TCTTGGACAT GACGAGCCGC GCACAGTATA GATCCAGCCA TTCGACGTCG TGCGCTGCCA GCACAGATCC TGTGGAGGAT TGTGGCGCCG CTGTCGACGC GTGTCCGACC GACTGCCCGA TCTACGCCTA GCCGTTTCGT GCGCGCAGGG CCAATTT
|
Protein sequence | MYNQVNSYDA TSPGRIFSDL KLTFPCAATC GGDIDVESTC DHAVSSDNDE LISCIHDACG CHESYQLIEA LFYICDAPMK ADGKFNGPEY ANFLDETVSA VEAHCGKAHS VLSILTPIRD TDDSGNIKAA PCSLPADDPA PAHCVAQANV ISTCSYEAPM CGANSSRTGI LRVHGSTTEK WSCPAYTCDT CDGDTCSRCT WTRRICTSTH EAYSMPCAYE GTRKGECIEC EYGWLMNSET RECDSCDQYF KMDSSGACVE DTEALKKDFN MTDEDLKQAE EMFKASIDTS DPELGAKLRV AALGAFWDDW ASELDSLKNK IDDGLDTIND FKNDLIEDIH GIGNAIASLA GDLADLGSKL ADLVYDEVSA VVPNANALMN GLSQAAGCNN DDSASLGADR KTVKSRRHHD VRKRVLEAVY GVEIASHYAA PSESAPLGGS CSHNLCAGAL CYQLDVKDIL KKHPDYSDGV FSLTKTSSVP DEPRFERFQP DYVKAFVDGN IKACAGVTHF GLDMSVLNDV ALEFVKVVEP IVDDAVKAIT GWADGITDFV NEIGESLTDI FSSVKSTVAN INVPGFGRKL LSGDTQEFEQ LTKHERALVL GHIVEEYIQR VNLMQKKALS AIAEIHNLVS NDPHMHNMKP MADPKSRARR DIAHLGGFKL DFSVIEDGII SVLRGALNTI STGVSFDVDF EQTLEFDAKG ALFQEGDLLD GQANQELFKI TPIGPTGFVL VVGGVAKVQL PYFLLAEGSG KLGYKIKGKN VGYTVSISDG VATVAPKSSA SLDITPTIDG TISSSLKVGV VAALEEFHIK LCWGGIICIG PEVTIKQGIQ FGADSVLNDG VGASTDSSCY NGRTGLSPVF GEFLTSYPTT SNQCSASANV VGAGLYVEYP KPFVTTDIIT EVRGDTECVQ PLSLLDMTSR AQYRSSHSTS CAASTDPVED CGAAVDACPT DCPIYA
|
| |