Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32968 |
Symbol | |
ID | 5003352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 163505 |
End bp | 166733 |
Gene Length | 3229 bp |
Protein Length | 1031 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418773 |
Product | predicted protein |
Protein accession | XP_001419089 |
Protein GI | 145349330 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.809445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCCACGGCGC GTGCGCGTCC GCGCGCGCGC ATCGCGACGC TCGGGACGAT CGGTCGCGTT CGCGAGCGCG TTCGCGAGCG CGCGCGGGGC GCGATTCGGT CGAGGCGCGA CGCGAACGCT TCGGGACGTC GACATGCGGG CGGATGAAGT CGCGCGGTGC CTGGCCGAGA CGCTCTCGCC CGACGCCGTC GCGCGCGCCG AGGCGCAGCG CGCGATCGAG CGCATGGGAG GCGAACCGGG GTTCGCGGAG ACGCTGGCGT CGATCGCGCT GCGCGGCGTC GAGGGCGCGG TGGTGGACAT CAGCACGAGA CAGCTGAGCG CGGTGCTGCT GAAGAAACAC GTGCGAGAAC ACTGGAACGC GCTGGATGAA AGGTTCGTCG CGCCGGAATT GACGGAAGCG GAGAAGCAAG GGTTGAAAAC AGTGCTCCCG AAGGGACTGG CGGATGAGTC GAGCAAGATG CGCACGGCGT TCGCGGCGGG GATCGCGCAG GCGGCGGCGA GCGACGGCGC GATTTGGGAC GAGCTGACGA CGACGTTGGT GGAAGGAATA CGAGCGAAAC GATCGAGGTC GGAGGTTTTG GGGTGCTTGA AGTGTTATGA GATTATTGCG GGGGAGATTG ACGCGAAGGA TGTCGCCACG GTGGGGCCGA CGTTGTTTCC AGAGTTGTTG ACGCTGGCGA GGCACGGAGA GGACGGCGCG CTACGGAGGC GAGCGGAAAC GGCGTTTTCG TCGACAGTGA GCGCGTTGAC CACGCTCACG GGGACGGAGC AAAAGGAAAT GCGAGACATG TTGTTACCGT ATTTGCCCAC GTGGTTGGAG ACCGTAGCGA TCGCGTTGGA GGGAATGCCG AATCCGAACA ATTTCGACGC GTTGGCTTCG ACCATGGCGG CGCTCACGAG TCTCGCGCTC GCGGTGCAGT ACTTTACCAA GCCCGCGGGC GAGGCGCTGA TGCCGGCTTT ATCCCGTGGG GCGATAATGT TTCACACCAT AGCGCCAGTT TGGGCGAAAT ACTCAGAGGA GACGGATCAC TTGGATCCGG GCATGGATAG CGACGGCGAC ACGGTGAGTT TCGAGGCGGT GGTCACAGAA CTCCTGGAGC TCGTCATCAA CATCGCGGAG CAGCCTAAGT TGAATAAGCT GCTCGAGCCG AATTTAGCGG ATACGTTGTA CGTCACGATG GGTTACATGA CGATGAGCTC ATCGCAAGAG GAGATGTGGA TGGATGATCC GAACCAGTTC GTCGCGGACG AGGACGACGA TTTCGGCAAT GTGCGTGCCG CGTGCGGTCT CATGCTTGAT TCTCTCGGAG AAAGATTCGG CGTAAAGGCG GTCGCTGCGC TCTGGAACGC ATCAAATAGA CGATTGGCGG AGTCCATATC GGCGCAACAA ACCGGTGACT CGATGTGGTG GCGACCGCGA GAAGCGGCGC TCCTCGCCGT GGGCACGATG AACGAGGTCG TCTTGTCGAG CCTGGAGCGC GCGCAAGAAA AGGGCAAACC CGCACCTTTC GACATCGCCG CTTTCATGAA AACGGTGATT GAGAACGATT TGCACGAGAG CACGGCGGCG TCGGCGCCGT TCTTGCGTGG CCGAGCGTTG TGGGTCACGG CTCGCCTATC GAGCGGCGTG CCGACGGAGA TGGCGGATGC CATTCTCAGA GCTTCGGTAA GTTCGCTGGC GCCCGGTTTA GCGCCACCAC TGCGTATTGG TGCGTGCCGC GCGATCGCTG AGTTTTTACC GATCGCGAAG AAGGAAGTGA CGACTCCGTA CATTGGTGAA ATTTACAAAG GCTTGGGCAA TTTGCTCGTC GACGCCGGCG AAGAGACTCT GCATCTCATT TTAGAAGCCA TGCTCGTGCT CATTAAGGCT GACTCCGACG CTGCAGCGGC GTGGTTGAGC GCACTCGCGC CGGCGGTGGT GAAAATATGG GCGGAGTACG TCCGAGATCC CTTGGTGAGC GCAGATACCA CCGAGGTGTT TGAGGCGCTC GCGGAGATTC CTGCGTGCCA GGCGCAGTTG CACACCATGC TCGTGCCGAC GCTTTCGCAC ATACTCGCAT CGCCGAGCGA ACAACCTGAA ATGTTAGTGG AGGCGACGCT CGATCTATTG ACAATCATTC TTCGCCCGGC GTCGCCTGAG ACGGCCAAGG CTACGCACGA CGTGTGCTTC AAGTACGTGT GCGGTCTCAT CATGCAGAGC GACGACGCCG GCGTGATGCA GGGTGCGTCC GAGACGTTAC GCGCATTTCT TCGCGCGGGA AAAGAGAACA TGCTCGAGTG GGGTAGTGGT GACCCGACCG TGGGCGGTGG CGACGTGTTG CGCGCGATGT TTGAAGCCGC GTCGCGCTTG TTGGATCCAA ATTTGGAGGA CAGCGCAAGT CTGTACGCGG CACCTTTGCT GTGTCAAATG CTTCGTCGTT TGCCGACCAA GGTGGGCCCG GTGCTTCGCG ATATCACGGC TGCGGTCGTG GCGCGCTTGC GCTCTTCAAA GCAGCCCAAT CTGTCGGCGT CGTTGTTGAC GGTGTTCGCG CGCATCGTGC ACGTGGACGC CAACGCGTTC ATCGAGCTCT TGATGTCGCT TCCGAGCGGC GGTGACGAGC CGAACGCGTT CGATTTCGTC ATGCGACAGT GGTCAGAGAA GCAATGCGAT GTACACGGTT CGTTTGACAT CAAGTTGACC ACGACCGCGC TCGGTTTGCT TCTCAACACG CAGAGTCCAG CGTTGCACGC CGTCGTCGTC AAGGGCCAGC TCGTGGAGAC GCCCGCGGAG AGCGGCCGCA TTCGCACGCG CGCGCGCGCT CAAGCCAACG GCCCAGAAGT CTGGACCCAA ATCCCACTCT CCGCCAAAAT AGTCGAACTC TTAGCCGACG TCCTCATCGA GTACGCCGAA GGCATGGCCG GCGCCGAAGA CGACGAAGAC GAATGGGAGG AGGAGCGCGA CGACGACGAA GACGACCCCG ACGACGCCGC AGACGACGAC GACTTCACCG GCGAGGAAAA GGAATTCACC GGCGACTTAT TCGAGCGTCT TCTCATGCGC GGCGGTCTCG ACGCGTTCGA TCCCGACGAC GCCGACGAGG CCGAGGATCC CGTGAACGAC ATCGACGTTC GCGCCTTCGT CGTCGGCGGC TTTCGCGCGC TCCACGCATC CGGCGTCCTC GCCCCGCTCG CGCAGTCCAT CGCCACCAGG CACCAGCGCG CCATTCACGA CGCGCTCACG CATCAGTAA
|
Protein sequence | MRADEVARCL AETLSPDAVA RAEAQRAIER MGGEPGFAET LASIALRGVE GAVVDISTRQ LSAVLLKKHV REHWNALDER FVAPELTEAE KQGLKTVLPK GLADESSKMR TAFAAGIAQA AASDGAIWDE LTTTLVEGIR AKRSRSEVLG CLKCYEIIAG EIDAKDVATV GPTLFPELLT LARHGEDGAL RRRAETAFSS TVSALTTLTG TEQKEMRDML LPYLPTWLET VAIALEGMPN PNNFDALAST MAALTSLALA VQYFTKPAGE ALMPALSRGA IMFHTIAPVW AKYSEETDHL DPGMDSDGDT VSFEAVVTEL LELVINIAEQ PKLNKLLEPN LADTLYVTMG YMTMSSSQEE MWMDDPNQFV ADEDDDFGNV RAACGLMLDS LGERFGVKAV AALWNASNRR LAESISAQQT GDSMWWRPRE AALLAVGTMN EVVLSSLERA QEKGKPAPFD IAAFMKTVIE NDLHESTAAS APFLRGRALW VTARLSSGVP TEMADAILRA SVSSLAPGLA PPLRIGACRA IAEFLPIAKK EVTTPYIGEI YKGLGNLLVD AGEETLHLIL EAMLVLIKAD SDAAAAWLSA LAPAVVKIWA EYVRDPLVSA DTTEVFEALA EIPACQAQLH TMLVPTLSHI LASPSEQPEM LVEATLDLLT IILRPASPET AKATHDVCFK YVCGLIMQSD DAGVMQGASE TLRAFLRAGK ENMLEWGSGD PTVGGGDVLR AMFEAASRLL DPNLEDSASL YAAPLLCQML RRLPTKVGPV LRDITAAVVA RLRSSKQPNL SASLLTVFAR IVHVDANAFI ELLMSLPSGG DEPNAFDFVM RQWSEKQCDV HGSFDIKLTT TALGLLLNTQ SPALHAVVVK GQLVETPAES GRIRTRARAQ ANGPEVWTQI PLSAKIVELL ADVLIEYAEG MAGAEDDEDE WEEERDDDED DPDDAADDDD FTGEEKEFTG DLFERLLMRG GLDAFDPDDA DEAEDPVNDI DVRAFVVGGF RALHASGVLA PLAQSIATRH QRAIHDALTH Q
|
| |