Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19066 |
Symbol | |
ID | 5006626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 366545 |
End bp | 369658 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | |
GC content | 54% |
IMG OID | 640422047 |
Product | predicted protein |
Protein accession | XP_001422726 |
Protein GI | 145357031 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.88785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.539397 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGG CGACGAGGGA CGACGACGAC GACGAGGCGC TCGAAATCGT CGCGCGGGCG TGTTGCGAAG CGATGATGCG GACGTGCGAC GATGCAACGG ACGGACGGGC GGACGAGGAC GAGGAAACGG CGGAGGAGAG GGAGACGGAG ACGGAGACGT GGTTGGGGAC GCTGTGCGCG ACGGCGTCGA CGAGCGGGCG GGCGCTCGAA CGGGCGCTCG AAGCGACGCG AAGGGCGGCG GCGATGGACG AGAGGTTCAC GACGCGCGTG TGCGAAGAGT ACGACGCGAG CGTGCGTCGG TTAGGGGACG CGAAGGATGC GACGTTGGCG GTGTTGATCG AGGCGAGGGC GTGGGGACGC GCGCGAGAGG CGAGCGCGAG CGCGAGCGAC GTCGAGCGAG CGCTGAGAGC GTGTGATGCG AGATCGTTGG AGAGCATGGC GAAATACGTG CTCGAGGAGG TGAAAGAAAG CGACGCACAC GTATTGCGAG CGATCGCGCG CAGAGATTCG GAGATTTGCG TGGAAATTCT CGGCGCGTTG TTGTCTCGGC TGTCGACGGG CGTTCAAACG CGCGCGGCGC TTCGCGCGCT CGAGGAGATT GTCCAGATGG ACGCGGCGCC GCACGTCGAC AGAATGTCGA GTGAAATTCA GCGAGCGGTT GAATATTTAC CCCAGTTGAG CGTCAAAGAG GGGGGAGATA TCGCGCGCAG GGCGGTAGCG GCGCTCTCCA AAATCTCTGG CGTCGCGCCG GTGATGGCTT TGTTCGAAAG CGTCGAGGAC GACTGCTCGA AAACTATTTT AGGCTTGCAA ATGCTCGGTG ACATGATGCG AGCCGGCAAT CATCTTGACG CAGGTATCGC GGCTTTAGAA CGCGCGATGG TGGATGACCG CTCGAGTGTC CGACAACACG CTTTCGTCGT CGCGACGACG TTGATTACGA GCGAGTCCAT TGCGGACGAA GCGATGCTGC AAAAGGCTGA GGCAGAAGCA AAGGTTGACG CCATCTTATC GATCAATGAA ATCTTGAGAG CGTTATCGCA GCGAAGAGAG ACCCCTACGC TGAGTTTTGC GACGATTCAA TCCCTCACGA GCTCGCTCGG ATTGATTTTG GCGCTCGACG TGAGCGGCAA ATCTGTCAAG AATAGAGAAA TTTTGCGGAA CTCGCTTCGA TTGCTATCGG CTCTGAGCGA GTGTCTCATC CTTCAAGCCG AGTCAGCCAG AGACACAGAT GGTGAACAAA TCAACTTCGT CGACGCCGAT GCTTTCGATG ATGCAAACGA CAAAGTGGAG TCGCACGTAT TGGCTAAACT TGTACCAGTC GTTGAAGTCG TCCTCGATTC CATGCTGTCG TACTCGACGT GGTTCAAGGA GATGTTGGAA CAATTAGAAC ACCCTAAGAT CGACGAGGGT ACGAGCGAGA CGCTGGAGAG TTGGATCACG CGATTACTTT TCATGCATCT TCAGCTCAGT TGCGGTTCTG AGAACACGCG GGCTGGGAAC GTGTCATCTG CTTTGTGCAT TGATAGTTGC GTCAGGCTCT CGTCTGACGA TGCGGAATGG TCTCCGGAAT TGCAGCGTCA CATCACGCAA ATTTGTGCCC ACACGTTGCG CGTGGCGCGC GTTTCTTCCG GTGATATGCG TGCTTCTAAT CAGTACGCGA CACTTATGAA TACATGCGCG CGTCAATGCG TCGAAACATT GCGAGACAAC TGGTTTGCTT TCGATAAGAA GGAATTGGCG CCGAACTCAG ATTTGTGGTC GACAAAGACC ACTTTGGATG CTCTTCAAAC GCTCAGAGCG CTGTGCGAAC TCGCTGTCAG CCGTAATGCG TCGTACGGAC CAGATCTTGG CGTTTTACAA GACTTGATGC GCGCTTCTTT CACGGATGAT GAATTAGAAA ACGCCGCAAA GGAAGTCGCG TCTTACGGTA AACCTCATCT TGCGAGAAGT GATTACGAAG CCGCCGGACG CGCAGGCGTA GAGGCCCCAG TCTCAGGTGT TGTTGCCGCG CTCGTGACAT CTTTGCACCA TCAGCTCCAA GAGACGTATA ATTCACGCAC CGTGATCGAT ACGTGGGATC GGATCATCAA TGAAGTTTTG AACTTGCTGG ACACTATGCT GCCATTGGTG GAAAATGCAG ACTTGGCGTC GGCTTCTGTC GTCATGCTTG AGGACGATAT TATTCAATTG CGTGAGTTGC CCATCGAGCC TACGTGTCAC ATCGCTTGCA CGATCTTACA GTATCATCGT CGACGATTCC CTCTGCCAGA TTGCGTCAAG ACCGCGTCAG AGATGTTGAA AATGGCCTTG AGCTATTCCT TGACATCCAT ACCCGATGAG CTCAACGATT TGCTCAGTGT CTTGAATGAA ATCGTCACAG ATTTATCATC TGACAACGAT CAGCACGGTA AAATTGAGAA ACAAGATGCG TTCATCGCTC TGACGACGAT TCTAGTCAAG TCGGGTGATA TATTGTGGCG CCATGTGCAC TCAAAGCAGA AGTCGTTGGA CGACGTCGAG GCAAACTTCG TGTCGTTGGC TACGCATTCA TTCGAACGTT GCATTTCCGC GGCTTCGAAG CTTTCGCGCA CGGATGGAAA TCAAAAACAT TTGGAAAGTC AACGTCGAAA TGCATCGTTA GTCCTCGCTG GCATTCGCTT GCATCACAAC ACGGATAAGA TGTCACATCT GTACGACGTC AAGGTGCTCA GGTCGTTTGA ACTCAAACGA CGACAGTTTA TGGTTGTTTG CGCCGGAGTT TTCACCAACT TAAGGGCGCT TCAGCTCGTG AAGAACAGTG GCACGCCGCT TGATCGCGGT TTAGTGGCTA TTGGTAACGC GCTCGCGATG GATGACATCG AACTCTACAG CGAAAATTTG TCGCGTCTCG GTGAGAGCGT GCAAAACGTC GACGGAGAGG TGTTGAAATG CTGCCCGTCG AACGTTTCGT CTCGGGAAAA TTCTGCATCA AAGCCTCGTA AACGCATACG AAATCCTTAC TTGGACGCCG TCGTGGCGCA AGAAGGCGGC GCGCAAGATG AGTATGACGA TATGGCGGAT TTCATCGTGT GTAAACCTGG ACGAGATTAC CGCACCGTAC TCGGGCTCAC GTGA
|
Protein sequence | MATATRDDDD DEALEIVARA CCEAMMRTCD DATDGRADED EETAEERETE TETWLGTLCA TASTSGRALE RALEATRRAA AMDERFTTRV CEEYDASVRR LGDAKDATLA VLIEARAWGR AREASASASD VERALRACDA RSLESMAKYV LEEVKESDAH VLRAIARRDS EICVEILGAL LSRLSTGVQT RAALRALEEI VQMDAAPHVD RMSSEIQRAV EYLPQLSVKE GGDIARRAVA ALSKISGVAP VMALFESVED DCSKTILGLQ MLGDMMRAGN HLDAGIAALE RAMVDDRSSV RQHAFVVATT LITSESIADE AMLQKAEAEA KVDAILSINE ILRALSQRRE TPTLSFATIQ SLTSSLGLIL ALDVSGKSVK NREILRNSLR LLSALSECLI LQAESARDTD GEQINFVDAD AFDDANDKVE SHVLAKLVPV VEVVLDSMLS YSTWFKEMLE QLEHPKIDEG TSETLESWIT RLLFMHLQLS CGSENTRAGN VSSALCIDSC VRLSSDDAEW SPELQRHITQ ICAHTLRVAR VSSGDMRASN QYATLMNTCA RQCVETLRDN WFAFDKKELA PNSDLWSTKT TLDALQTLRA LCELAVSRNA SYGPDLGVLQ DLMRASFTDD ELENAAKEVA SYGKPHLARS DYEAAGRAGV EAPVSGVVAA LVTSLHHQLQ ETYNSRTVID TWDRIINEVL NLLDTMLPLV ENADLASASV VMLEDDIIQL RELPIEPTCH IACTILQYHR RRFPLPDCVK TASEMLKMAL SYSLTSIPDE LNDLLSVLNE IVTDLSSDND QHGKIEKQDA FIALTTILVK SGDILWRHVH SKQKSLDDVE ANFVSLATHS FERCISAASK LSRTDGNQKH LESQRRNASL VLAGIRLHHN TDKMSHLYDV KVLRSFELKR RQFMVVCAGV FTNLRALQLV KNSGTPLDRG LVAIGNALAM DDIELYSENL SRLGESVQNV DGEVLKCCPS NVSSRENSAS KPRKRIRNPY LDAVVAQEGG AQDEYDDMAD FIVCKPGRDY RTVLGLT
|
| |