Gene Information |
Locus tag | OSTLU_28022 |
Symbol | |
ID | 5005874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 117340 |
End bp | 121452 |
Gene Length | 4113 bp |
Protein Length | 1370 aa |
Translation table | |
GC content | 52% |
IMG OID | 640421295 |
Product | predicted protein |
Protein accession | XP_001421977 |
Protein GI | 145355456 |
COG category | |
COG ID | |
TIGRFAM ID | |
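The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The following is a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 117340
end_bp = 121452

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is
# counted in the gene length but not in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 4113 0 1370
```

Under these assumptions the computed values match the reported Gene Length (4113 bp) and Protein Length (1370 aa).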
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.647512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.726838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACTT ATTACGGATT TGAACACGCG GAGAGTGGAT TGCTGTTGCA GCGCCGACAG CGCGGATCGA GAAAGTTGGT GTTTTGTTCC ACAAAGTTTG GCGTCAACGA ACAGTTTGAT GCGACCGAGG GCGAGCGCGG GAGCTTGAAG TTGATCAATC GGCGGTGCTT CGGGGCGTGG GAGGTGATTT TGAAGGCTTT GGAGACTCCG GCCGAAGTGA GCGCGCGAAA GGAATCGTTT CAAGAGCACT GGCACGCGGC GACGGAGAAG GCTGTGACAC ACGGCTTTGG CGCCATGGAA AATCTCGTTC GCACGTTTCA AAAGGCGCGA GACGGACACG AGCACGAGTT GACGCATTTG AATAATGTTA TTATTCGCGC GATGCGTCAA AAGAGAGATC ATAAGATTGC GTACAGAGTC TTTTTCGCGT GGCGTAACTC GACGATGAAA TCAAAGTCGT ACAACATCTC GGTTCGAAGG GCCGGTGCTT TTCTCGGCGA ACGCATCGTC ATGACGGTGC GGAACGTCTT CGATGAGTGG AGGGAGCGAT GCGATCGCAA AAAACGCATG GTGCTAAAGG CTGATGAGCG ATATCAGAAA ATTAGAATTA GATTTTTGCG CGAGTACTTT TTTGAGTGGA AAAACCGTCT GTCTCGCGAC AAGTGGTGTC GATTAGCCGT GCAACGATGC TTGAAAAAGT CCGAAAGACA AATGAAACTC GCGGTTTTGA GTGTGTGGAA AAGCGATGTT GATAAATCAA AGGTCGATCG AGAAAAGAAA CGAAGAGCCG AGCGTATGAT GCTGGAAATG ATGAACCACA AGCTGTACTC GGCATTTTAC AGTTGGCGAG ACGCCGTGAC GCAGAGTCGC ATGAACGATG CCAAGGCACG TCAGAGCGTC GCGAAACTGT CGACTAGACT GATATTTAAG GCATTCGTCG AGTGGCGCTT AGTCGTAGAC ACTGCGCGCG CGGAAGCGAT GGAAGGTAAA AAGGCCATCA CGTGGTTTTT GTGCTCGACG CAAAGAAGAG TCTTCACGCA GTGGGTCGGC GTCGCGCGAG AAAGTAAGCG CTTACAGCGA ATGGCGGCGA GATTTATCAC CCGGCGAACA TCCTTACAGC TGTGCAACGC ATTTTATGAG TGGAAAGAGA TGCTACACCG AAGTTCTGTG TACAAGGTTG CGATGGAAAA AGCAATCAGA CGTTGGCAAC AGCGACGTCT CGCCAAGGCA TTTGCGCAAT GGAGCGAAGT CGTGGAGCAC AAAAAGTACG TGCGCGTGCA AGCTCACAAA ATGGCTGAAA AAATGCGAAT AAATTCCTCG ACAGCGGCAC TATCAATGTG CTTCTGGGGA TGGCTTTCTA TCGCCCAAGA ATCGCGCAAC GTGCGCGTGA CGGAGCAGTT GGCGAACGAA CTCTTGGAGC AGCGTTTGGA CATTTTCTGC AAGATTCATG CGACTCGCAA GGCGAGAGCA GCTTTCGTGT ATTGGTACAA GTATGCCATG AGCCAGCGAG ATCAACGATT GAAATTGACT CTCGCACTGA ATCGTATGAC GTCTAGACTC CAGTTCACTG CATTCAACAC GTGGGTGCAA GTGGTTGAGG ACAGGAAGCG TCAACGTGAG CTCATGCGAA CCGTTTTGAT GCGCGCATCG AATCGTCTCA TCTCATGCGC TTTCAACGCT TGGCGCGAAG TCACCGCGGA TTCGATCGCG GCAAAAATTC ATTTGAAAAA TATTGAAAAT ATCGTCAACC TTCAAGCGAA GAATGCCGCG AAAGAACGAC TGAAGAGAAC GTTTTTGCAG 
TGGAAAGACT ACGCTGTCCA CACGCGGCGT CAGCGGCGGG TGGTGGCAAA GGCTATCACT TCCATACGCA AGCAGGCGCA AGCTAAAGCT TTCGCACGAT GGAGAGCGTC GGCAAAAATA TTTGCGCAGC AGCGAAGAAC GCTCGTTCGT GTGACGCAAA AGATGCAACG AAACAATCTT CGTATGGCGT TTGATACCTG GGCAGAACGT GTCGACGAAG CCAAGGTGCA TCGAGTCATT TTCCAGAAAG CGATTCAAAA GATGTCGCAG TGCAAGCTGT ATTACGCGTT TTCAGGATGG GTTGCGCGCG TCGGCGAAAA GAAGACCCAA CGCGCGTTGC TTAACCGCGC CGTTTCTCGA TTCAGAGGAA GACGTTTGCA CGTGGCGTTT TACGACTGGT CGAGCACCGC CGCGGCGCTG CGACATCAAC GACAAGTGAT CGAGAGAGTC GTGTCAAGGA TCAGGAACAG ACTCTTGGCA GGTGCGTTTG AGCAGTGGAA GCAACGAGCG AGCGAGCAGC GAATCGATCG ATGGAAGATG GATCGGGCTC TCACACGTCT CACGCAACGA GTCATTTTCA CAGCTTTCAA TACTTGGCTC GACCACGTTC AAACAAAGAA ACGTTATCAA GCAATCATCG GCAGGTTCTA CGAGAGATTT AGAGACAGAT CTTTGCGAGG AACGTTTAAA ACGTGGGTTC ACGCCACGCA GGAGGCGAAA ATGCGTAGGA TGGCTGACAT GAAACAAGAG CAACTTCGAT CGAACAAGTT GGCGCAAATC TTAGGAAGCG TGAAGCGGCA ATCTCTCGGT TACGCATTCA TGCAGTGGCG CGATCATGTA CAGGAAATCA AGCAAATGAA AGTAAATGAA AGCAAAGCAC GCGGTGTGCT GGCGCGAGCG CGAATGCGCG CGGTAGCCCG TGCGTTCAAC CGCTGGGTAT TTTTCATCGA TGAACGACGA CGCGTCATGG ACGCCGCTCA CATGGTGATT CTTCGAGTGA AGCAGCGTCA CTTAGCGTAT GCCTTTGACG GTTGGTTAGA CGCCGTGCAC AAAAAGAAGC GAAATCGACT ACTCGTCGCG AACAGTTTGA GGAAAATGAG GTACAGAATC ACAGTCAGGG CGTTTTACTC TTGGATTGAA AGCGTGGACG AAGCTCGCGC GTCTCGTGCG TACGAACGTC GCATCGAACG CGCAGTGAAG ATGTCTTTGA CAAAAGTGCT GAATCGAACG CTTTCTCGAG CTTTCAACGC GTGGAACTAC AAGATGATCG AACAAAAGCG TCACAGGACG CTCGTTTCAA AGTCTTTGCA CCGCGCGCGC AATAAAACGC TGGCGCAGGC CTTTGATGGT TGGTCCACGC ACGTGCTGAT GATACGCAGA CAAAAAGAGC TCGTTTCTAC CAGCCTGCAA CGCATGCGCC GTCGGGCTCT CGTCAAAGCG TTTAATAGTT GGTCGGGATA CATGAAGCAA ATACGTTCCT TCCGTGTCGT CGAACGGCGA CTGCAAAATG TCGAGCGCGC CATAGCGCCA CTGCACGTGA CTCATTCCTC GGTTTCAGAC ATGGTCCGAG TCAACGTAGC CATGCGCTGG GGGCTCGCTC GTAACGAGCG CATCTACAGA AACCCAATGT TCATGGCCTG GGTGCGGTAC TCGCAAAGAA TTTCTGAGCA CAGAAATCGC ACGGTGAAGA AGATGCACGA TATTCTTGCA GACAGAGCTC GACGAAAGTT TTTGCGCTCG TGGCGACAGT TCACGGAAGT GATGAAGTAC CATCGATTGA AAACCGAGCA TAGGCAGAAA CGAGTCGTTC 
GCAAGATTTT TAGCGAATGG AAAATGAACG CTCGATCACC CAGCGGCGCG CAGGAGACGT ATTCGCTCAC CAAATATGAA CGCCCGACGA TTTCGACAGG TTGGGATTTT GACAAGTCTT ACGAAGAAAA TATTCGTCTT CTCGGACGCA ATCGGGTGGC GTCGAAGGAA TTCGCGTATG GCTCTCGATA TTCAACTCCG ACAGCATCGC CTCGAACGAT GCCGACGGTC GTCGAACCAG ATTCGTACGA AAAGCAATAT CGTGCCGTGA TGAGCGACGT TCAAACCATG GAAGCTGAGG TCGAAGCCCT GTCGACGGTT CGAGAAAGCT TGCAAGACCA ATTTGACCTT CTCGCGCGCG ATGAAGCCTT GCGCGCCACG TTTGCTCGAG CGACGTCGGC TTCTGTTTCT CTATTGGAGC AAACTAAGAG CACGTACTAT AGCGACCGGC GATTCAGCGA ACCATTCAGC CCAGTCAAGG TGAGTCGACA ACCGCCGCGG TGA
Protein sequence | MKTYYGFEHA ESGLLLQRRQ RGSRKLVFCS TKFGVNEQFD ATEGERGSLK LINRRCFGAW EVILKALETP AEVSARKESF QEHWHAATEK AVTHGFGAME NLVRTFQKAR DGHEHELTHL NNVIIRAMRQ KRDHKIAYRV FFAWRNSTMK SKSYNISVRR AGAFLGERIV MTVRNVFDEW RERCDRKKRM VLKADERYQK IRIRFLREYF FEWKNRLSRD KWCRLAVQRC LKKSERQMKL AVLSVWKSDV DKSKVDREKK RRAERMMLEM MNHKLYSAFY SWRDAVTQSR MNDAKARQSV AKLSTRLIFK AFVEWRLVVD TARAEAMEGK KAITWFLCST QRRVFTQWVG VARESKRLQR MAARFITRRT SLQLCNAFYE WKEMLHRSSV YKVAMEKAIR RWQQRRLAKA FAQWSEVVEH KKYVRVQAHK MAEKMRINSS TAALSMCFWG WLSIAQESRN VRVTEQLANE LLEQRLDIFC KIHATRKARA AFVYWYKYAM SQRDQRLKLT LALNRMTSRL QFTAFNTWVQ VVEDRKRQRE LMRTVLMRAS NRLISCAFNA WREVTADSIA AKIHLKNIEN IVNLQAKNAA KERLKRTFLQ WKDYAVHTRR QRRVVAKAIT SIRKQAQAKA FARWRASAKI FAQQRRTLVR VTQKMQRNNL RMAFDTWAER VDEAKVHRVI FQKAIQKMSQ CKLYYAFSGW VARVGEKKTQ RALLNRAVSR FRGRRLHVAF YDWSSTAAAL RHQRQVIERV VSRIRNRLLA GAFEQWKQRA SEQRIDRWKM DRALTRLTQR VIFTAFNTWL DHVQTKKRYQ AIIGRFYERF RDRSLRGTFK TWVHATQEAK MRRMADMKQE QLRSNKLAQI LGSVKRQSLG YAFMQWRDHV QEIKQMKVNE SKARGVLARA RMRAVARAFN RWVFFIDERR RVMDAAHMVI LRVKQRHLAY AFDGWLDAVH KKKRNRLLVA NSLRKMRYRI TVRAFYSWIE SVDEARASRA YERRIERAVK MSLTKVLNRT LSRAFNAWNY KMIEQKRHRT LVSKSLHRAR NKTLAQAFDG WSTHVLMIRR QKELVSTSLQ RMRRRALVKA FNSWSGYMKQ IRSFRVVERR LQNVERAIAP LHVTHSSVSD MVRVNVAMRW GLARNERIYR NPMFMAWVRY SQRISEHRNR TVKKMHDILA DRARRKFLRS WRQFTEVMKY HRLKTEHRQK RVVRKIFSEW KMNARSPSGA QETYSLTKYE RPTISTGWDF DKSYEENIRL LGRNRVASKE FAYGSRYSTP TASPRTMPTV VEPDSYEKQY RAVMSDVQTM EAEVEALSTV RESLQDQFDL LARDEALRAT FARATSASVS LLEQTKSTYY SDRRFSEPFS PVKVSRQPPR
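Both sequences are printed in space-separated blocks of ten characters, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand. The sketch below shows one way to strip the block formatting, compute GC content, and translate the nucleotide sequence. Because the Translation table field is empty in this record, the standard genetic code (NCBI table 1) is assumed. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the record leaves "Translation table" blank, so table 1 is used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw):
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAAGACTT ATTACGGATT TGAACACGCG"
print(translate(clean_sequence(first_blocks)))  # MKTYYGFEHA
```

The printed peptide matches the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` and `translate` allows the reported GC content and protein sequence to be checked the same way.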