Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17704 |
Symbol | |
ID | 5004857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 501619 |
End bp | 504681 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420278 |
Product | predicted protein |
Protein accession | XP_001420868 |
Protein GI | 145353102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.9022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCG ACGCGCGCGC GACCGGCGTC GACGTCGACG CGCGACGCGC GAACGTCGAC GACGCGACGC GACGCTCGAA GGATGCGCGC GAGACTTTGC GCGAGGGACG CGCGCGCGAC GACGACGACG CGCCGCGCGA CGACGACGAT GGCTCGGACG ACGGTCGCGT CGTCGCGCAC GAGGATTACT CGTGCGCGAC GCCGTTCGAG ACGCTGGCGC GGGACGCGGA GACGACGCTG CGGGAGTGGT TGGACGAGGA CGCGGCGACG GTTTCGACGC GCGCGAGCGC GGCGGCGGCG TTTCGACGCG AGTTTCGCGA ACACGGCTTG CACTGGCGGA AAGAGGCGTA CGGGTGCGAG ATGTACGTTG ATGTCGAACG GCTCAGACGC GGACGCGCGT GGGAGACGCT GCGAGGGAAG AGTTTGCTCG AGCTCGCGCG CGACGCCGGA CGCGACGGAG CGCGCGCTTC GTCCGAGTAC GACGCGAGGA AACGAACGGT GGATGGGTTC GATTACGTCA CGTTTTTCGA TTGTGCGTGC GTGATTTTAG TCCGTCCGAT GAGCGCGAGC GGGGCGGGGC AGTTCGTGGA CGAGGACGAG GCCCTGACGG TGCGATCCGC GTTCGCGTTG GCGATGTCGG CGCTTCGCGT CGGCGGCGCC GTGGCGGTGG CGACCCCGCG AGGTTTTTAC GAGCGCGATT TTTCCGCGGA CGTGTGCGCG GGTGATGATA ACGACGATGC GGTTTGTGAA ACCGACGTCA CAAGGTTCAC GCGTTTTAGT TCGCGCGAGG AGGTAACGGC GCGAACTTTC GCGGGGTGCG TGAACGCGTT TTACAGAAGC GTTCGCGAGA CGCGTCGCTC GAACCCATCT ACGGCAGATA TCGACGCGGA AACGAAAGCC TTGGAAAGTT CGATTGTTTC AGCGCGCATC ACGCGCGAGT TCACGCGCGC AAAGGCTACG ACATCGCCGA TGACGTCTGA AATTGAAGAC GAGAGCGACG ACGACGATTT CGGTTTGCGC GTCGACGAGT GGGACGACGA TTGTCCTTGG GCTGCATGGG TACACGCGGA GGATCCGTGG CGGACGTTAG AGGTGGACGC GCTCTGGCTC GATGTGCGCT TAGATTGCGT CACATTCTCC GATCTCGACG TCGAAGACGC GCCGATTTGG CGTTTGCGGG GCGACCTCAC GCAAGCGGCG CGAGATCGAG GCGAGGAGGA CGACGCCGTC ATTTCGAGCG ACACGTCATC GATGAGCGAG GCGATATTCT CGCTCATCGA GAGCGCGGAG ATCATTCGTG GAGAAGATGT CGAAGATGTG ACGGGGGCAT TGGACACGAT GACTTTGGCG AGCGCGGAGT TTTGGGTCGA GCGCGGATAC CAGACGCCTC ACATACCGAA CGACGAAACG GCTCGAGGGG TGACGCGTGA TATCTTTACG TCCTTCGCGA CGACGGCGTT GGTGACGCAA GAGTCGTTTC GTGAACCGTA CAAAACGGCG CCTGCCAATT CTATCCTTGC CCGTTTCGCC CTTCACGCGT GCTGCTCGTT GAAAAATCCG CGCGCGGTGG CGCATTCGTG GAACGCTTTC GCTCGCGAGT TGCGACGCGA ATACTGGGAA CGGGGACGAC TCGTTCCAGG CGTCGTGCGC GATGAAATTC TAGGCATCGA TCACGGCGCA TGCATAGTTT ACCAAAAATT GCAGATGCTT AATCAATGCA TCGCGCGAAG GAACGCGAAG AGCGACGCCG AGTTGCACGC GTCGACGCGA GCGAGCGACG GCGCGCAGCC GAGATCCGCC GACTCGCCAC GTGGTGACGA GTTCGATCGT TTGCTCAAAA CCACGAGCTC GACGTCGACG AGAATGCGTC ATCCCGTGGA CGATGAATTA GATTTAAACG CGCTGTTGAG TGGCGACGTT GAGCTTTCGA GCGCAGCGCT CGAGAGAGCG ATGAGCGGGA GCGAGAGCGC GACGACGGCG AAGTCGGACG AGTACGCGAG CGCCGAGGAA GATTTGTCGC CCGAAGACGG CGACGACAGC CGCGCGGAAG GGGTCTCAGA GACACTGAAA ATTCGTTTGT TGAACGCACC GCATGCGTTC ATTCGCGCGC CGATCACGCA AGAAGCACCG TGCATGACGG AGGACGCGTT GGCGGAGCGC GAAGCGGCGC TTCGAGCGTT CGGAGACGAC GACGAGGGTC GCGCCGCGCG ACAACGCATT CAAAGCGATT CGCTCGTGTC CGATATGTCC GCGTTCAAAG CCGCGAATCC ACGCGCGGTG TTTGAAGATT TCGTGCGTTG GCACTCTCCC AAAGATTGGA TCGTCTCCGG CGCCGGAGAA GAACAAACAG ACGAATCGAG CGGACGCTTG AGCGACAGAA TGCGCCGCGA CGGAAACACG TGGCTCGAGC TATGGACGCG CGCGCCGCGC GTGCCGGCGC ATAAACAGCA CCCTCTGTTT GATCCGATCG TTGAGGGCGA GAAAGCGATG CATTATCTCG AGACGATCCC AGCCACCGCG CTCTTCGACG TCGCCGCGCG ATGCGCGTGC GCGGCGACGG CGGCTATTTT GTCGTCGTGG TGGCTCACGT CGTCGGAGGA AGCATCAGCG CGAGCACCGG CGGAGACGCA CGAATCGATC AAAACAGCAA TCGACACGTG CTCGCACTTT TTCGCGCGCA CCGAGCCGTT GACGCTGGAC GAGTACGATT TCGTCTTCGC CTCGCTCCAA GTCGCCGAGC GCGCGACGTT TCGCGCGGCG TCCGCGCGCG CCAGGCTTCC CGACGCCCCC CCGGATCTCA TTTCCCGCCT CCTCACCGCC GCCGACGCGT GCGAAACCAT CCGCGCTCGC GATATCCCGG CCTCATCCCT CCACGTCGTG TACACCGACT GCCAATCCCC CGCCGAGCGC GCGTACTTGG CGCTTCGCCT CGAGCGTCGT CGTCGCGTCG CCGAGTTCGC CGTCCGCGCC TCGCCACCCC TCCCCGACCA CGCCCATCGC GTCATCGCGT ACGACGACCA CACCCGCATC TGCGTTCGCA CCGCCGCGCG TCCGCGCTCG TAG
|
Protein sequence | MASDARATGV DVDARRANVD DATRRSKDAR ETLREGRARD DDDAPRDDDD GSDDGRVVAH EDYSCATPFE TLARDAETTL REWLDEDAAT VSTRASAAAA FRREFREHGL HWRKEAYGCE MYVDVERLRR GRAWETLRGK SLLELARDAG RDGARASSEY DARKRTVDGF DYVTFFDCAC VILVRPMSAS GAGQFVDEDE ALTVRSAFAL AMSALRVGGA VAVATPRGFY ERDFSADVCA GDDNDDAVCE TDVTRFTRFS SREEVTARTF AGCVNAFYRS VRETRRSNPS TADIDAETKA LESSIVSARI TREFTRAKAT TSPMTSEIED ESDDDDFGLR VDEWDDDCPW AAWVHAEDPW RTLEVDALWL DVRLDCVTFS DLDVEDAPIW RLRGDLTQAA RDRGEEDDAV ISSDTSSMSE AIFSLIESAE IIRGEDVEDV TGALDTMTLA SAEFWVERGY QTPHIPNDET ARGVTRDIFT SFATTALVTQ ESFREPYKTA PANSILARFA LHACCSLKNP RAVAHSWNAF ARELRREYWE RGRLVPGVVR DEILGIDHGA CIVYQKLQML NQCIARRNAK SDAELHASTR ASDGAQPRSA DSPRGDEFDR LLKTTSSTST RMRHPVDDEL DLNALLSGDV ELSSAALERA MSGSESATTA KSDEYASAEE DLSPEDGDDS RAEGVSETLK IRLLNAPHAF IRAPITQEAP CMTEDALAER EAALRAFGDD DEGRAARQRI QSDSLVSDMS AFKAANPRAV FEDFVRWHSP KDWIVSGAGE EQTDESSGRL SDRMRRDGNT WLELWTRAPR VPAHKQHPLF DPIVEGEKAM HYLETIPATA LFDVAARCAC AATAAILSSW WLTSSEEASA RAPAETHESI KTAIDTCSHF FARTEPLTLD EYDFVFASLQ VAERATFRAA SARARLPDAP PDLISRLLTA ADACETIRAR DIPASSLHVV YTDCQSPAER AYLALRLERR RRVAEFAVRA SPPLPDHAHR VIAYDDHTRI CVRTAARPRS
|
| |