Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25083 |
Symbol | |
ID | 5003848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 583512 |
End bp | 585794 |
Gene Length | 2283 bp |
Protein Length | 740 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419269 |
Product | predicted protein |
Protein accession | XP_001419639 |
Protein GI | 145350494 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.156442 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.477507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGCGT GCGCGATGGG AAACGATGCC GCGGAGCCGT ACGCGAGCGC CGCGGGACGC GCGACGCAGG AGGTGCGGAA AGCGAGCGAC GCGACGCGCG GGGGACGGGG AGAGATCAGA GGAGAGACGC CGGATTTGCC GCTGTCTTAC GCTCGAAACG GCGGGACGTG CTCGACGAGC GATGACTCGA GGCCGAGTTC GAGGATGGCG ATGGGCAAGG ATGGGAATTC GCGCGCGAGC GGGGACGGGA CGACGTCGGG AATGGAGGAA GAGCGCAGAC GAGCGAGAGA GGCGACTCGC GCGTGGTTCT CGGAGCGCGC GGCGAACGGA GATCTGGAGC TCATGCGCGC GGAGTTACAC TCGATGGGAC AGAAGGAGTT ACAGCGCATG TTTGTCGAGA TGTTTGACCG CGCGACGACG AGCAACAATA ATCAGTGGTT GCGAAGACGA ATCGCCAACG GGTTGGGCTT GGAGGACGTC GCCGAGCACG TCGTGAGCTC GCAAGCCAAG TTGGCTTCGG CGACGAACGA ACGCGTTCCA TCGAAGAGAC AAAGTGCGCG TCGCAATGCG GTGGAAACCG CGGCAGTAAC ACCGGCGGCA CAGGCGACGC CGAACACGCG GGGCGCGGAG TCCCCGGACG AGGCGGCGGA GTTCACCCCT GATGGCGTTC GCAAGTCTCG TCGAGCGGTG AAGCCGAAGG CGATATTTGA TTCCGACGCG TTTCCTTCCG CGGCCAAGGT GAGTGAGGAG CGTAGGCGAG AACGAAAGCG ACAAGAGGCA TTGGAGCAGG CGGCGACGTG CGCGAGTGGT GTAACGACGG ATCATCACGG CGGCAAGGCC GCCGTAGGAC GCCGCGTGCG AGTGTACTGG CCTCTCGAAG GCAAGTTCTT TGCGGGTGTC GTCACTGCGT ACAACGCGCG CACTGGCTTG CACCACATCG ACTACGACGA CGGCGACAAG GAAGAAATCA AGCTGGCGAC GCCCGAACAG CCTCGCAAGT TTGAGGCCAT CGAGCACCCG GCGTTGGCGT CGACGCCTTC TCCGGCGAGT GACTGGCCTG CACTTACGAA TACTGTCGAT TCATCGATAT CGTTGTCTAT GCCGCAGCCG GGACGACCTG CTGATATTCT CAGTACGCTC CCGTCAAGCT GGCCGGCGGT GGGATCGCTT GTGTGGGGTC GGGTGCGCGG TCACGGCTGG TGGCCCGGAG CGGTGCACGA CAAGGACGCG AGTCACGACA TGCAAGAGAT CAGTTTCTTC GACAACAGTA GGGCTCGACT TCACCGCCAC GACTTGTTGC CCTTTCAACA GTATTACATG GTGCTGCGCG ACGCGAAAAA GACGCACGCG TACGCCGAGG CGGTTTCTCG AGCGGCGGAA ACGTATGAAA GCCGACGGCA GCGAACGGTG AAGCGACGTT CGAAGAAGGA AGAGCAATCA AACGTCGAAG AGTCGAGTGA GCCGCCGAGA ATATGGCATT TCGAACGCGA GGGGGCAAAG AAGTCGCACA AGCGGTCGAA AGACGTTGAC GTCGACGACG CAGGCGCGGC GAAGCGAGGC AAAGTGAACG AACACTCGAC GACGGTATTC GGTTCGATTG AAGCTCCGAA GACTCTGAAC GATCTAAATA AGAGCTTGGA AGAGATGAAG GCCAAGATGT TACCGCTCGC GAAAGAGAGC CGAAAGACGC TCAACAAACA TCTGGTAGAC ACCGCCAGAA AAGTGAAAAC GGAGGCGGCT GACGACGACG AAGAGACACT GATCGCCGCC GATGAACGTG TCGTGGTGCT CGACGAGTTG GCGTCCATCG AATCCTTGAT TGCGTGGAGC GAAAACAAAG CTGCGAAGTC GCCCGCCAAG ACTCCCGATT TGTCCTTCAT GACAAAACGC GGCGATGACA TTTCACCCTC GGACCCGAAA GGGCTTCTAC GACAACCGAG TGAAAACTTG TTGTGCTTGG GTGAGATTAG TGAGTTTTTC GACGGTACCG CGCGAGCGGA TCCCTTGGGA ATCGACGATC CAATTGTCGC CGATTTGGGC GCGGACGAAG TCGCCAAGAT GACCGCGCCA TCCACGCCCG AGCAACACGG CATCGAAGGC GAGGGCAGTC CGTTCGCAAA AAGCAACATG AGTGACTCGT GCACCACGCT TCACGCCTCT TACGGTGACA AAAAGATTAA CGTCTCAGAG ACGCCGGTGA CGAAAGGAGC GCTCTCCGCC TGAGCGTAGA GGCGATAGAT TACACAACAT TTTCCACCAC CGTCCTCTTC GGTGGGTCCA CTT
|
Protein sequence | MPACAMGNDA AEPYASAAGR ATQEVRKASD ATRGGRGEIR GETPDLPLSY ARNGGTCSTS DDSRPSSRMA MGKDGNSRAS GDGTTSGMEE ERRRAREATR AWFSERAANG DLELMRAELH SMGQKELQRM FVEMFDRATT SNNNQWLRRR IANGLGLEDV AEHVVSSQAK LASATNERVP SKRQSARRNA VETAAVTPAA QATPNTRGAE SPDEAAEFTP DGVRKSRRAV KPKAIFDSDA FPSAAKVSEE RRRERKRQEA LEQAATCASG VTTDHHGGKA AVGRRVRVYW PLEGKFFAGV VTAYNARTGL HHIDYDDGDK EEIKLATPEQ PRKFEAIEHP ALASTPSPAS DWPALTNTVD SSISLSMPQP GRPADILSTL PSSWPAVGSL VWGRVRGHGW WPGAVHDKDA SHDMQEISFF DNSRARLHRH DLLPFQQYYM VLRDAKKTHA YAEAVSRAAE TYESRRQRTV KRRSKKEEQS NVEESSEPPR IWHFEREGAK KSHKRSKDVD VDDAGAAKRG KVNEHSTTVF GSIEAPKTLN DLNKSLEEMK AKMLPLAKES RKTLNKHLVD TARKVKTEAA DDDEETLIAA DERVVVLDEL ASIESLIAWS ENKAAKSPAK TPDLSFMTKR GDDISPSDPK GLLRQPSENL LCLGEISEFF DGTARADPLG IDDPIVADLG ADEVAKMTAP STPEQHGIEG EGSPFAKSNM SDSCTTLHAS YGDKKINVSE TPVTKGALSA
|
| |