Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_3730 |
Symbol | |
ID | 5005459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 509277 |
End bp | 510500 |
Gene Length | 1224 bp |
Protein Length | 408 aa |
Translation table | |
GC content | 55% |
IMG OID | 640420880 |
Product | predicted protein |
Protein accession | XP_001421302 |
Protein GI | 145354036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0269126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00961002 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATTGCGGGCT CCATCGGGCC TGAAGCTGGC GCGCGCTTCG GCTCGGCCAT TTGCGTAAGA TTAGAACAAG AGCAAATGGA TGGAAAGGCG AGATCATGTA GCAACATAGT GAACGTTCTC GCGAGATTGT ACACGTGCGG GCTGTTTCCG TCGGCGTGTT GTTACGGTTT TCTAGTCACC TTGGCCAAGA GTTTGAGTGA GCTCGACTCT ACGCTCATGC TCACCTTGCT TCGCATCGCA GGGAACCGCT TGCGTTCAGA AGATCCAGTG GGGATGAAAG AGTTCATCTT GGCTTTGCAA GCACGCGTGG CTGAGTTACA GAAGGAGCGC GGCGACGGCG AAGACGGCGG CCAACTCTCA AAGCGCGCAC GCTTGATGCT CGAGATGGTC ATCGATTTGA AAAACAACAA GAAGCGAGAC AGCGCACAAG ATGTCGGTAA AGATCAGTGG GGCTTCCCTG TCGCGCTCAG CAAGTGGTTG CGGGGGACAA ACGTGGGCGA GGCTACCGTG GCGCTTCGCG CATTGACGTA CGAAAAGCTC ATCAAAACTG AGAGTCAGAA AGGCCAGTGG TGGTTACCCG ACGCCGCAGG AACTGCAGAG TGGTTCGCGG CGCGTGCAGC GCAAGGTGCG ATCACCGAAC AAGCTGGGAA GACACGCGAG GGTGGTGAGT TGCTTCAACT TGCGAAGAAG ATGCGAATGA ACACAGAAAC GCGTCGAGCC ATTTTTTGCG TCGTCATGGG CGCAGATGAT TTTGCCGATG CTCTCGAGCG TTTACTGCGC TTACCACTCG CTGACAAGCA AGATCGTGAA ATTCCTCGCG TTCTGCTCGA GTGCTGTCTT CAAGAAAAGG CGTACAATCC GTACTACGAG GTTCTCGCCA GCAAATTGTG CGAGCGTCAG CGCTCGCATC GGCTGACGTT TCAGCTCTGC ATTTGGGATC AGCTCAAAGA GATCGACGAT CCGTCATCTT CGGTCAGACG CATATCGAAC ATGGCGCGAT TTTTTGCCGG ATTGGTGCTT TCAGGTGCGC TCGCACCGAC TGCGCTCAAG GCTTTGGAAT TCGGCGTCGA CATCGCGCCG CGCGTCGCGC TCCATCACAA GCTCTTCCTC CAAACTGTCC TAGACGATCG CTCGCGAACG TCTGCCGCAG ATAGCCTCTT CCAAAGGATT GCTGTCCACC CAGAACTCAT GTCCGCCAAG GCTGGCTTCT TACGACTTCT TCGT
|
Protein sequence | IAGSIGPEAG ARFGSAICVR LEQEQMDGKA RSCSNIVNVL ARLYTCGLFP SACCYGFLVT LAKSLSELDS TLMLTLLRIA GNRLRSEDPV GMKEFILALQ ARVAELQKER GDGEDGGQLS KRARLMLEMV IDLKNNKKRD SAQDVGKDQW GFPVALSKWL RGTNVGEATV ALRALTYEKL IKTESQKGQW WLPDAAGTAE WFAARAAQGA ITEQAGKTRE GGELLQLAKK MRMNTETRRA IFCVVMGADD FADALERLLR LPLADKQDRE IPRVLLECCL QEKAYNPYYE VLASKLCERQ RSHRLTFQLC IWDQLKEIDD PSSSVRRISN MARFFAGLVL SGALAPTALK ALEFGVDIAP RVALHHKLFL QTVLDDRSRT SAADSLFQRI AVHPELMSAK AGFLRLLR
|
| |