Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14900 |
Symbol | |
ID | 5001376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 117057 |
End bp | 119060 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | |
GC content | 69% |
IMG OID | 640416797 |
Product | predicted protein |
Protein accession | XP_001417149 |
Protein GI | 145345292 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.258569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTCG AGCTCGACCC CGACGACGTC TTCTACCGCG CCGCGGACGC GCGCGCGTCG CGCGACGACG ACGACGACGC GAATCGCGTG CGCGTCGAGG TGCACTTTTC GGGACATCGC TCGACGCTCG ACGCGAATTG CGACGCGGTG CGCGCGACGA TCGCGCGCGT CGCGCGCGGA CACGTGTGGG CGCGCGATGC GCCGCGCGTG GCGCCGATCG CGGGGTCGAA CGCGTGCGAG TGGTCGGTGT GGGTCGGAGA CGACGTCGAC GACGCGTGGT TGGCGGCGGC GTGCGCGCTC GAGACGTCGA GCGAGCGCGC GGACGTCGCG CGCGCGGTGC GAGTCTGGGA CGACGACGGG GAGTTCACGC TGATCGAAAC CGCGGAGGCG CTGCCGGCGT GGGTGACGCC GGAGCGCGCG ACGGGGACGA CGTTTTTGGT GAAGGGCGAG CTGGTGGTGC GGGACGGCGG GGCGACGGAA CGCGCGGGGG GCGGGGCGAA CGCGCGAGCG CGGACGAACG CGGCGTTGGA GACGCTCGAC GACGCGGAGA GGATGCCGGA GGAAGCGCAG AAGATTTTGA GGGCGAGAAC GGACGCGGCG ACGCGCAAGG CGAAGGAAAA TCGTCACGAC GCGTTTGCGA TTACGCCGAG GCGCGTCGCG CGCGTGCTGC GACGAGAACC GCAACTCGTG GCGGCGGCGA TCGAGAGTTT GCGAGCGCGC GATCCCGCGG GCGTGCGCGC GGCGGCGAAG ATGACGCACT TCGAGCCGGT GGATTTTCAC CCGACGCTGG TGCGAATGTC GCGATGGTTG TTCTCGGAAA TCAGTCGCGA GCGTTTCGAA GCGCCGGAGC GATATCCGAT GCCGCCAAAG AGCGCGGATG ATTTCATCGC GCGCGAGCTC GGGATGAAGA TCGCGTGCGG GTTCGAGATG TTACTCGCCG ATCGCGGACC GGTCGTCGAC GCGTCCGCGA GCGACGCCGA ACCGGTGGAC GATCGGGCGT GGACGACGTT TAAGGCGAGT TTGACGGAGA ACGGGTATTT TAGAAACGAA ATGGTTGGAA GCGCGATGTA CAGGACGCTC CTGGCGAACG CGGTGCGCGA GTACAACGCG TCGCCACAAG CGTCGACGTC TCGCCGTGCG CAACGTTCGG CGCCCGCCGA ACGCGTTCGC GAGATTCTCG CCGCGCCCGA GAGCGACGAC GACGCGTTGC GCGCGGCTTC GCCGAGCGAC GCGAGCGATG AGACTTGGCT TCTTGAAGCC GACAAAGCGC TGAACGAGGA GTTAGCCAAG CTAGAAAAAG AGCGCGAGCG CACGGTGTGC GACGCGACGA GGAGCGCGAG ATCGTTCGTC GAGCGCGAAT CCGGATTCGA AGGCGTGGAG ACGCGATCGC GGCGGGTGAA TCACAACGCC GCGCCGGGCG CGTGTCCGGG TGATTTAAAC ATCCCCGACG GCGATGCCGG GTTCAGCTTA GACGCTCGAA AGTTCCTCAG CGAACTCGGC AAGGCGCTCG CGATCGAGGA CGACGACAAG TTGCGACGGT ACCTGGACGC CGACGGCGCG GACTTTGATT CCGACGACTC CGACGACGAC TTCAATTTCC CCGACGACTC CGACTCCGAC GCCGCGCGTC GCCGCGCGCG CGACTCCGAC GACTTACCCG AGGACGATTT CTTCGCGTCG TCTTCCGATT CCCTCGACGC GGACTCCACC GACTTCGTCC ACGTCGGCGA CGACGCCTCC GACAGCGACG ACGACGACGA CGACGACGCC TTCGCCGCCC ACTACGACGC CGTCCTCCGC CGACAACTCG CGTCCACCGA CCTCGACGTC ACGCCATCCG ACGCCACCGA CGCCGACGTC TCCGCCGCCC TCGCGCGCGG TTTTCTCCAC AGCGCCTCCG CCGACGCCTC GCGCGCCGGT CCCGCCGCGA GTCTCCTCGC CGCCGCCGGC GTCCCCGCCG ACGTCGCCCG CGCGCTCCAC CTCCACCGCC CCGACGCCCC GTGA
|
Protein sequence | MRFELDPDDV FYRAADARAS RDDDDDANRV RVEVHFSGHR STLDANCDAV RATIARVARG HVWARDAPRV APIAGSNACE WSVWVGDDVD DAWLAAACAL ETSSERADVA RAVRVWDDDG EFTLIETAEA LPAWVTPERA TGTTFLVKGE LVVRDGGATE RAGGGANARA RTNAALETLD DAERMPEEAQ KILRARTDAA TRKAKENRHD AFAITPRRVA RVLRREPQLV AAAIESLRAR DPAGVRAAAK MTHFEPVDFH PTLVRMSRWL FSEISRERFE APERYPMPPK SADDFIAREL GMKIACGFEM LLADRGPVVD ASASDAEPVD DRAWTTFKAS LTENGYFRNE MVGSAMYRTL LANAVREYNA SPQASTSRRA QRSAPAERVR EILAAPESDD DALRAASPSD ASDETWLLEA DKALNEELAK LEKERERTVC DATRSARSFV ERESGFEGVE TRSRRVNHNA APGACPGDLN IPDGDAGFSL DARKFLSELG KALAIEDDDK LRRYLDADGA DFDSDDSDDD FNFPDDSDSD AARRRARDSD DLPEDDFFAS SSDSLDADST DFVHVGDDAS DSDDDDDDDA FAAHYDAVLR RQLASTDLDV TPSDATDADV SAALARGFLH SASADASRAG PAASLLAAAG VPADVARALH LHRPDAP
|
| |