Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17947 |
Symbol | |
ID | 5004943 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 515227 |
End bp | 516356 |
Gene Length | 1130 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420364 |
Product | predicted protein |
Protein accession | XP_001421018 |
Protein GI | 145353434 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 67 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCAC CGCTGGTACC GTTCGTGGTG TGCGGCACGA AGAAAGGTTC GGTGCCGTTC GCGCGCGTGA AGACGAAATA CGGCAAGACG CACACGAGAA TTGACAACTG CTCGGGCGAT GTTTCGCTGC TGTTGTCGAC GCTCACGCGC GCGCTCGGAT GTGGTGGAAG CGTGTGCGAT GATAAGAAAT CCGTCGTCAT CGCGTTCGAC GAATCTTTCG ATCGCATCGT GAAAGTACTG AAGTCGTTCG ACCCGCGGTG CGTACGCGGC GTGCGCGGTG CCGAGACGAG CGTTCCCGTC TCGAGAGCGA GCGCGAGCGC GCCGACGAGA GAGCCGAGGA CAAAGCCGCG ACACGGTGCG ACGGGGAGTG GGAAAGGTGT GAGTGGTGCT GCGGGGCGTG CCGAACCGAT CGTCATAACG AAGCGAGGCG CGCATGAGAG CGCTAAACAG TCCATCGACG GATACCGCTT GCTCTTGAGC AGGTGGCCGT ACTGGGACGG CGACGTTTCG AGAATGTACG ACATGTATCA CAGACATCGA AGGCTTAATG ACGATGTGGT GATGTCGTTC GATGCAGACT CGATTCTGCA ATCCAGTGCG ACGGACACGA ACGCGATGAG CGACTTCGAC CGCGCGGCGC AATGCGCTTC GACGGAGGAT GCGCTTCGGA CGCTTGGCAT GTTAGCTATG CCAAGCGAGT TTCGTCAATC GCGGCTCGAA CGACAGCGCG AGGCCGCGAA GAGGAAAGAA AGCGTCGCAC GTGTGTCGAG TGGGCCCAAA ACGTCTTCCG CGGTTGTCGA CATTGATCCG TTTGCTGAAT ACTTGCAGCA CGATCGTGGG TTTGCGCCGG CGCGCGTTTC GTCAAAGCCC GCCGGCGTAA GTAGTCCACC GCCGTCGATG AGGACGGCGA AGCCCGCGCG AAGACCAACT GGAGGTTCTT ACACGAATGC TCGGAAGCAC GGTGCTTCGA CGTCGGTGTC GAGGCTCGCA CGTGTTCCAT CGTCCGAGTC CGATGACGAC ATTGTGCCGG CGCGCAGGCG CCGAGACTTC GACGTTGGTT TCAGGCGCAC GGCGAGCGGA AAGTTTACAT TTGGCGGAGA GGATCGAACG CAGACGCGCG TCGCATTTGA
|
Protein sequence | MSPPLVPFVV CGTKKGSVPF ARVKTKYGKT HTRIDNCSGD VSLLLSTLTR ALGCGGSVCD DKKSVVIAFD ESFDRIVKVL KSFDPRCVRG VRGAETSVPV SRASASAPTR EPRTKPRHGA TGSGKGVSGA AGRAEPIVIT KRGAHESAKQ SIDGYRLLLS RWPYWDGDVS RMYDMYHRHR RLNDDVVMSF DADSILQSSA TDTNAMSDFD RAAQCASTED ALRTLGMLAM PSEFRQSRLE RQREAAKRKE SVARVSSGPK TSSAVVDIDP FAEYLQHDRG FAPARVSSKP AGAPRLRRWF QAHGERKVYI WRRGSNADAR RI
|
| |