Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15947 |
Symbol | |
ID | 5002195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 804804 |
End bp | 805790 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | |
GC content | 63% |
IMG OID | 640417616 |
Product | predicted protein |
Protein accession | XP_001418337 |
Protein GI | 145347775 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.386834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGGA CGACGACGGC GACGACGCGG ACGACGACGC GCGTCGCCGC GCTCGGCGCG CGCGCGACGG GGCGAGGACG AGGACGAGGA CGCGACGCGC GGGCGCGGGC GAGCGGGGAC GCGCAACGCG CGGGCGCGGA CGGAGACGCG CGCGACGATG CGCGACGCGG CGGCGCGACG ACGCGAAAGA ACCGGCGGGG TTTGTTGGGC GCCGGGTTCG CGGCGGCGAC GGCGGGGCTG ACGGGGCTGA CGCGCGCGGC GAACGCGGCG CCGACGCTGC CGAAAGAGCT CACGGAACCG GATGAAATCT TCCGAGAAGA CTTCAACGTT CAGTTCGCGG GGTTGACGGT CGATCATAAA GATTTAATCT ACGCCCTGGT CGTCGGGCAG ACGATCGGAT TCGTCGGAAG CGCCGTCGGC GGCGCGGAGG CTCGTAAGCG CGCTGAGGAG ATCGAACGGT TGAACTCGAC GCTGCTGAAG GTGAACAAGG AAGTGCGAAA AGAGCTCAGG AGTAGTCAAG GGCGCAAAGT GGCGTTCTCC ACGATGGATT CCTCGGACGA AGCTTCGAGC GAGACGGTGC TCGAAATCAT CAGTTTGCTC AAGAGCGGCA AGTCTAAACT CAAAGCGCAA GCGTCCCAAG AGGCGAAAGA AACATTCACT AAGGCGCGTC AACTCATCGA TGCTAACCAG AGCGCGCTGA AGGAACCGTG GAAGGCTGTT CGAAAGGCTG AGCGCGGTCT CGGCGCGGCG TCGGCGCGCT TGGGCGAGTA CGACGAAGCC CTTTCTCACA TGAAGACTGT GTTGAAGTTA TCCACCGAGC ACGACGACAC GAGCGTCGCC ACGGACGCGT GCGGCATAAT AGCCGATATT TACGCCGAAA TGGATCAAAT CGAAGTCGCC GCGGATTGGT ACGACAAGTA TTTTGAATCC TTGGCCATCG AAGACGCCAA GGAAGCCGCC GAGGCGGGCT CGAGCTCGGC GCGTTGA
|
Protein sequence | MMRTTTATTR TTTRVAALGA RATGRGRGRG RDARARASGD AQRAGADGDA RDDARRGGAT TRKNRRGLLG AGFAAATAGL TGLTRAANAA PTLPKELTEP DEIFREDFNV QFAGLTVDHK DLIYALVVGQ TIGFVGSAVG GAEARKRAEE IERLNSTLLK VNKEVRKELR SSQGRKVAFS TMDSSDEASS ETVLEIISLL KSGKSKLKAQ ASQEAKETFT KARQLIDANQ SALKEPWKAV RKAERGLGAA SARLGEYDEA LSHMKTVLKL STEHDDTSVA TDACGIIADI YAEMDQIEVA ADWYDKYFES LAIEDAKEAA EAGSSSAR
|
| |