Gene Information |
Locus tag | OSTLU_86229 |
Symbol | |
ID | 4999586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 812131 |
End bp | 813624 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 58% |
IMG OID | 640415007 |
Product | predicted protein |
Protein accession | XP_001415938 |
Protein GI | 145341690 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
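The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. The helper names `gene_length` and `protein_length` are illustrative and not part of any database API. The sketch assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; both assumptions match the values reported here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Start bp and End bp taken from the record above.
length_bp = gene_length(812131, 813624)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length (1494 bp) and Protein Length (497 aa).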
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCGACGC GGGCGACGGA CGCGGGCGAC GCGGAGGCGG TGAGGACGCT GCTGGCGAGC GCGAGGGACG ACGAGGCGCT GGGACGCGCG CTGGCGATCG GGGACGTGTG CGCGATCGGG GACGGGACGG TGGCGTGCGA TGAAAACGGG AGGGTGGTGA AGATACGGTT GCGCGGACGA GGGCTGCGAG GGAGGCTGCC GGCGCGCGTG CCGAGGATGG AGCGGTTGAC GACGTTGGAC GCGAGCGATA ACGCGCTCGA GGGAACGATT TCCGAGGAGT TCTTGTTGGC GATGCCGAAT CTCGTGGAGC TGCATTTGAA TTTGAACACG CTCGAGGGAC CGATTCCGAA GTCGATCGGG ACTTTGACGT CGCTCGAAGT CCTGAATTTG GACGGCGGCT TCACAGAGAG ACAGAGACGA TTCGACGCGT CGGTACCCGT CGACAAGCGC GAAGTGTACG GATTTCCGGG GGTTCACTTT GGCAATCAAC TTTCGGGCAG CGTTCCCGCA GAAATCGGTA ATCTCGTCAA GCTCAGAGAG TTGAATTTGC ACAGAAATAT GCTCGGTTTG AAGACGGATC GTCTCGCTCT CCCGGCGAGC GTCGGGAATC TCGTCGAACT CGAATCCATG GACGTGTCCG GGAACCAACT CCACGGCGCC ATTCCACAAG AGCTCGCCAC ACTTCCACGC CTTCGAACGT TAAATCTGGA CGATAACTTC ATCAGCGGGT TTATACCGAG CGACTGGTCC TCCGCCCTGG CGCTCGAGTC CTTGATCGTC GAAGGCAACT TTTTGAGCGG AACTTTACCT CGATTCCTAC CGCGAAACTT GACGTTCCTC GACGTGCACG AAAATTTGCT TACGGGTTCG ATCAACGAAC TTTCTCGGCT AAAGCGGCTC GAGGGCGTGC TGCTCGATCG CTGCCGATTC ACGGGTCCGG TGACTCTATC CAAGGCGTCT CTGCCGAATC TCAAAATGAT TAGCCTGGCA AACAACGCTG GGCTTTGCGG TGCACTTCCA GACTTACCAT TCGCCACGAA GGAGCAAAAG GAAGCCGAGT GCGACAAACC GTGGGAACTT TGTCAACTTT GGCCGGAAAC AGCGGGCACG CTTCTCGGTT TGGGGTCATG TCAATGCGCT GCGCCAGGCG AAGGCTGTTC CATCTGGGAA CTCAGCGGAA GGAAAGAAAC GTGCTGCGAG GAAGGTACGT GTCTCAGCGT TCCTTTCGAG TTCGGTGGGA AGCAGTGTGT GCCATGTTCA GACGCGTATC AGAAATGTGG TGGGAAATCT TTCGACGGCC CAACTTGCTG CAAGCGCGGT CTAACGTGCA CTCGAATTGA CGACGACAGG TCCGAGTGCC TGCCCTGTAA CGCGCAGTGG GAGCAGTGCG CCGGCGCACT TTACGACGGA CCCAAGTGCT GTGCGCCTGG CAACTCTTGC GTGTACTTTG ATGCGTATTA CTCGCAGTGT CAACCTTCGT CGGTGTTGTT ATAA
Protein sequence | MATRATDAGD AEAVRTLLAS ARDDEALGRA LAIGDVCAIG DGTVACDENG RVVKIRLRGR GLRGRLPARV PRMERLTTLD ASDNALEGTI SEEFLLAMPN LVELHLNLNT LEGPIPKSIG TLTSLEVLNL DGGFTERQRR FDASVPVDKR EVYGFPGVHF GNQLSGSVPA EIGNLVKLRE LNLHRNMLGL KTDRLALPAS VGNLVELESM DVSGNQLHGA IPQELATLPR LRTLNLDDNF ISGFIPSDWS SALALESLIV EGNFLSGTLP RFLPRNLTFL DVHENLLTGS INELSRLKRL EGVLLDRCRF TGPVTLSKAS LPNLKMISLA NNAGLCGALP DLPFATKEQK EAECDKPWEL CQLWPETAGT LLGLGSCQCA APGEGCSIWE LSGRKETCCE EGTCLSVPFE FGGKQCVPCS DAYQKCGGKS FDGPTCCKRG LTCTRIDDDR SECLPCNAQW EQCAGALYDG PKCCAPGNSC VYFDAYYSQC QPSSVLL
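The sequences in this record are displayed in space-separated blocks of ten characters. A minimal sketch of how such a blocked sequence could be normalized before computing GC content follows. `normalize`, `gc_fraction` and `reverse_complement` are hypothetical helper names, and the input is only the first two blocks of the gene sequence, so its GC value is not the record's overall 58%. The gene sequence begins with ATG, so it appears to be given in coding orientation; the reverse-complement helper is included only because the gene lies on the minus strand of the replicon.

```python
def normalize(blocked: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence, used as a short example.
fragment = normalize("ATGGCGACGC GGGCGACGGA")
print(len(fragment), gc_fraction(fragment), reverse_complement(fragment))
```

Applying `normalize` to the full Gene sequence field and passing the result to `gc_fraction` would give a value to compare against the GC content field.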