Gene Information |
Locus tag | OSTLU_15435 |
Symbol | |
ID | 5001706 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 412662 |
End bp | 414497 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | |
GC content | 57% |
IMG OID | 640417127 |
Product | predicted protein |
Protein accession | XP_001417762 |
Protein GI | 145346576 |
COG category | [R] General function prediction only |
COG ID | [COG1161] Predicted GTPases |
TIGRFAM ID | |
|
|
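The coordinate and length fields in this record agree with each other, which can be checked with a few lines of arithmetic. The sketch below is only an illustration. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 412662
end_bp = 414497
reported_gene_length = 1836     # bp
reported_protein_length = 611   # aa


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Both checks pass for this record: 414497 - 412662 + 1 = 1836 bp, and 1836 / 3 - 1 = 611 aa. The shortcut only applies because the gene is listed as not spliced.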
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.183176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.664844 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA CCGTGTTCAC GGTCGAGAGT CTCGGGTTAA AGTCGACGAA GACGCGGGCG AGCGAGAAGA AGCGCGGGAT CGGTAGCGGC GGAGCACACC GAAGAGGGGT GACGACGGTG CTCGCGGCGG AGGCGAGCTA CGACGTGGAG CGACGCGTTC GAGACGGCCG ACGACCGTTG ATGAGCGAGC CGAGAGGGTA CGACGAACGA ACGCCGGACA TGTTTCAAGA CGACGCACGG TCGTTGAGGT TGATGCCGGC GCGGCCGAAA TGGGATTATG AATTGAAAAG AGGACGGTTG CACGCGAGAG AACGAAAGGC GTTCGTGAAG TGGTTGCGAT CAGCAAAGGA GGCGATGATC GAGGCTGGGG GGTACGCACC CGCGTTTGAG CAAAACATAG AGGTTTGGAG ACAGTTGTGG CGGGTGTTAG AGCGCTCAGA TGTGGCTGTG GTGGTCGTCG ACGCGAGGAA TCCGATGTTG CATTTGCCGC CAGCGTTGTA TGCGCACGTG ACGCGGCGAT TGTGCAAACC TCTCGTCGTG GTGTTAAACA AGACGGATGC GGTACCGATG CGTGCGATCG ATGAATGGGC TGCACATCTT ATGGCGTCTT TGCCGGGAAT CGACGCAGTT GTCGGTTTTT CTTCCCGCGA CGAAGCACCC GAGGACGAAC GGTTTTGGGA TCGCAAGGAC AAAAATCACG AGGAGCGAGA CAACGCGTCA GTGCGCATGC ACCGTCAATC GAGTATTCCC ATCGGACGCG ACGCGTTACT GCGCGTGTGT CAAGAGCTCG CGCGTTCGGG CAAACGTTTC GACGCGGCGA TGGAAGACAC GATGGAAGAC GAACATGATG GAAACGACGA CGGCGAAGAA GAAGGGGAAG AAGAGGAAGA AGAAGAAGAG GTGGACGAAG AGGGCGAAGC GTACGTGCGG CAGGCGTTGA ATCAAGAGCG CGAGGAAGCA GAACGAGTGA AAGCTGAAGG CCGAATAATG ATAGGACTCG TCGGGCACCC CAACGTGGGT AAGTCTTCGA TGGTAAACTC TATTCTTCGA CGTAAAGCGG TGAGCGTGAA GGCGACGCCG GGACACACGA AGACATTGCA GACGCTCATT TTGGACGATC ACACGTGCTT GTGCGATTCT CCTGGCCTAG TATTTCCCCG CATCGATATT TCTCCCGCGG AGCAAATCAT CGGCAATCTC ATTCCTCTTC CAGTCGTCCG CGAGCCATTC AGCGCTATTC GTTGGATCGC CGAAGCAAAG CTCGTCGGTG CGGAGCGGTG GCAAGCGATT CAGCGCAAGT TCAGTGGGTC AAGTGATGGT AAGCTCGCGG CATCGCTCGC CGCGCCTATT ACGAGCGTGC TGAAAGTTCG ACCATCGAAG GAGTTCGATC CTGAGACGCT CGAGTTGCTG AACAACGAAG ATTTGACAAA CGACGAATTG CCGTGGTCGC CGCTCTCGCT GTGTCAAGCG TATGGCAAGA TGCGAGGATT CGTGAAAACG CGCGGCGTCG ACGTCCAACG CGCGGGGCAA GTCATTCTGA GCATGGTTTA CGACGGCAAG ATTCCGTACG CGATTCCGCC GCCGACCGGC GCGCCTGTGA TGCGTTCGAA GGCGGATGTC GTCCAAATCG AAGAAGACAA CTTAACCGAC GACGAGTCGG ATGACGACGA AGTCAACGTC GTACGCTCTG GGTTCTCCGC GCTCGCCATG GATTCGGACG AAGAGACAGC GGCTCAGCAG TACGCTTACA GCTCGGACGA GGACACGAGA GGGACATTCA TTTTGAGCCC TGAAGATCTC 
AAGGAATCCA AACGAGGCCA GCGCGGTAAA AAGTAA |
Protein sequence | MATTVFTVES LGLKSTKTRA SEKKRGIGSG GAHRRGVTTV LAAEASYDVE RRVRDGRRPL MSEPRGYDER TPDMFQDDAR SLRLMPARPK WDYELKRGRL HARERKAFVK WLRSAKEAMI EAGGYAPAFE QNIEVWRQLW RVLERSDVAV VVVDARNPML HLPPALYAHV TRRLCKPLVV VLNKTDAVPM RAIDEWAAHL MASLPGIDAV VGFSSRDEAP EDERFWDRKD KNHEERDNAS VRMHRQSSIP IGRDALLRVC QELARSGKRF DAAMEDTMED EHDGNDDGEE EGEEEEEEEE VDEEGEAYVR QALNQEREEA ERVKAEGRIM IGLVGHPNVG KSSMVNSILR RKAVSVKATP GHTKTLQTLI LDDHTCLCDS PGLVFPRIDI SPAEQIIGNL IPLPVVREPF SAIRWIAEAK LVGAERWQAI QRKFSGSSDG KLAASLAAPI TSVLKVRPSK EFDPETLELL NNEDLTNDEL PWSPLSLCQA YGKMRGFVKT RGVDVQRAGQ VILSMVYDGK IPYAIPPPTG APVMRSKADV VQIEEDNLTD DESDDDEVNV VRSGFSALAM DSDEETAAQQ YAYSSDEDTR GTFILSPEDL KESKRGQRGK K
|