Gene Information |
Locus tag | OSTLU_119569 |
Symbol | Cup201 |
ID | 5000335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 584613 |
End bp | 585944 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 46% |
IMG OID | 640415756 |
Product | Conserved protein of unknown function |
Protein accession | XP_001416429 |
Protein GI | 145343653 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
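The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS includes its stop codon; neither convention is stated in the record, and the function names are hypothetical helpers chosen for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 584613
END_BP = 585944


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1332 443"
```

Under these assumptions the result agrees with the Gene Length (1332 bp) and Protein Length (443 aa) fields.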
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.434648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGACC AATGTACTAC AGAAAATAAG AGCGTTCAGT TAGACCAGTT GCAGCGGCGG TACCGTTCAC AAATACGACG GATAAAACAG TTACAAAGCA GCGGAGCGGA GAAAGTTCAG TTGCGCAACG CAGCAACAGT TCTTCGTTCG ATTAAAGATG ATATTCGCTT AGAGCACAAA AACAGGACGC GCCGGAAGGT CGTTGACAAC TGTGCAAATC TCTATCCACG GGCGTGCAGT ATACCTACAG ATGACGAGGG CTTTGTAACG TCCTTTCCTG CTGGACAAGT ACTCAGTGGT CCAGAGACGG ACAAACACGA ACTCTTGGAT TTTTTTAAAA CGTACGGGTT CGTAATCTTT CGCGATATCA TCGGTCCTGA AGAGTGCGAA GAAACTGCAA AAGAAATTTG GGACCACTTG GAGGCCAGGA ATCCCAGTCT TCAACGTGGC GTCCCACACA CTTATTCAGT GTTATCTTCC AAGACCTATG GCCTTGCACC CGAACCAGCA CTATTCACAG CACAGATGAT AAGAAACAGA TGTAACGAGT ATGTGGTCAA GGCACTGCGA CTTTTATTGG GGCACGCGGA TATATTATTG TCACATGACC GCTGGTGCTT CTACAGACCA ACAAAAAGTA TAAGTATAAA AAATAGCCAA CACTTCATGG ACATGCCCAC CTGGAAAACA CCGAGTAATC TTCACTTGGA TTTGAATCCT TGGATGTATA TAAATGGGAA TGTACCTTCA CAGACGCTGG ACTACAAAAA CTTGCGCGAT TTTAGCAAAG AGATGAACAG TGTTACACAG GTAACTGGGC CACACCTACA AGGGATTCTA TCGATCACGG AGAACAAAAA CGAAGATGGC GGCACGGTGC TCGTTCCTGG CTTCCATAGC GTGTTTTCAG ATTGGGTCGA ACATTTGGGG GCCATGAACA AATATACGAA CCACAACGAT TCCAGTACAA ATAGGCTCGT GTGGCGAGGT CACGGTGCAG GGAGCTTCAA GTTTGCGGCT GTGGACCCTA TTCACAATTT GAAACGCAGA ATTTCACTTC GGGCTGGTAG CTTTTTAGTC TGGGATCAGC GCATTGTTCA CGGTTCCGTA CCAAATAATA GCTCCAACCC TCGAATGGCA CAATTTATCA AGGCCTTTAA AAGTCACGGG ATATCCAAAC AGCAGTTCTA CGCGAGATCT AAAGCTATCC ACAAGCACAT GAAAGTAGCA AGAACGCTGA AATTAGATAC GCTGACGAGC GACTCACGTC GAGTGTTAGG TCTCGACCCT CATCTTAACA AACTAAACGG TACCAGCGGA GTCAGTATGT AA
|
Protein sequence | MVDQCTTENK SVQLDQLQRR YRSQIRRIKQ LQSSGAEKVQ LRNAATVLRS IKDDIRLEHK NRTRRKVVDN CANLYPRACS IPTDDEGFVT SFPAGQVLSG PETDKHELLD FFKTYGFVIF RDIIGPEECE ETAKEIWDHL EARNPSLQRG VPHTYSVLSS KTYGLAPEPA LFTAQMIRNR CNEYVVKALR LLLGHADILL SHDRWCFYRP TKSISIKNSQ HFMDMPTWKT PSNLHLDLNP WMYINGNVPS QTLDYKNLRD FSKEMNSVTQ VTGPHLQGIL SITENKNEDG GTVLVPGFHS VFSDWVEHLG AMNKYTNHND SSTNRLVWRG HGAGSFKFAA VDPIHNLKRR ISLRAGSFLV WDQRIVHGSV PNNSSNPRMA QFIKAFKSHG ISKQQFYARS KAIHKHMKVA RTLKLDTLTS DSRRVLGLDP HLNKLNGTSG VSM
|
| |
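The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, with a GC-fraction helper and a translation of the first ten codons. The Translation table field is empty in this record, so the use of the standard genetic code here is an assumption, and `ungroup`, `gc_fraction`, and `translate` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code (an assumption: the record leaves the
# translation table blank). Codons are enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(grouped):
    """Join a sequence shown in whitespace-separated blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field.
prefix = ungroup("ATGGTGGACC AATGTACTAC AGAAAATAAG")
print(translate(prefix))  # prints "MVDQCTTENK"
```

The translated prefix matches the first block of the Protein sequence field. Although the gene lies on the minus strand, the Gene sequence field starts with ATG and ends with TAA, so it appears to be given in coding orientation and is not reverse-complemented here.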