Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28172 |
Symbol | |
ID | 5006088 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 366282 |
End bp | 368373 |
Gene Length | 2092 bp |
Protein Length | 568 aa |
Translation table | |
GC content | 66% |
IMG OID | 640421509 |
Product | predicted protein |
Protein accession | XP_001422048 |
Protein GI | 145355603 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0175348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTGGG GCACGACCAG CGCGTGGAGC GCACAGGTAC GATCGTAGCG CGCGATTGAA AAATCCTCAC GACGCGCGTC GCGCGTCGCG GACGCGCGCG ACGCGTCGCG CGCGCGTCGG CGGCGCGAAA TGCATCGAAC GACGCCCGAC CGATGGCGGC GCCGTGGGGC GACGCGTGCG CGTCGCCCGC GACGCGCGCG ACGCGCGCGC GCGCGCGATT CGCGCGCGAG ATGATGGAGA AATTGCGCGC GCATCGTTCG CGAGACGCCG GGGCGACGAG GCGCGCGATC GAGGGCGCGG CGCGGTCGAG ACGGTTCGCG CGGAGATCGC GGCTCGGACG TGGGCTCGAT TTTTAAAAGC TAAACTCGAC GAAAACGAGC GCGCACTGAC GGAACGACGC GCGCTTTCGC GATTCACGCA GACGAAGGAA GGAAACGATG GAGTGGAATC GTTAGCGGTG CCGGCGCAGG CGTTCCCGTC GCTCGGCGGG GAGGCGGCGG AAGCGGACGC GAGCGGAGCG TTTCCGTCGC TCGGGGCGGC GGCCAAGGTG AAGGAACCGA AGAAGAAGAC GAAAGGGGTG AAACTTTCGC TCGCAGAATT CGCGGGTGCG TCGGGTGGGT CGTCGTACGC GCCTCCGGGA CGAGTCGGTG GATCGGCGTT CGGTCGAGGA CGCGGCGACG ACGTCGTGCT GCCGAGTGGG CCGAGCTCGC GACCGGACGA CGACGGAGGC GTCGGTCTCG GTGGTGGATT CAACCAAGGC GGTGATCGTT ACGGCGGTGG TGATCGTTAC GGCGGTGGTG ATCGTTACGG CGGCGGTGAT CGCAGTGATG AGCGTCGTCC GGGTCGGTAC GATCGAGACG ACCGCCGCGA CGGTGAATAT GGCGACAGAG AGGTTCGCGA AGAGCAAGGT CCTTCTCGCG CGGATACGAG CTCTAACTGG GGCGCGGACC GTCGCGCGCC GCCGCAAGAT GATCGTTACG GCGGCGACAG GCGTGCAACA GGATTTGGCG ACCGTTACGC CGAGAGAGGT GGCGACCGTT ACGGCGGCGG CGACGATCGC TACGGCGACC GCGAGGAGCG CAGACCGGGA TTTGGCGATT TCGCAGACCG GCCGCCGCGT GAGGATCTCG GACCGTCGCG CGCGGATGAA AGCGACAGCT GGAGTAGAGA CAGAAAACAG TTACCGGAGC GTGATTCGCG ATACGACGAT CGACCTCCGC GCGAAGAACG CGACCGCGAC TGGGGCTCTT TGCGTTCTCG CACCGCCGCC GCCGAGCCGC CGTCGGATGA TGGTTACGCT CCGCGAGGCG ATCGCCCGAA GCTTCAGCTC AAACCACGCT CGGAAGCCGC TCCTACGAGC GCGAGCGCGG GCTCAAGCTC GCTCTTCGGT GGCGCTAAGC CCGTAGACGT CAAGTACGTC GAGGACAAGC CGCGCGAAGC GATTCCGATT CGCTCGGAAG AAAACCCCGA ACGTACCGAG CGCAAGCAAT ACGACGAACC GAAAGATTCC GATGGCGACA GATGGGGCCG CAAGTCTTCG TTCGCCCCGC GCAAAGAGGA AGAACTCCGC GAAGCGACTC CCGAGGAGCG CGCCGCGAGG CCCAAACTGA ACTTGCAAAA GCGTTCCACC GACGCCCCCG TCGGCGCCGC GGCGAAGAGC TCGCTCTTCG GCGGCGCTCG CCCGCGCGAA GAGGCGCTCA AGGAAGCCGG TCGCGATTGG CAAGCCGAAG ATCTCAAGCG CTCCGTCGGC GCGGTCAAGC GCAAGGAGTT CAAAGAGGAG AAGGAACTCA AGGAGAAGAT TCAAGCCGCC AAGGACGCCG GCGACGACGT CAAAGACCTC GAACTCGAGC TCACCAAGCT CTCCCTCGAG CTCGACGACA AATTTCGCTT CGCCAAAGGC AGACCAGAGC CCAAGGACGG CAAGGACGGC AAGTCCAAGG ACACACCCGC GCCCAAGGAC AAGCCCGCCG CCGCCGAGAA ATCGGCGCCC GCGCCTCGCG AGCGCGCCGC GCCCGCGCCC AAGCTCGCCG AACCCGCCCC CGTCGTCGCT CGCAACGCCT TCGACGCCCT CGCGGATAGC ATGAATTCGT AA
|
Protein sequence | MAWGTTSAWS AQTKEGNDGV ESLAVPAQAF PSLGGEAAEA DASGAFPSLG AAAKVKEPKK KTKGVKLSLA EFAGASGGSS YAPPGRVGGS AFGRGRGDDV VLPSGPSSRP DDDGGVGLGG GFNQGGDRYG GGDRYGGGDR YGGGDRSDER RPGRYDRDDR RDGEYGDREV REEQGPSRAD TSSNWGADRR APPQDDRYGG DRRATGFGDR YAERGGDRYG GGDDRYGDRE ERRPGFGDFA DRPPREDLGP SRADESDSWS RDRKQLPERD SRYDDRPPRE ERDRDWGSLR SRTAAAEPPS DDGYAPRGDR PKLQLKPRSE AAPTSASAGS SSLFGGAKPV DVKYVEDKPR EAIPIRSEEN PERTERKQYD EPKDSDGDRW GRKSSFAPRK EEELREATPE ERAARPKLNL QKRSTDAPVG AAAKSSLFGG ARPREEALKE AGRDWQAEDL KRSVGAVKRK EFKEEKELKE KIQAAKDAGD DVKDLELELT KLSLELDDKF RFAKGRPEPK DGKDGKSKDT PAPKDKPAAA EKSAPAPRER AAPAPKLAEP APVVARNAFD ALADSMNS
|
| |