Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29704 |
Symbol | |
ID | 5006881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | - |
Start bp | 146541 |
End bp | 148513 |
Gene Length | 1973 bp |
Protein Length | 518 aa |
Translation table | |
GC content | 65% |
IMG OID | 640422302 |
Product | predicted protein |
Protein accession | XP_001422907 |
Protein GI | 145357400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.0581528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00877603 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | CGCGGCATCG CGTCGGCGCT TCGCGAATGT CGCGCGCGAC CGACGCGGCG ACCGGCGCCG CGTTCGTCGC GATCGACGAC GACGACGGGA CTCCGACGGT GGTGTTCGCG ACGCTCGCGG GCGACGACGG CGCGCTCGCC GCGGAGCTGC GCGCGATGTG CGCGAACGTC GTCGACGGGA GATTCGCGTT TCCCGAGCGT CGCGCCGCGG CGCGCGCGCG GACGACGGAC GGCGCGACGA CGACGACGCG ATTCACGTTT TGCCTGACGC GCGCGGACGG CGCGCGGACG ATCGGGGCGG TGCGACGCGT CGGCGCGAGG CGGGCGGCGG CGGCGCTGAC GCGGACGGCG CGGTTCGAGT ACCACGACGA CGTCCTGCGA CACGCGATGA CGCGCGCGAT GGAGGCGCTG GACGCGACGG CGCGCGGGAC GACGCGGGAA CTCGCGCGGG GGGACGCGCT GTGGGAGTAC CTGGATCGAA CGATCGGGAC GCGGGCGCCG ACGGCGGGGA CGACGGCGGA CGTCGGGCTG CCGTGGACCG AGTTCGCGGG CGTCGCGACG CAGAGCGTGC GGCTGCGAGC GCCGGATCTG AGGAAGGAGT TTAACGGGGG GATAAAATTT ACGGCGCTGT TGGACTGCGG GCACGTGGAG GCGGTGCTGG CGCTGTTCGC GGCGTTGGTG ACGGAGCGAC GGGTGGTGCT GACGAGCGGG CGGTTGGAGG CGGTGAGCGC GGCGGTGCAC GCGGCGAACG CGGCGCTGTA TCCGCTGAGT TGGCAGCACA TATTTTTACC CGTCGTTCCG GAGGCGTTTT TGGATTATTT GACGGCGCCG ATGCCGTTTT TGGTCGGTTT GCATTCGAGT CTGTCGGAGG ACGCGAAGCG ACTTCCGTGC GACGACGTCT TTACGTTGAA TCTCGACGAC GGATCGTTCA CGTACTTTGA GGAAGATTTC GAGGCGATTC CGTCCGGTCC GTTCACGCTC CTTCGCGTCG GGTTGCTCAG GGAGATCGAG CGCACGCGAG GTGAAGACTC GCAAGCGGTG GCGCGCGTGT TTCGAACGTT TTTCTCGAGC GTGCTCGGGC CGTACAAGCA GCACATCAAG GGCGTCGTCG CGCATCCGCC GCCTCGAGAC GCCATCATCG CCGATAGTTT ATGGCTCGAC CAACGAGGAC TCGAGCTCGG AAACTTGAAG CACGCCGGCA TTCTCGCCGC GATGCGAGGG ACGCAAATGT ACGAAGTCTT CGTGCGACAG CGATTGCGCA TGTGCGCCCA GGTGGCGAGG AAAACCGGCT TCGTGCCCAT GGGGGACGAA ATCGTCGATT TCGATCTCGA AGAGGGCGAG TTGACGCTTT CTGATTTAAT GATGCGAGGT CAAGCGTTGT CCGAGCAGTT CGCCTCCGCT TCGTCGCACG CGTTCTCCGT GGGAAGCTCC GCGCTTCGCG AGACGATGCG AAAGGCGAAG AAGGCGTACT CGACGTCAAA GACGATTCAG GGCATGCGCG AGGCGTTCGC ATCGAAACGT TCGCAGCTCG CACACTCAAT AGGAAAACTC AGAGAAAAGA GTTCGGAGGA TTTGCAGGCG AAATGGGCGG AGTGGACGGA CGAGGGAACG GCGAAGAATT TGAGTTTCGC CGAGGCGTCG ACCGCAGAGG CGCCGGTTCC CGCGCCGCCG CAGACGCAAT CACCGGTGGT CGAGCGCGAA GCCGTCGAAG TGAACGAACC AGTGGCGATG AGCGAGCCCA TCGTCGACTC GCTCGCATCG GCACCGACGC CGGCTCCGAC GCTCGCGTCG GCACCGCCGC CACGCGTCGA ATCAGTCTCG GGCGCGGCAA ACATGATGGC GAACTTGATG TCTTTCGACG ACGCCGGAGA CACGCTCGCC CCCGTGACGC CGTCTAAGCG CACGGGCGCG ACGATGGCAG ACGTTGACGA GCTCCCAAAC TTAATCGACA TGTAGAAACG TAGGAATAGC GCG
|
Protein sequence | MEALDATARG TTRELARGDA LWEYLDRTIG TRAPTAGTTA DVGLPWTEFA GVATQSVRLR APDLRKEFNG GIKFTALLDC GHVEAVLALF AALVTERRVV LTSGRLEAVS AAVHAANAAL YPLSWQHIFL PVVPEAFLDY LTAPMPFLVG LHSSLSEDAK RLPCDDVFTL NLDDGSFTYF EEDFEAIPSG PFTLLRVGLL REIERTRGED SQAVARVFRT FFSSVLGPYK QHIKGVVAHP PPRDAIIADS LWLDQRGLEL GNLKHAGILA AMRGTQMYEV FVRQRLRMCA QVARKTGFVP MGDEIVDFDL EEGELTLSDL MMRGQALSEQ FASASSHAFS VGSSALRETM RKAKKAYSTS KTIQGMREAF ASKRSQLAHS IGKLREKSSE DLQAKWAEWT DEGTAKNLSF AEASTAEAPV PAPPQTQSPV VEREAVEVNE PVAMSEPIVD SLASAPTPAP TLASAPPPRV ESVSGAANMM ANLMSFDDAG DTLAPVTPSK RTGATMADVD ELPNLIDM
|
| |