Gene Information |
Locus tag | OSTLU_29968 |
Symbol | |
ID | 5000205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 293934 |
End bp | 295712 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | |
GC content | 59% |
IMG OID | 640415626 |
Product | predicted protein |
Protein accession | XP_001416121 |
Protein GI | 145342076 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
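The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 293934
end_bp = 295712
gene_length_bp = 1779
protein_length_aa = 592

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 1779 bp, which is 593 codons, giving 592 residues plus the stop codon.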
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.586507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGC GGCGCGAGCG CGCGCGCGCG CGGCGACGGC GCCGCGTCGC CCTCATCGCG TGCGCGTGCG CGCTCGCGGT GATCTGCGCG CGCGCGGTGA CGCCGCGACG CGCGCGCGAG CGGCGCGCGG ACGCGCGAAC GCCGCGAGAA CTCACGCTCG CGTCGCACGG TGCGTTTCTG GCGCTGGACG TCGAGTCGAA GGCGGCGCGC GTGATACATC GCGGCCGGGG GGTGTATTAC GGCACGTTCG AGGACGGATC GTCGTCGTCG GACGCGGTGT GGGTGGCGTC GAGGCCGGAT AACGCGAAGA CGCGGACGCG GACGGCGACG CGCGGGGACG CGTTGCTCAG AATAGACGCG AAGCGGGGGG AGATTTTAGA GGAAAGGGCG ATCGATGCGG CGTTCACGCA CGACGTCGTG CGACGAGGTG ACTCGGTGTT CGTCGCCGAC ACTGGGAACG GGCGGATTTT GGAGTTGGAG TATCCGTCGA TGAGGACGCT TCGGGCGGTG GAGTTGTCGG TGAAGGCGCA CGTGAATACG CTCGCGCCGG CGGACGCGAG CGAGTACGGC GAGCACGCGG TGTGGGCGGT GCTTCATAAT TTAGGTCCGA GCGAGGTGGC GTTGATCGAC TTGGAAACTG GGCGCGAGTT GCGACCGAGG TTGACCAGAG TGGGCACGAA GTCGCACGGG TTGGTCATAT ACGAAGATCG ATTCATCATG CTGAACAGTG GAGAGGGGCA ATTGATTTCC GTGGACCCGT CGGGTGTCGA AGATTACGAG ATTTTATGGA CCGATGATTC GCGGACGTTC ATGAAAGGGT TGTGCGTGAT CGATGACGTG GCGTTTTTCG GCGTTTCGGC GTTCGGAAGA AGAGAAGACC GAGGCGATCC AAACAAGTCG AGCGACGTGG TGGCTTTCGA TCTCGTTCGT AAGCGCGAGC TCTGGCGACA GACCGTGATG ACGCACGGAT TGCTCAACGT CATCGCCGCG CCGCAAATAG AGTCGAGCAC TTGGTCGCAC GCACATTGGA ATGCAAACTC GCCGTCGTTG TACGCCGAAT GGATTCCTCT GGCGGCGAAG CAAACGAACA ATTGTGAAGC GACCGATCGG AATTCCTTGC TGTTGACGTA CGACGACGTC GATGTACATC TGCTTCGTGA TTACATTCAA GGCCTCCCTC GCGACGTGTT CAAAGAGTCT GGCAAAAACG GAAACGCTTT GTTGGGTGGG AGAGACGGCA ACATGCAAAA GTTCAAGCCA AACGTCGATG GCATGCTCTT GTTTTTCTCA GATCGAGGTG GCGAACACGT TTTTGAGTTT CCATTTTGGA AACGTCTCGA GCCGTACGTG CAACCTGTGC TGATTGATTT GTTCACCGAT CAACTCGGCG TCTCGGATCC GTTACGACAC GTGATACGGC TACAGCTGGC GGTGATGAAT CCTGGTTCTG AAATTTTGCC ACACGTGGAT ACGGGCGACT GGGCGAGAAG ACACCATCGA TTTCACGTTC CAATCATAGT CCCGCAAAAC GCTGGAGCGG TGGAATTCGT CATGATGCCA GAATCCGGTC AGGAAATCGC CGTTCCGCTC ATCGAAGGTC GTCCATTTGA GATAAACAAC GCGGTGACGC ATCGAGTACG GAACAGCGCC TCGTCTTGGC GCATCCACTT ACTCGTCGAC TTTAGCGAGC AACCGACGGC GAAGAGGCAC GTGCTCAAGC CTGGGGACGT TTGTGACTAC GCCAAAATGG GCAGCGGTCG GTGTGTGGCT GTCACTTAG
|
Protein sequence | MTARRERARA RRRRRVALIA CACALAVICA RAVTPRRARE RRADARTPRE LTLASHGAFL ALDVESKAAR VIHRGRGVYY GTFEDGSSSS DAVWVASRPD NAKTRTRTAT RGDALLRIDA KRGEILEERA IDAAFTHDVV RRGDSVFVAD TGNGRILELE YPSMRTLRAV ELSVKAHVNT LAPADASEYG EHAVWAVLHN LGPSEVALID LETGRELRPR LTRVGTKSHG LVIYEDRFIM LNSGEGQLIS VDPSGVEDYE ILWTDDSRTF MKGLCVIDDV AFFGVSAFGR REDRGDPNKS SDVVAFDLVR KRELWRQTVM THGLLNVIAA PQIESSTWSH AHWNANSPSL YAEWIPLAAK QTNNCEATDR NSLLLTYDDV DVHLLRDYIQ GLPRDVFKES GKNGNALLGG RDGNMQKFKP NVDGMLLFFS DRGGEHVFEF PFWKRLEPYV QPVLIDLFTD QLGVSDPLRH VIRLQLAVMN PGSEILPHVD TGDWARRHHR FHVPIIVPQN AGAVEFVMMP ESGQEIAVPL IEGRPFEINN AVTHRVRNSA SSWRIHLLVD FSEQPTAKRH VLKPGDVCDY AKMGSGRCVA VT
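The sequences above are printed in space-separated blocks of ten letters. A minimal sketch of how such a string could be cleaned and sanity-checked follows. The helper names (`clean_sequence`, `gc_percent`, `looks_like_complete_cds`) are hypothetical, and because the Translation table field is blank in this record, the use of the standard stop codons is an assumption. The demonstration uses a short toy string, not the gene itself.

```python
# Assumption: standard genetic code stop codons (the record leaves
# the "Translation table" field empty).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Check length divisible by 3, an ATG start, and a terminal stop codon."""
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("atggcc tag")
print(toy, looks_like_complete_cds(toy), round(gc_percent(toy), 1))
```

To apply this to the record, paste the Gene sequence field into `clean_sequence` and compare the result of `gc_percent` with the reported GC content, which is rounded to a whole percent.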
|
| |