Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18125 |
Symbol | |
ID | 5005336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 425731 |
End bp | 427800 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | |
GC content | 51% |
IMG OID | 640420757 |
Product | predicted protein |
Protein accession | XP_001421289 |
Protein GI | 145354009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0142809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.181943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCACC GATCGTACCG TGTATGCAAT GGACGCACGC TCGCACTCAT CGCGCTCGTC GTGCTTGTCG CACGCACGCA GCGCGTCTTC GCTTCGCCGA CCGACGAGCG AAGATTCTGG ACGCCCACTG TCGTCGCCGT GGACGACGGC GTCGGTTCGA AGATCCGACG CATGGGAAAC GATGTGGAGA AATCGAAGTC TTTCGTGTTT GAGAACACGG TAGCGGAATC GCTGACAAGA GATGGCGTCG TCGCGGTGCG AGTGCGAGGC TTGAACGAAG CAAAGACACA GAGCGCAAAA CTCTTCATGG ATTGCGCGCG GGTTACGGAT AATGTTGCAG TAGAAGAGTA CGACGATGGA TCCAGGCGCG TGACGCTCGC CACGACGACG AAGCAGAGGG GGAGTGGTTT CGGGGAACCC ACGTCCAAAG CATGTAAGAA GTTCGAGGTC ACGACGAAAA TGCTTCGCGC GAAAGTTTCT GTGGCCGAAC AAGCAATCAC GGACGTGTTT CAGAGCAAGT TTTCAGATCT GAGTGAATTT GCGAACGGGG TACTGATGCA GGGAGAGAAC GGCGAGACCT ATAAAACTTT CTCGGAAGTA TTGAAAGACT GCGAGCACCT TGAACATTTT CACGGCTACA CGATTTCACC ACAACAAAAA TTGAAGAATA GTAAAGTGCG GACGATCGAA GAGCACACGG ATCAAGGTCT ACTCATCGCA TTCGTCCCGG CCATAATTGT CGATGCAGTG ACGGGAAGGC GCGATAAGTT TTCCTCAACC GGCGATTTTT ATATGACGCT TCCTGGGCAT CGCAAGGTGC TTCTCCATTT TGACACGCTT CCAGACGATG TCGTTATTTT TATGCTGGGT CAAGCAATCG AGCAACAAAT CACTCCCAAA CTTGCAAACG GCCTTGAGCT TCACGCTGTA CCACACGCAA TGGATATGCC CAATCTTAAA TCTAATCAGT TCAGATTTTG GTATGGGCGA ATGTTCCTCC CACCTGAGAA TGCCTTGGAC GAAAGCCATG GTCTCTCTTT TGGGACAGTG CGCGCTAACA TCATCGAAGA TTTCGCACGT GGGCTGAAGA CATCCGTGGG ATGTGGTCGT GAGCTCCTCT CCGTGGCAGG GAGCGGTTCA TGCGCTAGCA ACCAGATATG GTGCTGGATG CGATGCATGA ATCACTCAGC GCCGGCGAGC AATCCACTCA CATGCAGGAC GGGTCATAGT GTCCAATGTT TGAGCCAAAG ACTGGAAGTC TGGACGGATG CCGACAGTCA TGGTGATTAT AATCCTGGAT GCACCGACGA GACCAACGAG ACAAAGCCCG TCACCGATCC ACCGACCATC CCAGCTCGAG CAGCCTCTTG TACAGATGGA AACGCCTGGG AAACTTTTTT GAACCACTCC GAATATGAAA ACAGCTTGGA GCTGTTACAA GATAAGTTAT ATTTACTCTG GACTGTTCAA AATGGTAAGT TAAAAGCCCG AGTTGCCTTC AATGGCAAGA TTGGTTGGTT TGCTCTCGGC ATCAAAAACC TAGGAGGGAA ACACAACGGG ATGAACGGGG CGAACATCGT CATGGGCGTC CACGACCCTC ATCCAATGGG TCACGACAGC AATTTCGGTT CTCCTTATGT TGGAAAGAGC AGTGCGAAAC AATACAAAAT CCACGATCAT TCTTCTGCTT TTCGACATTG GAATGACACT GCTGACGTTG CCTCACCCTT CGTGAGTTCG ACGACGTTCA ATTCATGTTT CAGTTGTATG CGTTTCGAGA CGGCCTCGAT ACGGGGCGAG GCTCTGAATC TCACCTCTGG TATTAATGAA CTCATCTGGG CTGCTCATGA TGGCACTTAC CTCAAAGGCT ACCATGAAGT ACAAGGATTC GATCGGAGAG ATGCACGCGG GCATCTCACG CTTGACCTGA CGCGAAATTA CGGTCGTGTA ACGTGCGGCT TCTCAAGAGG CTGGGCTGCG GCGAGTCAAA CATGTGCTTT TTCGGCAGCA AGCACTCCTT CAGCGATAGT TTCTATATCC ATTGCTGTCA TTGCAATCCT CGTTATGTGA
|
Protein sequence | MRHRSYRVCN GRTLALIALV VLVARTQRVF ASPTDERRFW TPTVVAVDDG VGSKIRRMGN DVEKSKSFVF ENTVAESLTR DGVVAVRVRG LNEAKTQSAK LFMDCARVTD NVAVEEYDDG SRRVTLATTT KQRGSGFGEP TSKACKKFEV TTKMLRAKVS VAEQAITDVF QSKFSDLSEF ANGVLMQGEN GETYKTFSEV LKDCEHLEHF HGYTISPQQK LKNSKVRTIE EHTDQGLLIA FVPAIIVDAV TGRRDKFSST GDFYMTLPGH RKVLLHFDTL PDDVVIFMLG QAIEQQITPK LANGLELHAV PHAMDMPNLK SNQFRFWYGR MFLPPENALD ESHGLSFGTV RANIIEDFAR GLKTSVGCGR ELLSVAGSGS CASNQIWCWM RCMNHSAPAS NPLTCRTGHS VQCLSQRLEV WTDADSHGDY NPGCTDETNE TKPVTDPPTI PARAASCTDG NAWETFLNHS EYENSLELLQ DKLYLLWTVQ NGKLKARVAF NGKIGWFALG IKNLGGKHNG MNGANIVMGV HDPHPMGHDS NFGSPYVGKS SAKQYKIHDH SSAFRHWNDT ADVASPFVSS TTFNSCFSCM RFETASIRGE ALNLTSGINE LIWAAHDGTY LKGYHEVQGF DRRDARGHLT LDLTRNYGRV TCGFSRGWAA ASQTCAFSAA STPSAIVSIS IAVIAILVM
|
| |