Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14529 |
Symbol | |
ID | 5000623 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 279302 |
End bp | 281069 |
Gene Length | 1768 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416044 |
Product | predicted protein |
Protein accession | XP_001416609 |
Protein GI | 145344167 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.32918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCGT ACGTCGCCGA TGCGATGACG TGCGGCGTCG AAGGACGGTT GCGCGACGAA GCGTCGTCGA GCGATCGGGA CGGCGGTGGC GCGCGCGTCG TCGCGGCGAT CGGTGACGGC GGTGATGGAT GCGCTGTCGA TGCGCGAGGA CTGGCGCTGG GCGAGGCCGC GGGACGGGTT GAAGCGATGG AACGCTCGAG CGCGGCGGAC GCCGAGGCGC TGGCGCGCGC GATGATAAAC GCGACGACGA CAGAGTTGTG GTTGAGGAAC GATTTTCGAG CGGTCGAGGA GCTGCTAATC CCGATATTCA GCGTTCGCGA CGACGACGAT GGCTGGCGAT GGATGACGCT GTGGGTCAAC GCGAATGGAG ACGCGCGAAT GGGCGGCGAC GATTTCTTTA CGCCGAAGTC GTTCTCTCTC GAGGCGGATG AGGAGAATCA GACGCACGTC GTCGTAATCG TCGACGATGC CGCGGTGAAA ACATATGTAA ACGCCGCGTT AAAGTTACAA ATGCCTCGGG GGGATGTATG GAGGCTGTCG CGTCCATATG CGCGAGCGCG CATCGTCCTC GGAGGAAGGG CGACCAACGC CGATATTGCG TGGTCGGGCA CGATATACGC TGTCGCGCTC TACCCGTTTG CGCTCACCGC GAGTGACATC GAAAACATTT ATGCAGCCGG TTTACCTCAC GCTACGCTCG AGACGTCGGG CGAAATTATC GATATTGACG TGAATCGAGG CGAGGCTATT TCAGCTACGT TTGACGTCGA GACACAAATT GAATTCGATA GCGATCGCCT CGACGCGCTG TGTGATGAGA ATGAGAGTTC TTGCGATTTG TACGTTGATT CTGAATTGAT CATCCTAGAG TGGCCACTTT ACGGGACGGT TTACTCGTGT GGATATCCGC GATCTCAGGG CAATGTACGG CGCCTGCCCG CGGACGAGCT GTGCTACCTT GCGACGAATG CTCCGAGCGG TGTAGACGAA CATTCGATGG TGTACCGCGA TGCATTTAGA TTTGAAATCG CCGGTATCTC TCAGTCACCA CGGTTCAAGA TTGCGGTTGA CGTCCACAAG GGCGTGCTTG CGTGTGACGT CAACATCCAT ATGGACGAAG GAGGCGTCGC TGTCACGACG CTTCAGATCA TGAACACTGG ACACGAATCC GAAGACACGG GCTTCATAGT GAACACATTT ATGGAGCATG GGGTATTTTT TTTTGAGAAT GGAACGACCG TACCGATTTA TGAGCGCATC GTAGGACACG CGTGCGGGGC TCAGCCCGCA TGCAAATATT ATTCAGTCGA TCAGCAAATG CGTCGATGGT GCTTAAACGT CGTTATCGTG CCGTTGCCGG AGTATTTCAA TGCGCAAGAT ACGTTCTTGG GTCCAGAATA TCCACCTTGG GAGATGAAGC CGACGAAGAT CTCATATTCG GGCTTTGTCG GCGAACACCT ATCAAAGCCG GCAACGTTGA ACATCACCGT GAGTGCCGTA TCGAACGAAC CGCGGCTTCG AGTGAACCGC AGTGTTTTTA CAGCAAAATA CCTCGATCGG GTGAGTATCG GAACTTTTCG AATAGACGCG GGTGATTACG ACTCAAGAGC GTCGCGCGTC GCTATTCGCT CCGCAAAAGC AAATTTGTTA TCAGTCTCCA ACCTGAGCGC CGTGGCGCTT TGTGAGAATC CGTTCGTCGC AGGTGATGGA ACCGGCAATG ATGAGATTGT CTTTGATGCC GTACCGAGCG TCATCGCCTC CGTGCTGA
|
Protein sequence | MRAYVADAMT CGVEGRLRDE ASSSDRDGGG ARVVAAIGDG GDGCAVDARG LALGEAAGRV EAMERSSAAD AEALARAMIN ATTTELWLRN DFRAVEELLI PIFSVRDDDD GWRWMTLWVN ANGDARMGGD DFFTPKSFSL EADEENQTHV VVIVDDAAVK TYVNAALKLQ MPRGDVWRLS RPYARARIVL GGRATNADIA WSGTIYAVAL YPFALTASDI ENIYAAGLPH ATLETSGEII DIDVNRGEAI SATFDVETQI EFDSDRLDAL CDENESSCDL YVDSELIILE WPLYGTVYSC GYPRSQGNVR RLPADELCYL ATNAPSGVDE HSMVYRDAFR FEIAGISQSP RFKIAVDVHK GVLACDVNIH MDEGGVAVTT LQIMNTGHES EDTGFIVNTF MEHGVFFFEN GTTVPIYERI VGHACGAQPA CKYYSVDQQM RRWCLNVVIV PLPEYFNAQD TFLGPEYPPW EMKPTKISYS GFVGEHLSKP ATLNITVSAV SNEPRLRVNR SVFTAKYLDR VMEPAMMRLS LMPYRASSPP C
|
| |