Gene Information |
Locus tag | OSTLU_29431 |
Symbol | |
ID | 5006744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 220881 |
End bp | 222371 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | |
GC content | 61% |
IMG OID | 640422165 |
Product | predicted protein |
Protein accession | XP_001422687 |
Protein GI | 145356952 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
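The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 220881
END_BP = 222371


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1491 496
```

Under these assumptions the computed values agree with the listed Gene Length (1491 bp) and Protein Length (496 aa).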
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.386445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.32597 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGCG CGTCGATCGG GAATAAGTTG TTTGACGTCG TGAGCGTTAG GCACGCGGCG AGGGATCACG AGCGGGAGGC GACGTCGGGG TTGAGCGGCG TGCGGCGATT GACGGCGGAG AAGATGGAAT TGGACATGGA GCGATATCCA AAGGCGAGGT GCTTGGATGG GACTCCTGGG GCGTATTACG TCAATCTCGC GCCCATACGC GTGCGAAACG TAGACGACGA GTATCCGTCG GCGAAGCGGG GGAGCGTCGC GCGCGCGGGA GATTCGAGCG GGGGATCGGG AAGCGCGCGC GAGTTCGCGA CGTCTAAGAC GTGGGTCGTG ATGTTGCAAG GCGGTGGCGA GTGCACGAAC GCCCCAGAGT GCTCCGAACG CTCTGGGACG GAAAGAGGAT CGAGTGAACT GTTGCCGGAC GAGATCGTGT TTGATCGAGG CATCCAGGCG GTGACGGCGG ACGACGACGG CGAAGATTTG CCGTTTTCGC GAGCCAACAT GGTCACCGTG GGGTATTGCT CGGGTGATGT GTACATGGGG CGATCTGACG AGGCTGATGC GAGTGGGATG TGGCACTCGG GCGCACACAT CGTCGAGGCT GTTTTACAAG AGCTCGTCCG GGCGTACAAC ATAGAGGACG CGGACGTCAT CGTCTTGGCG GGCCGAAGCG CGGGGGGGAT CGGTTTGATC GCGCAAGTGG ACCAGTGGGC GGAACTACTT CGCACAAAGT TCAGCGCCAT AGCGCGGAGC ACGGTGAAAA TCGTCGGTGC GCCGTTTGCT GGGTTTCATT ACTTTCATAA CGATACGGAG GGCGCCGCCG ATGATTCGCT CAAGTACGTA CCGTGGGACG AGGCTTCGTT CAAGCAGTAC GTAGACTATT GGCACGCGAG CGAGAGCCTT CCCAAGGCGT GCGTCGAGGT GAATCAGGAC GCACCGTGGA GATGTATGGT GGCGGACTAT TCCTTCCCTC ACACGCGAAC GCCCTTATTC TTTTCGCAAG CGCTTCTAGA TTCCGTCGTA ATGCGGTTGC ACGACAATTT TGGCGGCGAC TTTACGCGAC ACAAGCAAGT CACGTTCGCG CACGAATGGC AGTCGCAGAT GCGTCGCGTT CTCGAACCTG CGATGTCACA CGCCACCGCC GGCGTGTTCG CGCCGTCGTG CTACATGCAC ACCGATTTCG ATGGCATCGT CATCGACGGT ATCTCCCATC ACAGGGCGCT CGCCGAGTGG GTGTTCGAGA ACAAACCGAT CCGTCTCATC GACGATTGCC GGGAACTGAT GTGCAACCCG ACGTGCAGAT CGCGCGATAA GTCGAGCACG CTCTCCAACG ATTTAGACGA CGGCGCGCTC GGACACGCGT TCGATCGCAA GCGCCGGAAG GACGAAGACG AGCTCTCCGC CGAAAAAGTC GCCGCCGAGC GCAAGACGGA CGACGCGCGC GCGCGACGCA AAAGCAACCG CCGTCGCGCT CGGCACCGTC CATCGGATTG A
|
Protein sequence | MAGASIGNKL FDVVSVRHAA RDHEREATSG LSGVRRLTAE KMELDMERYP KARCLDGTPG AYYVNLAPIR VRNVDDEYPS AKRGSVARAG DSSGGSGSAR EFATSKTWVV MLQGGGECTN APECSERSGT ERGSSELLPD EIVFDRGIQA VTADDDGEDL PFSRANMVTV GYCSGDVYMG RSDEADASGM WHSGAHIVEA VLQELVRAYN IEDADVIVLA GRSAGGIGLI AQVDQWAELL RTKFSAIARS TVKIVGAPFA GFHYFHNDTE GAADDSLKYV PWDEASFKQY VDYWHASESL PKACVEVNQD APWRCMVADY SFPHTRTPLF FSQALLDSVV MRLHDNFGGD FTRHKQVTFA HEWQSQMRRV LEPAMSHATA GVFAPSCYMH TDFDGIVIDG ISHHRALAEW VFENKPIRLI DDCRELMCNP TCRSRDKSST LSNDLDDGAL GHAFDRKRRK DEDELSAEKV AAERKTDDAR ARRKSNRRRA RHRPSD
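The sequence fields can be checked against each other by stripping the 10-character block spacing, computing GC content, and translating the coding sequence. The sketch below is a minimal, dependency-free example. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty, and it assumes the gene sequence is already given in coding orientation, which is consistent with it starting with ATG even though the gene lies on the minus strand. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G.
# Assumption: the record leaves "Translation table" blank.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGCGGGCG CGTCGATCGG GAATAAGTTG"))  # MAGASIGNKL
    # Last blocks of the gene sequence, ending in the TGA stop codon.
    print(translate("CGGCACCGTC CATCGGATTG A"))  # RHRPSD
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be a way to compare against the listed protein sequence and the GC content field.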