Gene Information |
Locus tag | OSTLU_17129 |
Symbol | |
ID | 5004094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 441331 |
End bp | 442995 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 68% |
IMG OID | 640419515 |
Product | predicted protein |
Protein accession | XP_001420177 |
Protein GI | 145351640 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.463695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.719168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGC GCGCGCGCGC GAGCGACGCG AACGCGGACG CGGACGCGAA CGGCGAGGCC GTTTGGCTCG AGCGCGTCGT CGCGCGCGGC GTGCGACGCG CGCGCGACGC GAGCGGACGC GCGCGCGAGC CGAGACTGCG GGACGTGTGC GCGGCGGTGG ACGAGGAGGT CAGGCGGGCG CGAGGGGCGG ACGCGGCGGC GCGAGGGCGC GCGAGGGCGA TGGCGTTCGC GCTCGGGCTG GACGTCGAGA CGACGTGGGA GGAAAAGGCG GCGGCGTTGG AGCGCGGCGC GGCGCGGGGG GAGGCGAGGG GGGCGGGCGA ATCGGCGACG ACGCGCGAGA TGAGCTCGAT CGAGGCGCTG GTGGCGGTGG ACGACGAGGC GTATTTGGAG ACGACGGTGG GCGACGGCGC GCGAGTCGCG ACGCCGACGG GAGCGGATTC GGAAGGCGCC ACGCTCGACG ACGACGAGAG CGAGCTCATC GCTCGAGCGT TGTGCGAAAT GAACGCCATT AAGCGCGCGT TCGCGGCGTG GAGACGCGAG GCGGTGAAGG AAAAGTATCG GCGAGCGAGA GAACGCGAAA TCGAGTCGTT GTGCGCTTTC TCCACCGGCC GCCGCGCGCG CAAGATGGCT CTGGCGGCGA TTGGCGCGTG GCGCACGAAT GCGCGCAAAC GTGTGCGATT GCGCGCGTTC GCGGCGAAGC GCGCTCGACG CGAGCTCGCG ACCACGGGCC GAGCGGTCGC TCGCGCGTGG GCGCGAATCG CGCGAGACGA CGCGGCGAGA CGACTCGAAG CCTTCAGGCA GACTCAAACG CGACGAAACC GTCGAGCGAT GCTCGCGGCG TTTAACGCGT GGCGTTCGCG CGCGAGAGCG CGCGTTGCGC GCGAGAGCGC GGACGTGTGG CGACGAATCA AACTCGAGTC GACGGCGCTG GATACTTGGT CTGCGGTCGC GCGGGCGTGG AAATCTGAGC GACGCGCGAT CGATGCCGCA CGTCTCGCTC GCCAGCGCTA CGTCAAACGG TCGGTCACGC GTGTGTGGCG TGTAAGAGTC ATTCACGCCG AACGTCGCGC CGAACGTGCC GTCGTCGCGG TAAAATTTTC GCGTTGGCGT CGACGAGTGC GAGACGAGAG ACGAGAGCGC GAGCGCATGC GATTCGCTTC AAGCTACGAC GACGCGCGTG TCACGGTCGG CGCCTTTGCG CGGTGGTGTG CGGTCGTCTG GGCCGCGAAA GAGCACCGAG CCGAGCTGGC GCGTGCGGCA AAATTTCATC GCCTGTCGCT CGCGGCGAGC GCGTTTTACG CCTGGCGAGC GGCGACGAGT CAAGCGAAAC GAGAGAACAG GTCACTAAAA ACACGCGTAT TCGCGCATTG GAGATCAATG AAGGAACACG CGTCACAACT GAACGCGCGC GCGACGTTTT ACCGCGATTC ACAAGCGATC ACGTTCAGCG ACAAGTATCT GGACGCGAAT TGCTTTGCGA TGTGGCGCTC CTTCGTCTTC GCGCAGCGAC GACGATACGC CGCAATGGAA CTCGCAGATA CGTGGCGCGT GAAACGCGCA TTCGCGCCGT GGCGCGCTCG CATCGGCGCG GCGCATTTCA GCGACGACGT CGAAAACACC GCGCCCCCCG TCGCCGTCGG CGAAATCGAG TGGCAACGTT TTTAA
|
Protein sequence | MAARARASDA NADADANGEA VWLERVVARG VRRARDASGR AREPRLRDVC AAVDEEVRRA RGADAAARGR ARAMAFALGL DVETTWEEKA AALERGAARG EARGAGESAT TREMSSIEAL VAVDDEAYLE TTVGDGARVA TPTGADSEGA TLDDDESELI ARALCEMNAI KRAFAAWRRE AVKEKYRRAR EREIESLCAF STGRRARKMA LAAIGAWRTN ARKRVRLRAF AAKRARRELA TTGRAVARAW ARIARDDAAR RLEAFRQTQT RRNRRAMLAA FNAWRSRARA RVARESADVW RRIKLESTAL DTWSAVARAW KSERRAIDAA RLARQRYVKR SVTRVWRVRV IHAERRAERA VVAVKFSRWR RRVRDERRER ERMRFASSYD DARVTVGAFA RWCAVVWAAK EHRAELARAA KFHRLSLAAS AFYAWRAATS QAKRENRSLK TRVFAHWRSM KEHASQLNAR ATFYRDSQAI TFSDKYLDAN CFAMWRSFVF AQRRRYAAME LADTWRVKRA FAPWRARIGA AHFSDDVENT APPVAVGEIE WQRF
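The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS includes a single terminal stop codon (the gene sequence above ends in TAA). The helper names gene_length and protein_length are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 441331
end_bp = 442995
reported_gene_length = 1665      # bp
reported_protein_length = 554    # aa


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported Gene Length and Protein Length. The strand field is not used here because it does not affect either length.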