Gene Information |
Locus tag | OSTLU_17711 |
Symbol | |
ID | 5004863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 523653 |
End bp | 525911 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420284 |
Product | predicted protein |
Protein accession | XP_001420874 |
Protein GI | 145353115 |
COG category | |
COG ID | |
TIGRFAM ID | |
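
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for OSTLU_17711.
length_bp = gene_length(523653, 525911)
print(length_bp)                  # 2259
print(protein_length(length_bp))  # 752
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.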
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATCGTA CGAGCGTGAT GCGAGCACCC GAGCGCGGCG TCAACGACAC GTCGACGCTG ACTCCCACGT CGGCATCGTT ACTGCGGTAC GCCAATCACG AAGTCATTCG AGCGCTGTTG CGACGAGAAA ACGACGACGA GCACGGCGAG GATGACGATG ATGGTGATTT CGAATCCCTC GATCGACGCG CGAGGGAGAT CGAAATGAAA CTCGTGGACG TCGAGCGTCG CTCGATCGAC GCTCACGCGG ATGAGTGCGA GAATTTACTG CAACTGCAAG GCGAAGTGCA CGCGTGCGAC GGGATATTGA GCGACATGGA ACAAACGTTG AAGACGTTTC AGGACGATTT AGGGCGTATA TCGATGGAAA TTAGGGAGTT ACAGCGTTCG AGCGAAGATT TACGCGTGCG AAGTGCAAAC CGCGCGCGAG CTGAGCGAAA ATTAGGCGAC GCGGTGGAGG CGCTTTCGGT ACCGCCGTCG CTGATTAACG CCGTGTTTAA CGGCGACTTC GTCGGCGGTG ACGGGGGATT TGTCGAAGGC GTTCGCGAAC TCGAGGTGAA GTTGGATCAT CTCAGTGTGT CCGCATCGAA AGGCGCGGGT CGCGCTGTAA ACGACGTCGC GCCAGAACTC GAGCGACTGC GCGTCAAGGC GGTCGACCGC GCTTGGGCCT TCTTGTACGG TGAATTCGCG GCGTTGAAGA AGCCACGCGC GAACGTACAG TTAATTCAAG AAAATTCACT CGCGAGACAC GCGCCTCTGA TCGAGTTCCT ACGTAAGCGA GGACCAGAGG TGTATTGGGA GGTGAAGAGC ACTTACGCGG ATGTCGTGGG AAAAGTGTTG AAAACGGCGA TGACTTCGTA TTTGGATTCG CTCAAGCGCG TCACAAAAGC GCCTAGATCT CGAGTTTTGC TCGCATCCAG GACGCGCACG ACTTCCTCGA CGACGGCAAC AACGTCGGCG ACGACCGACG TCGCGTCGGC GGCGGCGAAC GCCTTGGCTG GAATGTTCGC GGGCATCACG TCGTCGCCGG CGTCTTCTAG TGCGTCCAAG ACTGATGAGA CGAATGGCGA CAGCTTGTTT GATCTTGGTA ATCGCGCGCG TGCCATGTAT GCGGCAGAGA CAGAGCCTCC GTTGGTGGTG CATCGGCAAT CGGACAAGGT GAGGCAAGAA GCGCACGCCT ACGAGGAGTT GTTCCGAAGC GCGCATCGCC TGCTCATAGA CACCGCGACG TTCGAGTATG CGTTTTGCGA AGTATTTTGG AAAGGCGAGC GCGAAATGTT TGAAATCGTT TTCACCGGTC CACTGATGGC GTACAACGAT TTCGTCGCCC AAGGCGTTGC GCAGTCTGGG CACGACGCCG TCGGGCTGTT GATTGCCATC CGCGTGAACA ACGCCCACAG ACGCGTGATG AATCGTCGCC GAGTGCCCGC GCTGGACGCG TACATCGACA ACTTGAACAT GATACTTTGG CCGAAGTTCA AGCACGCGTG TGATGCGCAC GTCAAGTCGC TCGAGGACAC GCGAGAGTCG TTCGAGCCCA ATCCAGAGTC GCCGAGTTTC ATCGTGAAGC GATACGCAAA CTTTGTCCTC GCGCTCACCA CAGTGGCACA TTCGCGCATG GGCGCATCTT CTGCGGATGA ACTTAACGTT ACGAATCAGG TTGATTTGTT TCTCGATCGT TTGCGACGGT CGATGTACGA CTGTGTTACG AGTAAGCTGT GCGCTTCGCT CAAGGCGTCG CCTCGCTCGC GAAGCGCGTA TTTGGTAAAG TCGTACGATC ACATCTGCTC GACGTTAAGC 
TCTTTGACGA ATCTCAACGA TGATGACGGT GTCGAGAGAT GCGCTGAGGA TGACGAACTC GCCACAGAAC TCGCGAGTCT GCACTTCTTT GAAGAAAAAC TCATCGAAGA GTCCAAAGCG TTTGTGTCGC ACGCGTTGGC GGAGCGCTTC CCGAGAATTA CGACGATTTC GCGACGCCGT CGGAGCGGTG AAGGCGCGAG CATTGATGCA CTCGTCATTC GTGACGCACT CGTCGCGTTT CAACGCGAAT GGCGTGATGC TTTGAAAGCG GCGCACGAAG ACTGCGTGTC GTGTTTCGGC GCCGCGCGCG ACTGGCGCGC CAACGACCTC TTCCACCGTT GCTTGGCGGA ATTGGTATCG ACGTACTTTG CCTTGATCGA CGACGACACC GATGAGACCA TCATCGTCAC CAAACCCACC TTCACGCTCG AGGGCAACCG CTACGTCTCG AGATCGTAG
Protein sequence | MDRTSVMRAP ERGVNDTSTL TPTSASLLRY ANHEVIRALL RRENDDEHGE DDDDGDFESL DRRAREIEMK LVDVERRSID AHADECENLL QLQGEVHACD GILSDMEQTL KTFQDDLGRI SMEIRELQRS SEDLRVRSAN RARAERKLGD AVEALSVPPS LINAVFNGDF VGGDGGFVEG VRELEVKLDH LSVSASKGAG RAVNDVAPEL ERLRVKAVDR AWAFLYGEFA ALKKPRANVQ LIQENSLARH APLIEFLRKR GPEVYWEVKS TYADVVGKVL KTAMTSYLDS LKRVTKAPRS RVLLASRTRT TSSTTATTSA TTDVASAAAN ALAGMFAGIT SSPASSSASK TDETNGDSLF DLGNRARAMY AAETEPPLVV HRQSDKVRQE AHAYEELFRS AHRLLIDTAT FEYAFCEVFW KGEREMFEIV FTGPLMAYND FVAQGVAQSG HDAVGLLIAI RVNNAHRRVM NRRRVPALDA YIDNLNMILW PKFKHACDAH VKSLEDTRES FEPNPESPSF IVKRYANFVL ALTTVAHSRM GASSADELNV TNQVDLFLDR LRRSMYDCVT SKLCASLKAS PRSRSAYLVK SYDHICSTLS SLTNLNDDDG VERCAEDDEL ATELASLHFF EEKLIEESKA FVSHALAERF PRITTISRRR RSGEGASIDA LVIRDALVAF QREWRDALKA AHEDCVSCFG AARDWRANDL FHRCLAELVS TYFALIDDDT DETIIVTKPT FTLEGNRYVS RS
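
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal standard-library sketch. It assumes the standard genetic code, because the Translation table field in this record is blank. It also assumes the Gene sequence is already given in coding orientation, which its leading ATG suggests. The helpers `clean`, `gc_fraction`, and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as shown in the record.
prefix = "ATGGATCGTA CGAGCGTGAT GCGAGCACCC"
print(translate(prefix))    # MDRTSVMRAP
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first ten residues of the Protein sequence. If the assumptions hold, applying `translate` to the full Gene sequence should reproduce the whole protein, and `gc_fraction` can be compared with the GC content field.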