Gene OSTLU_28087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28087 
Symbol 
ID5005908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp214504 
End bp216016 
Gene Length1513 bp 
Protein Length465 aa 
Translation table 
GC content64% 
IMG OID640421329 
Productpredicted protein 
Protein accessionXP_001421879 
Protein GI145355253 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000149848 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.278973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCCACTCG CGCGTCCCGC TCGCGTCGCG CCTCGCTCGC GCCGCACGTC GCGCGCGATC 
GCGAGTCGCT CGAAATGCCC GCCGCGAGGC GCGCGCGCGC GGCGCCCGCG CGCGCGCAGC
GAAAACCAAC CACCGAGAAA GAATCCACCG CGCGCGACGA TGTCGACGCG AAACCCGACG
ACGACGTCGC GGTGAAGCGC GAACGGCGCG CGGCGGACGC CGACGCCGTC GCGTCCGACG
ACGAGTCGAC GCGGGCGACG AAACGGACGA AATCGAGCGA GACGGGGACG GGGACGGTTG
GCGGCGGCGG CGACGCGTGC GACGACGATT TCAAGACGCT CGTGCGCATC GATAACAACG
CGGGCAGAGT GATCGGTAAG GGTGGCGAAA ATGTCAAGTA CATCGAGAAC ACGTGCGGAT
GCGTGGTTGA ATTTCGCCGC GATGAAGGCG TGGCGGTGGT TCGGCCGAAC GTCGTCGGCG
CGGGCGGCGC GCGGCGAACG AGCGAGGAGA AGCGAGCGAG CACGCAACGC GCCAAGGCGC
TGATCGAAGA GGTGGCGAAC ACGGGACAGA TCATGGAGAT GTTAACGGCG CACGTGCCGA
GCGATGTCAA CGCCGGGATA CGGATCGATG TGAGCGTGGT GCCGGAAAGC GTCATTTCGG
GGGACGACGA GGAGGAAATC GAAGTGGCGA TTCCGTGCCC GGGGAAAGAG GGTCGCGTGA
TCGGTCGCGG CGCGGCGACG ATTCGAGAGA TAAGCGCGAG GAGCGGGGCG AGTTGTCACG
TCGTTAAAGG TAGTGGCGTG TGCACGGCAA AAGGAAAGCG CAAGTGCGTG AGAATCGCGT
ATCAAATGGT GCACGACACG CTGCAGCTGC AAGTAGATAG ATTCGCGGCG CCGAGCGCGC
CGGCTGTGGC GCCACAGATG CAACCACATG CTTTGCCGAT ACAAGGTTTG CCTCAAGGCG
CAGTGTTACT TCCGAATGGC ATGGCCATGG TGCCTCTGGC CGCGATGGGA CTCGCGGCGC
CGGCGCCGAC CGCGGCGCCG GCGACGACTA TCGAGGTTCC TTGCGCCGGG AACGAGGGAC
GAGTCGTCGG TAAGGGTGGT GAAATGATCA AACATTTACG CGCCGTAACG GCGTGTGGGG
TTGATATTAA GAATAACAAA TCGCCGCACG CGGTGGTGGC GATTACTGGG CCGCTCGCGA
ACGCGCAACT GTGCGCCTCG TACGTGCGCG AAGTCATGGA AATGGGCGAC ACTCGCGCGA
CAGGGGGATT AAGCGGAGCG CCTCTACACG CGGCGCCGGC GAACACACCT CCGTTCATCG
TGCCACAGCA GCAACAAGCG TATCAACACG CGCCGGTGAC GCCACATTAC GCGCCGGCGC
CTCACGGAGC CGAGACGTGG GTACGATACT ACGACGCCGA AGGTAAGCCT TACGAGCACA
ACCCGGTGAC GAACGAAACG CGTTGGGTGT GAGGACTTTG CGACGTGAGA CGAATGGATA
ACTTTAGATT AGC
 
Protein sequence
MPAARRARAA PARAQRKPTT EKESTARDDV DAKPDDDVAV KRERRAADAD AVASDDESTR 
ATKRTKSSET GTGTVGGGGD ACDDDFKTLV RIDNNAGRVI GKGGENVKYI ENTCGCVVEF
RRDEGVAVVR PNVVGAGGAR RTSEEKRAST QRAKALIEEV ANTGQIMEML TAHVPSDVNA
GIRIDVSVVP ESVISGDDEE EIEVAIPCPG KEGRVIGRGA ATIREISARS GASCHVVKGS
GVCTAKGKRK CVRIAYQMVH DTLQLQVDRF AAPSAPAVAP QMQPHALPIQ GLPQGAVLLP
NGMAMVPLAA MGLAAPAPTA APATTIEVPC AGNEGRVVGK GGEMIKHLRA VTACGVDIKN
NKSPHAVVAI TGPLANAQLC ASYVREVMEM GDTRATGGLS GAPLHAAPAN TPPFIVPQQQ
QAYQHAPVTP HYAPAPHGAE TWVRYYDAEG KPYEHNPVTN ETRWV