Gene OSTLU_15647 details

Gene Information

Locus tag: OSTLU_15647
Symbol:
ID: 5002243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 77605
End bp: 79395
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table:
GC content: 62%
IMG OID: 640417664
Product: predicted protein
Protein accession: XP_001418147
Protein GI: 145347382
COG category:
COG ID:
TIGRFAM ID:
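
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(77605, 79395)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1791 596
```

Both results agree with the Gene Length (1791 bp) and Protein Length (596 aa) fields of this record.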


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0712404
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGG CGACGGCGAC GCGCGCGAGC GACGACGCGT TCGAGTACCG CGCGCGGCGA 
ACGAGCGCGG GGGGGTATTA TCGCCCGACG GCGTCGAGCG ATGGACGCGC GGGCGCGACG
TACGAGGTCG CGGAACGCGG CGGCGAGCAC GCGGAGGAAG AGACGAGGAC GCGGAGGTTC
GACACGGCGT CGGCGCGCGG CGTGACGACG TTCGACGCGC GGACGTGCGA GGCGATCGAG
TTCGTGCCGA TCGACGCGTG GTCGCGGGAT AAGGCGAATT TTGAAAAGTT GCGGTCGATG
CGAACGTTTC GACTGCACAG ACTGTGGAAG GCGTTCGCGA CGATGCGCGC GCACGCGCGA
AGGAGAAAGT TTCGGCGCGC GCGCGCGCGG TTTAAGGAAT CGTCCGCGAT TTACGGTGAT
GCGTTCGCGG GATACGCCGT TCCGACGATG ATGCGCGTGT ACGACGCGTG TCACTCGATC
GCGCGAGACG CGCGCGTGTT TCGGCGCGAG GAAGCGCCGG CGACGAAGGT CGAAGAATCG
TACGAGTTCG ACGACGAAGC GAATCTCGCG CCGGCGACGT ACGACGCGCG AGCTCTGTTC
GAGACTCTGC TACGACTCGC AAACGAAGGC TCGACGCACA TTCGTGACGT CGCGTCGGCG
ATTTCCAGCG ATGTGGAAAT CGCCAAAGAC GCGATCGAGC GACGATTTCT CGCCGATTTG
GATGAGCGTC TACAGCCCAT GATCGCGGCG ACGAAGCGGT ACCGTCGACG ATCGAGCGGA
ACGAACGGTG GAAAGCCAGC GCTCGCGCCC ACGATCGGCG ACGATCGACT TGTGACGGCG
TCAGAGTCGA GAGACGCGTA TCCTTACACC GAGCGCGCGC TCGTGGCGGC GCTTCGAATG
AAGCTGGAAT CTTTCGAGCG CTCGATTTGG ATGTGTTTTC GAGTCGCGAT TTCGCGCGCG
AGAGACGCAT CGCTCGAAGA ACTCACCGCG TACGTGAGGG ATGCGCGTTC GTCGGAGTCA
GCGATATTTC AAACCACGTT TGATTTCGAA ACGTCCGCGC TCGTACCGAG CTCGGATGCG
TTCGAGCGAG CGATTCGAGA CGGCGCCGCC GCCTGGACGT CGTCCTCGCT CGGCGAGCGC
GTCGGCGACG CCGTCGATTT GCGCTCGCTC GAGGATGAAG AGTTTGAAAA TCGCATCGAT
AAATTTTGTG ACCTCGTCCG CGACACGTTC GCGAGCGCGC AAGAGGCGTT GCTGGTCGTC
GAGACGCGGT TGCGCGAAAA ATCGCCAGAC GCCGTCGCGG ACGCAGACGG AGGCGATGAC
GAATCGTCGA TGATTGAACG TTTGAGCGAG ATCGTGGCGC GTTCGAATGC GTTCAAGAAG
GAAGTCGACG CGTTGCCAGG ATCGATTCGT CCGAACGACG GCGCGATCGC CGTGGACATC
GTATCGCTGA AGCGCGCTTT GAAACCGGTG GCGTCGGCGA CGATTGACTC CGCGTGCCGA
ACCGCCGTGG ATTGGGCGGC CGAACGCGCG CAGACGATCG CGCAGAAGTT CACGCGCGTG
AAAACCCTCG ATGATAACGA CGATGAGACC AAGTTGAGTA AGATCCGTGA TCTCCGCGAC
GAGGCGCGAG AGCTCGAGAA TGTGCATTTG GCGATGAAAC GACTCGGCGC AAACATTCCG
GATTTCGACC GCGCGGCGTT CAAAGGCGTC GTGGAGGAGA TCGAAAACGC GCTCGCGGGT
GACGACGGTG GCTACGAAAA AGATGAAACA AAGCACGGCG GCGATCGTTG A
 
Protein sequence
MATATATRAS DDAFEYRARR TSAGGYYRPT ASSDGRAGAT YEVAERGGEH AEEETRTRRF 
DTASARGVTT FDARTCEAIE FVPIDAWSRD KANFEKLRSM RTFRLHRLWK AFATMRAHAR
RRKFRRARAR FKESSAIYGD AFAGYAVPTM MRVYDACHSI ARDARVFRRE EAPATKVEES
YEFDDEANLA PATYDARALF ETLLRLANEG STHIRDVASA ISSDVEIAKD AIERRFLADL
DERLQPMIAA TKRYRRRSSG TNGGKPALAP TIGDDRLVTA SESRDAYPYT ERALVAALRM
KLESFERSIW MCFRVAISRA RDASLEELTA YVRDARSSES AIFQTTFDFE TSALVPSSDA
FERAIRDGAA AWTSSSLGER VGDAVDLRSL EDEEFENRID KFCDLVRDTF ASAQEALLVV
ETRLREKSPD AVADADGGDD ESSMIERLSE IVARSNAFKK EVDALPGSIR PNDGAIAVDI
VSLKRALKPV ASATIDSACR TAVDWAAERA QTIAQKFTRV KTLDDNDDET KLSKIRDLRD
EARELENVHL AMKRLGANIP DFDRAAFKGV VEEIENALAG DDGGYEKDET KHGGDR