Gene OSTLU_15541 details

Gene Information

Locus tag: OSTLU_15541
Symbol:
ID: 5001788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 657343
End bp: 660699
Gene Length: 3357 bp
Protein Length: 1118 aa
Translation table:
GC content: 56%
IMG OID: 640417209
Product: predicted protein
Protein accession: XP_001418078
Protein GI: 145347232
COG category:
COG ID:
TIGRFAM ID:
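
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon (the gene sequence below ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(657343, 660699)
print(length, protein_length_aa(length))  # prints: 3357 1118
```

Under these assumptions the computed values agree with the listed Gene Length (3357 bp) and Protein Length (1118 aa).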


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGACG AAGACGCGGC GGCGATCGAA CGCGCGCGCG CGGACGATGC GAGCGTCGCG 
TCGACGCGCT CGGGGGACGA TGCGGGCGCG ATGGCGCCGC GATGGCGCTG CGTGGCGTCG
AGCGCGGATT TGCGAGCCGT GAGCTCTAAA CGCGTTCGGT TCACGTGCGT CGATGCCACG
CGCGCGCTGG TGATTTTGGG GGCGAACACT GGGTCGGCGT ACGTGTTCGC GCGATCGAGG
GCGCGCGATG ATGGCGCGGG AGGCGTGGAA CGACGCGCGA GATTCGTCGC GGTGGTGTCG
CCGGAATTGG TCGAGCCGAG CGCGAGACGA CAGGGGAATG TGGGCGCGCG CGCGGCGCCG
CAGAGCGTGC GGACGATCAG AGCGTCGCCA TGCGGACGAA TGTGCGCGCT GGGGTTCGCA
GATGGACACG TGAGGGTGAT TGAGCTCGAT GGATTGACGC GGGAGGAGGC GCCGCGGAGA
AGCGCGCTGG GATCGACGGT GGCGTTTCTG TCGAGCACGC ATGAAGGACG AGCTGTCACG
GCGCTGTCGT GGTCGAGCGA TTCGCGGGTG TTGTATGCGG GAAGCGATCA AGGGGTGGTG
ACGGTGACGT CGTGCGCAGC GTTCGTGGAG TGGTGCGACG GTGGCCGCAC GGGCGCGAGA
CCGGCGCCGG TGAATAAGAC ATCGTACACA GACGTGAGTA GTGCGGTGCA TCAGCTCGAT
GCGTCACCGA GCGGTCGTCA CGTGATAACA AGCGCACAAT CTAGCGCACA ACTCGTGATT
GTGCAGGGAG CGCATAGCGG AACGACGTCG AATATAGGTA GCAAGCAGCG CGAAGGATCG
TACGGAAGTT GCTTCCACGG GTACGCAAGT TTGAGCGTAG AAGATGACGA TCCCGTGGAA
GAGGAAGACG AATGGGACGA AGAATTAGAA GGTGTGAGGG TAGTAGAGCA TGCTGTGGTG
TCACGACCGG GGCGTAAGCT GTGGATTGCC AAAGTGGACA ATGTTGACTC TGAGCGAGCC
GACGTGGAAA TCATGGCAAC CATCAAACCC GAAGTGCCAG TGCCTTCCAG CGTACCAGGA
TGGGATCAAT CGAGTGAATC CGCGGATGCG GTGAAACGCG CGTCAAAGAA ACTAGAGTTT
GGCCTCTTGC ATCGTTTAGG ACCTTGCGTG TTGTCGACGA CGGAGCGCGC AGTCGCCATC
ATCGACGTCG CTACTCCAGC AATCGTTAGA TGGTATCCCC TAAAAGAGCC AGGGAGCGAA
AGTTTGAGCG CTGGTTTCGT CGATGCATGC ACAGTTGATC ATCGAGCGTT TTTCCTGACG
CCATCTGAAG ACAGTGGGAA TTCGGTGTGG TGTTTGGAAT CATTCGTCGA CGCCAAGGCG
CTTGCGCATG ACGTCGTGAG CGAGACATCC TCTGCAAGTG CATTCATCCG TGCCCTAGAC
ATTTGCCGCA AGACAAACTC GTACGACGAC GGCTTATTCA GAAGGGCGAA AGAGGCGTTG
GATTCGTCGG AAGCGGACGC CGCCGGAGTA AATAGTAGGC TTGCGTTTCT CGTGCAATGG
GGCGAACAAG TGGGCTCGAA GTTGGCGCCG GCGAAGAAGG ACACCGAAGT TGACCTGCGC
GAGCTCGTGA CCGAGGAAGA AGTATCATCA CCGAAAGCAC CAAAGACGCC GGACTTCGAT
CTCGGAAAAC CACCTCTCGG TAGAGATCGT TCGACGATTG ATCGACCATT GTCCGAATTG
GCTGCCCCAG CTTCGAACGA GTTCCCGAAG GCGGAAACCG AAGGAGGCAT CTTCTTTTAC
AATCCTCGTG GCGTTTCCAA GACATCCAAG GTAGATGGAG AATTCGCGCC GAAGGCAAAT
GGAAAGGTGA AGAAGCGACA CGCGCTCATT CTGGACGATG TCGAATCAAA CGAGCCTGCA
GTCATACTAC AAAGCAACGT GAAAGTAGTC TCGCCGACAA AATCCAAGAA TTACGGGGTT
GATTTACCAA ACAATGACAT CGAATGGGAA GAGTGTGACG CGTTCGATGC TCAAGAGTGG
CAAAAGGCGA TGGATAGCGT GAAGTGTTTG CTCCCAATTC GAGGGGCGCA TTTTGAGCAT
TGGAGCTGTT ACGAAGACAT TAAGACGACG TCGACACTTG ACTCCACTGA CACTCCATTG
CCCGAGCAAA ACGCTCGGAT TCAGTACAGA TTTATGACGA ATGTACGAGT CGCCGCGATC
GTCGACTCCG TCACGAAACT TAAAGAGTGT CGAATGAGCC TCGACGCGTC GATTCTCCTG
CCTTCGCTTC GGCGTTGGCG CAATATACGC GCCGAGTCGA TTGAACTTCT GACACGCTTT
GAAAAGGGCG AAGATGCACC ACTCACCGCT AAGATAAAAC TGCAATGGAG TTCGCTGTTA
GAAAACGTTG AGAAAGAGCT TGACGCAGTG TGTGATGAAC TCAAGCTCGA CGTCGCAGCG
GTGAAAAAGC AAACGCCGCC AAAGAATCGT GAGACTCAAA CGTCTCTCGC GTCAGCGGAA
CATTCGACGG ACTCAATGAC AATGGCAAGT GTAACTGCAG CGTTGACGGA AGGAGGTGCG
ATTGATATGA GTCAATCTTT GGAGTCCGAA TGTGTGGCAT CGCTTCGATC AACCGATGTT
GCAGAAGCGT CGGCTATCGT ATCAGAATGT CTTAGGCGGG CGCTTCTCCA GACTTTAGAG
TCGCTTGACG GCGCTGACAC ATCTGCGGCT CTCGAGCACG GCGTATCGCA TCTTATGTTA
GTCTCGCGCG TAGGTTCCGC CGCAGTTGGA GCGGCGGAAG TCATTCGAGC GCTTGCCAAG
GCTTCAGAAG AGGAGAAGTT GTCGAAAGCG ATATCACCGA TGAACACGAA CGAATCCATA
TCCAGAGCTC TCAGTCGTAT TTTCACAAAT GTCACCACAT TCTTGACTTC CGAAAATTCT
GTGGATTTGG ACTTACGCCG TGGCGCTGCC AAGCTTCTCG AGCCACTTGA CGCACACTTA
TCGCGGCCAC CCATGCGGCA GTTTGGAAGA TTTCCACAGC TCCAAGCCGC GCTCGCGGCG
GAGATCGACG GTGTCGTGGA CCAATTACCG TTTGTATGTC GCTCGTCTAC CGATGCTGAT
GACGGAGTGT CACAATTTAC TCTGAAAGTG CCTCACGAGA ACCCAATCGC GCCCGCCATC
GAAGATTTTG GCGACTGGGG CATCAAGATG GATCTTCGTC GCTGCCCCGC GTGTAGTCAC
TCCCTCCTGT GCCCCAACGA CGGCGAGCTC ATCACCTTTA TGTGCGCGCA CACGTATCAC
AAAGCGTGTT GCGCGGCGTC TATGGCTTGT TTCGCGTGTT GCGCCGACTC CCGCTGA
 
Protein sequence
MPDEDAAAIE RARADDASVA STRSGDDAGA MAPRWRCVAS SADLRAVSSK RVRFTCVDAT 
RALVILGANT GSAYVFARSR ARDDGAGGVE RRARFVAVVS PELVEPSARR QGNVGARAAP
QSVRTIRASP CGRMCALGFA DGHVRVIELD GLTREEAPRR SALGSTVAFL SSTHEGRAVT
ALSWSSDSRV LYAGSDQGVV TVTSCAAFVE WCDGGRTGAR PAPVNKTSYT DVSSAVHQLD
ASPSGRHVIT SAQSSAQLVI VQGAHSGTTS NIGSKQREGS YGSCFHGYAS LSVEDDDPVE
EEDEWDEELE GVRVVEHAVV SRPGRKLWIA KVDNVDSERA DVEIMATIKP EVPVPSSVPG
WDQSSESADA VKRASKKLEF GLLHRLGPCV LSTTERAVAI IDVATPAIVR WYPLKEPGSE
SLSAGFVDAC TVDHRAFFLT PSEDSGNSVW CLESFVDAKA LAHDVVSETS SASAFIRALD
ICRKTNSYDD GLFRRAKEAL DSSEADAAGV NSRLAFLVQW GEQVGSKLAP AKKDTEVDLR
ELVTEEEVSS PKAPKTPDFD LGKPPLGRDR STIDRPLSEL AAPASNEFPK AETEGGIFFY
NPRGVSKTSK VDGEFAPKAN GKVKKRHALI LDDVESNEPA VILQSNVKVV SPTKSKNYGV
DLPNNDIEWE ECDAFDAQEW QKAMDSVKCL LPIRGAHFEH WSCYEDIKTT STLDSTDTPL
PEQNARIQYR FMTNVRVAAI VDSVTKLKEC RMSLDASILL PSLRRWRNIR AESIELLTRF
EKGEDAPLTA KIKLQWSSLL ENVEKELDAV CDELKLDVAA VKKQTPPKNR ETQTSLASAE
HSTDSMTMAS VTAALTEGGA IDMSQSLESE CVASLRSTDV AEASAIVSEC LRRALLQTLE
SLDGADTSAA LEHGVSHLML VSRVGSAAVG AAEVIRALAK ASEEEKLSKA ISPMNTNESI
SRALSRIFTN VTTFLTSENS VDLDLRRGAA KLLEPLDAHL SRPPMRQFGR FPQLQAALAA
EIDGVVDQLP FVCRSSTDAD DGVSQFTLKV PHENPIAPAI EDFGDWGIKM DLRRCPACSH
SLLCPNDGEL ITFMCAHTYH KACCAASMAC FACCADSR