Gene OSTLU_14831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_14831 
Symbol 
ID5000812 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp941189 
End bp946052 
Gene Length4864 bp 
Protein Length1563 aa 
Translation table 
GC content61% 
IMG OID640416233 
Productpredicted protein 
Protein accessionXP_001417100 
Protein GI145345183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGAC GGGCGAAGCG CGCGGTGAAA GACGCGCGCG TCACCGCGAA GGCGCGCGCG 
ACGGTCGAGA CGCGCGACGA GGCGTACGCG CGCGTGCCGC TTCGAATCGA CTTGGACGGC
GCGCGCGCGA GCGATGACGA CGAGGACGCG CGCGACGAGG ACGCGCGAGC GTCGCCGCGC
TTGGTGGCGC ACTACGGATT CGACGCCACG ACGTCCGCGG TGTCGTACGC GAGGGACCAA
CACGCGCTGT TCGCGGCGAG CGAACACGGC GTGAAGGCGT TCGGCGCGCG CGGGACGGAG
TGCTTGTTCG CGTCGGCGTC GGGAGGCTCG TCGGAGGCGG TGCGAGGAGA GTACGTCGCG
CCGTCGCGCG TGTATCGATG CGGACGAGAC GGCGACGTCG AGGTGTGGGA CGGGCGCGAG
CGAAGGTCGC TGGCGAGCGA AAATTTAGAC GCGGGCGACG ACGAGCCGAG CTGCGCGAGC
GAGGCGATGC GAGGGACGAA TTTCTTCGTC ACGGGAACGG TGAGGGGGGA CGTGTGCGTG
CACGCGTTGG GAGTGAACGC GCGAGGAGAG ACCACGGTGG CGAGAAGGCG CGGGTACAGG
GTGTCGGCGT CGCGGGCGTT GTCGCGCGTT TTGCCGACGC GAAACGCCGT GGCGGCGGTG
CGCGCGCGAC CGGGAAGAGA CGAGCGAGCG ATGATGCTCA TCGCGTGGTC GGACGGGGCG
CTCGCGCTGT GGCACTTGCA CGAACAACGG TCGGTGGCGG TGACGTCGCC GCGAGGCGAC
GCTGTGGACG AGACGGAGGC GACGGACGCG TCTTTGACGT GCGCCGAGTG GCTCGACGCC
GAATGCGTCG TCGCGGGGTA CAGCGACGGG CGCGTGAGAG TGTGGAAAGT GCGCGCGGGT
ACGGTGATGA CGGAGGAAAT CGTTGAAAAG CAGTGCATCG TGCCGCACGT TTTGATTAAT
CCGTCGTACA AGGGCGCGCT GACACCCATT CGCGCGTTGA AGACGTACGT GAGCGAAGAC
GACGACGCGG CTCGCGACGT CGCCACGTGG TTCGCGTGCG TCGGCGGCGA GCCAATCGCG
TGTCCAGATC CCGTGATTTG TTTGCGAGCG ACGCGATGTG ACGGGGAGTT TCGAATCGAA
AGCGCGGGTG CGATTGCGCT GCCTTGGTTC GGTCCAGTGC TCGACGCGAC GTTGGTGCCA
TATCGCGACA CTGTGGAATC GATTTGCGTT CTGTCCGAAG GCTCGCAGCT ACACCTACAC
GACGTGCGTT ACGGAGTTTC GAGCGACGAC ATCGCGCGCC CGCAGGAAGT AATGGTGATG
CGACCGACAC TTTCGAAGAC GTGCGCGCCG AGTATCTGCG CGACGTCGCG CGACGTCGCG
CGGGCATTTG AGCAGAACGC CAAACGCGTC GCGGTTCCAG AAAATGAAGT AGACGCGTCG
TTCGCTTGGA AAGGGTCGAA ATGGCCAATC AGCGGTGGAT GCGACGTAGT CGATGACACG
ACATCGCCTT CATCCGCGCT GCGTGCGCGC ATCGTCGTCG GCGCGTTCGG CAGAGACGGT
TCTGGAGTGA AAATCTACAT CGATCGCGAC GGACGATTGA TGTCTGGTGG CGCGATCGCG
CCGAACGGCG ACTCGATCAC GAGACTGCAC GTCGACGCCG GTGGTGCTTT GCTGATCGTC
GGTCGGGTGA GTGGAAACCT TGAAATATAC GCGTTGCGCG AGTATCCAGC CGACGGTGCC
GAAGCGACTT CGCGAGTACG TACGCACGCG CGCAACTTAA ACAAGAGCGA CGCCGAAAAC
GTTGACGAGG AGCGTGCATT TTTCGCCGGT GAATCTCGCG ACTTTGTCGA TACGGATTCC
GCAGCGGCGT GTATGACTTC GGTATACACA CTCATCGGTA GATTTAGCTC GGCTGGCAAA
GCCATATCGT GCGTTCGAAC GAACGCCGCG GCGACACTCC TCGCCGTCGG CGACGTCGCA
GGGTGTGTGT CCTTACTCAA TCTTCAACGA GGAACGAAAA TGTGGACGGT GTCGTTACCT
ACTTCTGCCG AAGGAAAGCC ATCGGCCGTG GCGGATTTCG ACTTTGGGTT ACCGCTTCCA
GACGCTCCGG ATGAGTGCGT CCTGGCGGTT CTAGAATCAA ACTGTAGCGT TCGTTTCCAC
GCGCTTTCGA CCGGTTCGCA GATTGGGAAG ACTATGACCC CGAAGTCAGC GGCAGAAGAC
GTGGCGCTTG CAATCTCGCT CCTTCGGCTC GATGGTACGG CGTCAGATAT AATTCCACCA
GCGCCCGTCG CCAAAAGTTG GTTCACACCG CCGACATCGT TTGCTTATCA ATTTCTCGCG
TCGGAGTGGA CTGTGGACGA CGCGTCTGCG TCAGACGACG AAGACAAAGT GCTGTCGACG
GACGAGGAAG ACCGTGTCGA CAGCGACGAC GTCATTAAGG GAAACCCAAA GCTGACCGCC
ATTGTCGTTA CTGTGGCGAA AGATTCGATG CGCGTATACC ACGCAGTCGG CTGCTCTCGA
GGAGAGCGAT TTACGTTGCG AAAAGAACAC CTCGACGAGC CGCTCATCGG TGCGTTCGCC
GTTCGAGATG ACGGTAGCGA AAACGAATGT GCGTTCGGTC ACGGACGGAA ACGTTCTCAC
ATCGTCGCCT TGACGGAGTT TGGGAGAATG GTGGCGTACG CGTCGCCGTC CCTGCAAATG
CGCGGCGTAT TTGGTCCGGT GCCGGCGTTG TCCAACGCCA ACGCGACGTG TTGCTCTCGA
GGCGGCGCCG TCGTCGTCGT CGCAGATGAC GGCTTGTCTA TCGCTCGACT CGAAGCATTT
GGTGACAGCT TTCCCGCGCA AGGTTGTATC ATCGATCTCG AAGTCGAGAG CGCCGCGGAA
GCCGCGCGAG CGGCGCGACA CGCCATGGAA GAAGACGATC CCGAACTCGC GCAGCAAACC
AAGTCACCTC GACGAGAAGT ATCGTCAGCG TCGACGACAC CCATGAAGAC CAAAGCGCTG
ACGATGAGTG AAGCGCTACG ACATCGCGCA AAGGCGGCGT TTGAAAAGCT GGAAGAAAAG
TTCGCGCAGT CACCACGTGA CTCATCGTCG TCGCCAGCGA CGCGCAAGAT GTACACGACG
ACGGACCTGG CTGTCTTATT CGCAGATGCT CGCATAGAGG ATCCCGCGCC GGTCGAAAGG
ACGACAGAAT CTGCAGAGCG CGATGAGTTG TTCGCGACAT CGTCGTCCAC GCCAGCCATC
GCGCCTGTGC GTCGGAGCGC GTCTTCAGTC CGAGCCAAGT ATGGACGAGA AGTTACTTCG
CAAATGAACG AAACAAAAGA CATGCTCGTC GAGCGTGGAG AGAAACTTAG CAGGCTTCAA
GATAAATCAG CCTCGCTCGA AAATGACGCG GCGGATTTCG CCTCGCTCGC TCGCGAGATT
CGTAAGCAAT CCGAACGTCG GTGGTTTTAG ATGCGATTTT AGAATTTTTA GTAGCAGCGC
GACGCCGAGC GGCGTCACAA GTACACGCGC ACCTTTCATG CCTCCCGCTG GAAATTGGCC
CAAGTGGGCC AAAAAGCGTC ACGCTCGAGA CTTCACGGCG CGCGACACCG TGCTCGAGGA
GCTGCGCAAG AAGACCCACC ACGGCGCCTC GCTCCTCAGA CGCGCGTTGA AGAAGGCCAA
AACCTTTGAC GAAGCCAAGC TCCGCCGGCG TCTCAAGGCG CCCGAGGACG CCGATAAGCT
CAAGCGCGCG TTGCACGCGA CGCGTAACGT CGATATCGAC GTCTTGGCTC GGCAATGCGC
GTCGGCGTGC GCGCAGAGAG TGGAAGAGTC GCTCGAGTTG GCATTTCGCG AGGTGGCGGC
GGATGGCGAC GCGAACGCCG TCCGCGCGAC GCACGCCGTG GACTTGGGCG ACGGCTTCGA
CAGTCAAGAC GTGTTACGCG CTATTTGCGA CGACGCCGCG AGCGCAAAAG CGAGCGAGAG
CGAAGGGAAG GAAGTCGTCG GGGAGAATGA ATACGTGAGC GCGGCGAGGC GGTTGTTGCG
AGCGCAGGCG ACGCGCGCGG AGACGGATGC GCTGAAGGAA CGATTGTTGG CGATCGCGGG
TCGAATGAAT CGTGCGGTGG TTGGGAAGCA AAAACGCGAA GAGTGGACGC GTGAGAAGCA
GGAGCGACGG CAGAAGCGCG AGTTGGCCAT CGTCAAAGCG GAGGAAAAGG CGAAACGGCG
AGCGGCGGGA GAGGAGGTGT CGTCGAGCGA GGAAGAGGAG GTCGTGCACG CGAAGCCCAC
GTCTCGCCGC GATGCTGACA ACGAGGATAG TGCATCCAAA TCGGATTCTG AATTCGAATC
CGACGTCGGT GAAGATGGCT ATGCGAGTTT GAGCGAGGAT ACGTTGGCGG AGCTTTACGC
GGCGAAGGCG GCGGCGAAAA AGTCGAAAAA GCAGGCGAAA TCGGCGAAGG ACGGGGACGA
TGTCGAGGAT GTAGATCTAG GACCAAAGAA GAAAAAAGTG AAAAAGCGCA TGGGGCAGCG
CAAGCGTCGT CAAATCGCGG AGGCTAAGTT TGGCTCAAAC GCCGCGCACA TCATCGCCGA
ACGCGAAAAA GCTGCAGCCG AGCGTAGGGC GAAGGAGGAA GAGGAAAGAA ATATGCACCC
GTCTTGGAAA GCGAAACGTA AGCAAGCACC CATCATAATC GCTGGTGCAA AGGGTAAGAA
GGTCAAGTTC GGCGATGACG ATGGTGCGAA GAAAACGCCG ATAGTTTCGA AGAAGCAGCA
ATACGCGCCC AAGGTTCCGG AGGGACCGCT GCACCCATCG TGGGCAGCTA AGCTGAAGGC
TGATCAACAG GCCTGGGGCG GCGGCGGTGT CAAGCCCGAG GGTAAAAAGG TCGTATTTGA
TTAA
 
Protein sequence
MLRRAKRAVK DARVTAKARA TVETRDEAYA RVPLRIDLDG ARASDDDEDA RDEDARASPR 
LVAHYGFDAT TSAVSYARDQ HALFAASEHG VKAFGARGTE CLFASASGGS SEAVRGEYVA
PSRVYRCGRD GDVEVWDGRE RRSLASENLD AGDDEPSCAS EAMRGTNFFV TGTVRGDVCV
HALGVNARGE TTVARRRGYR VSASRALSRV LPTRNAVAAV RARPGRDERA MMLIAWSDGA
LALWHLHEQR SVAVTSPRGD AVDETEATDA SLTCAEWLDA ECVVAGYSDG RVRVWKVRAG
TVMTEEIVEK QCIVPHVLIN PSYKGALTPI RALKTYVSED DDAARDVATW FACVGGEPIA
CPDPVICLRA TRCDGEFRIE SAGAIALPWF GPVLDATLVP YRDTVESICV LSEGSQLHLH
DVRYGVSSDD IARPQEVMVM RPTLSKTCAP SICATSRDVA RAFEQNAKRV AVPENEVDAS
FAWKGSKWPI SGGCDVVDDT TSPSSALRAR IVVGAFGRDG SGVKIYIDRD GRLMSGGAIA
PNGDSITRLH VDAGGALLIV GRVSGNLEIY ALREYPADGA EATSRVRTHA RNLNKSDAEN
VDEERAFFAG ESRDFVDTDS AAACMTSVYT LIGRFSSAGK AISCVRTNAA ATLLAVGDVA
GCVSLLNLQR GTKMWTVSLP TSAEGKPSAV ADFDFGLPLP DAPDECVLAV LESNCSVRFH
ALSTGSQIGK TMTPKSAAED VALAISLLRL DGTASDIIPP APVAKSWFTP PTSFAYQFLA
SEWTVDDASA SDDEDKVLST DEEDRVDSDD VIKGNPKLTA IVVTVAKDSM RVYHAVGCSR
GERFTLRKEH LDEPLIGAFA VRDDGSENEC AFGHGRKRSH IVALTEFGRM VAYASPSLQM
RGVFGPVPAL SNANATCCSR GGAVVVVADD GLSIARLEAF GDSFPAQGCI IDLEVESAAE
AARAARHAME EDDPELAQQT KSPRREVSSA STTPMKTKAL TMSEALRHRA KAAFEKLEEK
FAQSPRDSSS SPATRKMYTT TDLAVLFADA RIEDPAPVER TTESAERDEL FATSSSTPAI
APVRRSASSV RAKCDFRIFS SSATPSGVTS TRAPFMPPAG NWPKWAKKRH ARDFTARDTV
LEELRKKTHH GASLLRRALK KAKTFDEAKL RRRLKAPEDA DKLKRALHAT RNVDIDVLAR
QCASACAQRV EESLELAFRE VAADGDANAV RATHAVDLGD GFDSQDVLRA ICDDAASAKA
SESEGKEVVG ENEYVSAARR LLRAQATRAE TDALKERLLA IAGRMNRAVV GKQKREEWTR
EKQERRQKRE LAIVKAEEKA KRRAAGEEVS SSEEEEVVHA KPTSRRDADN EDSASKSDSE
FESDVGEDGY ASLSEDTLAE LYAAKAAAKK SKKQAKSAKD GDDVEDVDLG PKKKKVKKRM
GQRKRRQIAE AKFGSNAAHI IAEREKAAAE RRAKEEEERN MHPSWKAKRK QAPIIIAGAK
GKKVKFGDDD GAKKTPIVSK KQQYAPKVPE GPLHPSWAAK LKADQQAWGG GGVKPEGKKV
VFD