Gene OSTLU_17468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17468 
Symbol 
ID5004505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp589550 
End bp592843 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table 
GC content59% 
IMG OID640419926 
Productpredicted protein 
Protein accessionXP_001420389 
Protein GI145352085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones74 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAC GTGCGGTGCG GGCGACGCCG CGCGCGGGGC CGTCGACGTC GACGCGGCGG 
CTGAAATCGT CGACGCCGTC ACGCGCGTCC ATCGTGCGAC GCGCGCGAGG GAAAGGCGAC
AAGGACGATG CGTCGACGCC GACGTCCTCG ACGCCCGACC CAGAAGAGTT GAAACTGGCG
TTGGCGCGCG CGAGGACGCG CTTGGAGCGG TCGCGAAAGG GTCTGCCGGC GTTGGTCGAA
GAGAACGAGA GCGAGGGTTT CACGTCCGAT GAAGTCGTCG AGGAGTCGTC GACGGCGGAA
AACGGCGACT TGCGAAGAAT CGTGTCGGAC TGGTTTGGCG CGGTCGGAGG CGTCTTTGAC
TCGCTGAACT CCGACGACGA GGCGATGGAC GCGAGAGACG AGAATCCCGC CATCGCGGCG
CAAGATGATG CGTCGTCGTC GCGTGATGAC GACGAAGACT CTTCGGCGCC CGTCGAGGCG
GGCGGATGGT TTCGATGGAA GAGTCCATGG CGGGTGAGTG GAGTGGGCGA GTCCATCGCT
ACGGCGACGT CGTCGTGGAT GCCGAACAAG AGTGTAGACG GTGATGGAAG CTTCGTAGTG
AACACCGACG CAATCGGAGT CTCAACAGCG AGCGATGGAG CGTCATCGTC TTCAAGCGAT
CGAGGAGTCG AGCCGACGAA CTTGCTCGAG CGCGTGCCGT TGCCGCCGTG GATTCGGCAA
CGCGTGGACG GTTCGAGTGA TGGTGATTCA TCCTCAGACG ACGAGGAAGA TGATGAAATG
GCGTACGACA CGACGATTTC GAACGAGAAC GTCGGTAGTG CGATTCGTGA TGCGAGCGAG
ACGGCGGTGA GCGTGACGAC AGCGGCGGTG GAAAAGTTGA ATTCTGCGCT TCTCGGTATT
GAAACCGTGG AGAAGAGCGC GGTGAAGGAT GGAAGGTTTG AAAACAACGC TTCTGCACTC
AAAGCGTTGC AAGAGTCGCT ACGAGATGCA CGTGCTTCTG CAAAAGAAGC GCGCGAGGCG
AGCCAAAGTC TCGAAGAAGC CGTGCGCGAA GTGAGTCGCG CGCAAGAAGC GCTTCGAGCA
GAGGGCGATT TGAAAAAAGA AGAAGCGATC AAGGCTCTTG AGGCTGCCAA GGCGGCTGTC
ATCCGACAAG CGACGAGCTC GAACGCGGCC GTCGCAGATG CGCTCCGGCA GGTGTCATAC
GTCGCCACTC GAGTCGAGAA ACTGGGCTCA TGGACGCTCG AAGACGACGG CGAGGCGTCG
TCACGCGCAC TCGACACTAG CAAGGCGCAA AAACGCGGGG TTCGTCGAGG CCTGATGGGC
GCGATCGCTA GCGCGCGTGA GAGTGCTCAA GACGCGCTCG TAGCGTCAGA ACTCACGCAG
ACGGTCTCAA GGCTGGCTTC ATTGGCCACG CAGCGCGGAG ACGGCAAATC GAAGAAGTCG
GAAGAGCTGA GCGAAATGGT CAAGCCCGCG CTGCAACGAG TCGTCGCGAA CTCGCTCGTA
CCTGTCGAGA GCGACATTGA GATTGGCAGT GCGAGATTGG CGTGTTCGCT CGCGGCTTGG
GTGTACTACT TACCGCAAAT GCAACACGCG TTGCCTCGCA ACGGTTTGAA GTTGATCACG
TCATCGTTGG ACGTCGAAGA AATCATTCCG TCCACTGAAT TTCGCGCGGG CAAATCGCTC
TTGGAGGAAT CTGCGTGGCG TGCTGAACAG GCGGTTGAGG AAGCCGTAGT CGCGGCTCGA
GTGGCTGCGA GTAAAGACGC CAAGCTGGAA ATCGCCGCAT CGAAAGCTGC AGCCGAGTCT
GCCGAGCTCG CGGCGCAGGC TCTGAAGCTC GCGGCTGAAT TACGCTCTCA GGACGCCCAG
GACGAATCGA CTAGACTCAA AGCCAGAAAG ACGGCGCGAG CGGCGAAAGC GCTCGCCGTC
GAGGCGCAGA AGAAGTTGGA CGAAGCGAGT GCGATTGCCG AACGGAACAA GCGCAAGTTG
AAAAAGTGGG CCATGCAGCG CGAGCTCGCG GATCTTCAGT CGCAAATGCA ACAAACGGTG
ATCGAACAAC AGCGGAAAGT CGAAGCCGAG CGCGTGAGAC AAGAGCGAGC AGAAAATGCA
TCCCTCCCTG TGAACTTTTG CGTCGCCGCA CAAGACGACA CCGCGACGCT GTGGGTCGTC
GTCGAGGGCT CTACGAACTT TGCGAGCTGG CAAGCAAACT TGACCTTTCA ACCCGTGACG
TTTGAAGATC CCGCTCTCGG TGTGGAAGTT CATAGAGGCG CTTACACGGC GGCGAAGACG
ATGTACCGTC GAATCGAGAA AGCAGTCAAA GAGCACGTCG CAAAGCACGG CGCGCGAGCG
CGAGTGCGAA TCACCGGACA TTCGATCGGT GGATCGATCG CGATGATCAT CGCGATGATG
CTCCTCGTGC GAAACGGCGC ACCTCGCTAC GCCATAGCCG ACGTCTGGGC GTTCGGCGCG
CCGTACGTCA TGACTGGCGG CGAAGCGCTC ATGACTCGAC TCGGATTACC TCGTTCGTTT
ATTCGAATGA TCATGATGGG CGACGATGTC GTGCCGCGCT CATTCTCGTG CTATTATCCG
CAGTGGGCGC GCCGAGTGTT AGACAACGCG CCTGGGCCGT TCAACGTCAA CACTTCTACG
GCGAATTTTT TGGACGAGCA GATGTTTTAT ACGCCGATGG GTGACTTGTA CGTGCTGCAG
GCGAACAACG GTTCTGAACA CCCGCTTCTG CCGCCGGGAC CCGGTCTTTA CATCCTCGAC
GGCGACGGCG TGTACGAGAT GCTAGCGACG CGCGCGCGAT TGGGCGAAAA CGAAGACGGC
GATGATGAGT CGTGGTTGAA TCGTCGCCCT TCGAGCAAGC ATTGGGACGA GGCGGCGGCG
TACGACGTGG ACGGAGACTT CTCTTCGTCC GAGGATGAGA AAGCGAACAA ACTTCGAATG
CATCAGCTCG CGTGTTTGAC GCAATCAGAC GCCGCCCTCA CGGCTTCGCT CATCATCAGT
CACATCAAGC AAGACGAATT GCTCCCGAGC GAAGGCGGCA TTGCGGAGGT GTTGAATCAG
CGAGGGCGAG ACGCCGCACA AAGAGTGTTG TTCAATAGCC CGCATCCTCT GAGTATTCTC
AGCAAACCCG ACGCGTACGG CGACGCGGGC ATCATAAGTC GCCATCACAA TCCGTTTCAA
TACGCGAAGA GTCTTTCTCT GTCGCGAAAA CGGAAGCCTG GCGTCGCCGA TCTGATTTCT
AGCCTACCGA AAAAAAGCAT CGACGGCGCG CCAGCGAGTG GCGGTCCGAG ATGA
 
Protein sequence
MRERAVRATP RAGPSTSTRR LKSSTPSRAS IVRRARGKGD KDDASTPTSS TPDPEELKLA 
LARARTRLER SRKGLPALVE ENESEGFTSD EVVEESSTAE NGDLRRIVSD WFGAVGGVFD
SLNSDDEAMD ARDENPAIAA QDDASSSRDD DEDSSAPVEA GGWFRWKSPW RVSGVGESIA
TATSSWMPNK SVDGDGSFVV NTDAIGVSTA SDGASSSSSD RGVEPTNLLE RVPLPPWIRQ
RVDGSSDGDS SSDDEEDDEM AYDTTISNEN VGSAIRDASE TAVSVTTAAV EKLNSALLGI
ETVEKSAVKD GRFENNASAL KALQESLRDA RASAKEAREA SQSLEEAVRE VSRAQEALRA
EGDLKKEEAI KALEAAKAAV IRQATSSNAA VADALRQVSY VATRVEKLGS WTLEDDGEAS
SRALDTSKAQ KRGVRRGLMG AIASARESAQ DALVASELTQ TVSRLASLAT QRGDGKSKKS
EELSEMVKPA LQRVVANSLV PVESDIEIGS ARLACSLAAW VYYLPQMQHA LPRNGLKLIT
SSLDVEEIIP STEFRAGKSL LEESAWRAEQ AVEEAVVAAR VAASKDAKLE IAASKAAAES
AELAAQALKL AAELRSQDAQ DESTRLKARK TARAAKALAV EAQKKLDEAS AIAERNKRKL
KKWAMQRELA DLQSQMQQTV IEQQRKVEAE RVRQERAENA SLPVNFCVAA QDDTATLWVV
VEGSTNFASW QANLTFQPVT FEDPALGVEV HRGAYTAAKT MYRRIEKAVK EHVAKHGARA
RVRITGHSIG GSIAMIIAMM LLVRNGAPRY AIADVWAFGA PYVMTGGEAL MTRLGLPRSF
IRMIMMGDDV VPRSFSCYYP QWARRVLDNA PGPFNVNTST ANFLDEQMFY TPMGDLYVLQ
ANNGSEHPLL PPGPGLYILD GDGVYEMLAT RARLGENEDG DDESWLNRRP SSKHWDEAAA
YDVDGDFSSS EDEKANKLRM HQLACLTQSD AALTASLIIS HIKQDELLPS EGGIAEVLNQ
RGRDAAQRVL FNSPHPLSIL SKPDAYGDAG IISRHHNPFQ YAKSLSLSRK RKPGVADLIS
SLPKKSIDGA PASGGPR