Gene OSTLU_16936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16936 
Symbol 
ID5004245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp778 
End bp3918 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table 
GC content56% 
IMG OID640419666 
Productpredicted protein 
Protein accessionXP_001420051 
Protein GI145351365 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAG ACATCCTGCG CGCGCACATC GCCTCGTTTT TTCAAAGTCA GGATGCGAGC 
GAACGTGCGA ACGCGGAAGC GGCGCTTTCG AGCTTCGGCA AGAGCGATGG ATCTTGGAGC
GTGCTATTGC GCGTGTTAGA GCGAGACGAT GCGACGGCGG TCGAAACACT GTTCTGCGCG
CGCACCCTGC ACGTGCTTTT GCGTCGCTGC GTCGCAAAGG AGGAGCGGAC GCAGGCGTCG
CACGCGGCGT TCACGGAACG CGATTGGATC GATCTTCGCT CGCGCGTGTT GAAGCTGACG
ATGCTCTTTG CCGTGAATTC CTCGTCGTTC GCGCACGACG AGTCGAACGC GTCGCGTGCG
GTCGACTTGA GAAGCACGCT GACGCAACTC GCGCTAGCCA CGTCGGCGTT GGCGTGTAAA
ATGCCGACAT GGGATCCCAC AGCGGTCGTG CGAGACGTCA TCAAGGTGTT TCAGGAAGAC
GCTCGCGTGT CGAATGAAGC CAAGTTGTTG TGTTTGTGCA CATTTTTGGC GTTCGTGCCG
CAGGAAGCGA GTTCCCGAGA GTTGTCCATA CATCCGGCGC GCCGCGAGCA AGTATTGACT
GGTTTGCGTA GCACCGCGAA CGACGTCATG GACTTGCTCC AGCAGCTCGC GACGTCAGCC
AGTGGCGACA CGCTGTTACA CAAGTACATA CTCGATGCTC TCGCGGCGTG GGCGGACATC
GCAAACGTCA CACCGAGATT TCCTCGCGTC ATACTTGAAG GCGCGCTACA CATCGTGTGC
TCGGAAGATC ACCACGCAAA CATCAAACAA AGCGCCGCGA GCGCGGCGTG TGCGTCACTG
GTGCAGTGCG TTTGGACGAG TGACACTGAG CTTCGTGCGT TGCTTGCGAC GAGTCTAGCA
AAGTTGCGCG CTGAAGTCGT CAAAGCTGAG AGATCGGAGG AGAGTCGCGC GCTGATCGTG
AACGTACTTT CGAGCGTAGC GATGAAGGCT TTGCGAGACC AGAAAGACGC GACCAAAAGT
CCATTTGCGA CAGGACCAGA TGCGGCTGGC GATCGCACGT ATGTCAAGTA CGCCGAATTC
AAAAGTTTGC AAAGACAACA AAAGAAGACG CAGCGATCGG AGCAGAAGCA GAAGACAAAT
ATCGCCGTGG ATATCGACAC AGAAGTGTTG CTTTTCGCAC TCGATGGCTT ATCCGAGGCG
CTTTCTGTCG GTGCCTCCAT GGCGTCGGCG CTGGAACCTT GGGGTAAGCT GGCGAAATCA
TTCACGCCGG ATTCGTTTGT GGAGTTGCTT CGCCCGGTGG CGGAGCGATG CGTTCACGCC
GCAGTGCTGT ACGTTCAACT CTTGCCCAAG CACGATCTGG ACGACGACCA AGTGAAGGAA
GAAATTTCTG ATTGTCTTCG CGACGTCATT TCAGCCGTGC CGATTGAAGA AATACTCGGC
GACTTCAATC AGCGCCTCTG CGCGGAGATG TCCGCCGCAC AGAGCGGTGG ATGGAGAACG
CTAAACGCGC GTCTGTACGT ATTGCTCTCA CTAGCGAAAT CCTTCCGAGC TGAAGCTAAT
CAGTCGTCTT TTGCGATCTT GATTGAAAAT TTGTGCACTT TATCGACGAG TGAAGTTGTT
CCGAAAGCGA CTTTGGAATC CACTTGTTGG GTTCTGGCGG GTGTCGCCAA GTGCATTTCG
CAGCTTGAAG ACAACATTCT TCTTGGCGTT TCGCACGCGC TGATTCGTTC GATGAGCCAT
TCAGAATTTG TTGTCGCACG AGGTGCCGCT GTGGCGATGA TGAAGCTCTC TGAATTTGCA
GCTTCGCGAC TTGGCGCCAC GGACGTACCG TCTCTTTTAG CCGAGCTTCA CGTTCGCGGT
GGGCCGACGC CGTCGCCGAC TTTGCGTCTG GGTCAAGAAC ACGAATCAAC CGTTTTGCTT
CGCGCTCTTA CGTTCTATGT GAAGTGCGAG TGTCGAGAGC AGACAGAAAG TGCTTGCGCG
TCGCTCGCCG AGCCCGTCAT CGAGGCGATG AATGTTTCCC TTCACCGCGG CAGCTCGGAA
GAATATGTTC GTCGTTTGGT TGATTTGGAT ATCGTGCTTC GAGCGATGAA GAGCGCGTAT
GAACACATTC AATCGCCCAG TGAGGTGCTC GCCGGGTTGG CAACGCGCGC CGCGATTGCG
GTTGAGCAGA CGAGCTTGCG AATTGTCGAT CATCGAATGG TCGAGGAGCG CTTTAAAGTC
GCATGGGCGA TGAAGGCTCT CGTGGAGCTG GCGCGCTTCG TCGATGGGCT CTTGGGAGCT
GTCGTTCGAA TTTCAGTCGA GGCGTACATG CGAGCACCGG GTCTCGGCGC GTGTTACCTG
GATGCGTTGT CTGTTATGCT GGAGTTCTAT GGCGATAGCC GATGCGGAAT TGAGATTGGA
GGGACGAAAT TTCAATCCGT CGGTCACGTT GTCGTCGAGC TCTTGGCGAC CGTCTTACCT
GCATCTCTCG AAGATTGCGA AGGGTGGACG AGCGCGTTCA CGCTCGCGCG CGCGACACTG
CGCACTGCGT GTGTCGCAAT AGTTCCGCAT CTTCGTATGA TGGTCGAAGT CAGTCAGGCG
TCGCTGCGCG GAGTCTCCGA CGAACCCGCG GCGGCGGCTT TGTTATTCGC GACCGACTTG
CTTCGAGCGC CGGTGATGTT GAGTGCCGAG TTGGCGTCCA CTGCGAAGAA TGCTTCTGCG
AGCGCCATGC ACGCCGGTCT CGGGCGTCTA GCGGCGGAGA TCTCGGGTCA AAGTAACAAA
AAGCGAGGGA CGTGCGAGTG GAATCGACGT TCAGCCAAGC CGGTCGCGGA CGCCGTCGCC
GCGGCGATGG AATCTCACAG CGTCGTAATC GTGCGCAAAA TCCTCGAAGC AGCAAACGGC
GAGATGCCAC CGAGTATGAT ATCCGATATT TCTGCGGCTT TGCACCTCGT TTGGAGCACG
TATGGCACCG AGAGATTTCA AGGCGTCATG TTGGCAGCGC TCGGCGGCGA AGACGACGCC
TTCCCGAAGC CCAAAACAAA GCTATCCGAC AAGCGCGAGT GGGTCGCGTT TTTGACGAAC
GAGACCTGCG CAAACGATTG TCGAGTATTT AAACGATTCT TAAAATCGTT CTTGGGAGGG
AAAAAAGTAG GCAAAAACTA A
 
Protein sequence
MDEDILRAHI ASFFQSQDAS ERANAEAALS SFGKSDGSWS VLLRVLERDD ATAVETLFCA 
RTLHVLLRRC VAKEERTQAS HAAFTERDWI DLRSRVLKLT MLFAVNSSSF AHDESNASRA
VDLRSTLTQL ALATSALACK MPTWDPTAVV RDVIKVFQED ARVSNEAKLL CLCTFLAFVP
QEASSRELSI HPARREQVLT GLRSTANDVM DLLQQLATSA SGDTLLHKYI LDALAAWADI
ANVTPRFPRV ILEGALHIVC SEDHHANIKQ SAASAACASL VQCVWTSDTE LRALLATSLA
KLRAEVVKAE RSEESRALIV NVLSSVAMKA LRDQKDATKS PFATGPDAAG DRTYVKYAEF
KSLQRQQKKT QRSEQKQKTN IAVDIDTEVL LFALDGLSEA LSVGASMASA LEPWGKLAKS
FTPDSFVELL RPVAERCVHA AVLYVQLLPK HDLDDDQVKE EISDCLRDVI SAVPIEEILG
DFNQRLCAEM SAAQSGGWRT LNARLYVLLS LAKSFRAEAN QSSFAILIEN LCTLSTSEVV
PKATLESTCW VLAGVAKCIS QLEDNILLGV SHALIRSMSH SEFVVARGAA VAMMKLSEFA
ASRLGATDVP SLLAELHVRG GPTPSPTLRL GQEHESTVLL RALTFYVKCE CREQTESACA
SLAEPVIEAM NVSLHRGSSE EYVRRLVDLD IVLRAMKSAY EHIQSPSEVL AGLATRAAIA
VEQTSLRIVD HRMVEERFKV AWAMKALVEL ARFVDGLLGA VVRISVEAYM RAPGLGACYL
DALSVMLEFY GDSRCGIEIG GTKFQSVGHV VVELLATVLP ASLEDCEGWT SAFTLARATL
RTACVAIVPH LRMMVEVSQA SLRGVSDEPA AAALLFATDL LRAPVMLSAE LASTAKNASA
SAMHAGLGRL AAEISGQSNK KRGTCEWNRR SAKPVADAVA AAMESHSVVI VRKILEAANG
EMPPSMISDI SAALHLVWST YGTERFQGVM LAALGGEDDA FPKPKTKLSD KREWVAFLTN
ETCANDCRVF KRFLKSFLGG KKVGKN