Gene OSTLU_40850 details

Gene Information

Locus tag: OSTLU_40850
Symbol:
ID: 5002462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 157930
End bp: 161511
Gene Length: 3582 bp
Protein Length: 1193 aa
Translation table:
GC content: 56%
IMG OID: 640417883
Product: predicted protein
Protein accession: XP_001418393
Protein GI: 145347890
COG category:
COG ID:
TIGRFAM ID:
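
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 157930
END_BP = 161511


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3582
print(protein_length(length_bp))  # 1193
```

Under these assumptions both results agree with the listed gene length (3582 bp) and protein length (1193 aa).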


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.12081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00463606
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGACGCGG TGGACGCGTC GCGCGCGCGC GACGGCGACG ACGCGCGCGA CGGGCGCGCG 
AACGCGCGCG CGAGCGCGAA TGCGAACGCG AAAGCGAATG CGACCGCGGG GACGTCGCGC
ACGGCGACGC GGAGCGCGTG CGCGTTCGTC GCGCGCGCGC AGGCGCTCGC GGCGGAACTG
CTGCGATTGG GAGAGAACGC GCCGAGGGCG CTGTCGCGCG AGGACTCGAA GTTCGCGAGG
GTGTTGTTTG ATTACGAATA CTTCGACGCG CGGGAACGCG CGGAGGACGC GGCGGCGAGG
GATGGATCGA CGTATGAACT CGACGAGGAG GTGCGCGGGA GCTATGGACA CGTGTTTGAA
CGGTATTGGA ACGCGTTCGA CGCGGTCGTG CGGTGGCATC AGGATTTCGT GCGATTCGCG
GAGGACGTGG CGGAGGGGAC GTACGTGAGC GAGACGTGGG AAAAGATTCT GAGCGACGAG
GACGGGAGGC AGTACGTGTG CGAAGCGATC GCGATGTTCG GGGTTATTTT GAAGATATTA
GATGCAAAGT TTGATTGGCG GTTTCGCGAG CGCATGGTGG TGGCGTATTA TCGCATGAGA
GGGGCGAGCG AGGACATCGC GAACGCGGAC GAAATCGTGA CGCTCTGCGC GCGGACGGGA
TACGACGCCG CGCGGAAAAC GCGACCGGAG GGGTACCCGG AGCTATATTT CGCGAGATTC
GAAATGCCAG AGTGGCTTAT ATATATGGTG ATCGGGCGAT TGCGAACGGA TGACGTGTAT
AATCACGCTC CGCACTACCC GAATCCGGAT CATCGCTCCA CCGCGCTCGC CGCGCAGGGA
GGATTGTTGT ACATTATATT GTATTGGGCG CCCAAGATCT TAAATAGAGG GACGTCGGCG
ATGAGAGAAA TCGTAGACAG ACACTACGCC GATAACTGGG TCGTCGCCTA CGGCGCTGGG
TTGACAGTGG ATTTGTTGAC GGAATGGGAA CCGTACGAAG CGGCGTCGAC GGCCATGCGT
AACGCGGTGA CGCCCAAGGC GGCGAGGGAA TTGATCGATA ACGCGTCAAC GCGCGTCGGT
GAATTGAAAA CGGCGTTCAA AACATACCTC ACCGAAGGTG TGCTCACCGA GGAATTCGTT
TTGAGCAATG AAAAAGTGCT CATGAATGTC GTTCGCGACG CCAACGTCGT GGCGCGTTTC
GTACTTCTAC ACAATTTGAC CACCCACAAG TCAGTATCGT CCCTCGTTTC TTACATGCCG
AGCAAAGAAA ACATAATAGA TTTGCTCTTA GATTGCGCCG AATTGGAGAA CGAGTTGAAG
AAGATTTATA CCTCGCTCTT GAGCGGCAAG CACGAGCTGT GGGAAAAGTG CAAACACGAA
GCCGGAGAAC GAATGAAGGA GCTTAGCGCG TATTTCGGTG GTACCGCTGG ATTGAGCAGG
AACGCGAAAG ACGAAAACTT GCGGCTGTGG TTTGCGAACT TGTCTGTCGA AGTCGAACGA
CTTTCGTACG ACGATGCGGT GGCCGCCGGT AGAACCATTC AGGAGCTCGA GACCGCCTTG
ACGGAAGTCG AGCAATTTCA TCAAATCGTC GACAACATCC ACGCCAAGCA GTACTTGCTC
GATTCCCGGG GATATTTGGG TAAGATGATG ATGACTTCAA ACGTCGCAGA TTCCGCGCTG
AACACGCTGA CGATTGTGTC CGACGCGGCG TATGCGTGGC GAGTGCTCGA TCCGTACACG
GAACAGCTGC AACAACGCAT TCGTAAAGAT CCATTTGCGG TGCGAAAACT ACGGTTCACA
TTTCTTAAGC TGAAAAGCAT TTTGGAAATG CCGTTATTGC GCATTTCGCA AATGGAGTCG
CCGGACATTT ACAGCGTGAG TGAATATTAT TCTTCCCAGC TCGTGAGTTA CGTGCGAAGC
GTTGTGGAAG TGGTGCCGAT TAGTATGTTT GAGATTTTGA ACGAAATCAT CGGCGTTCAA
ACGAACGCGC TGAAGGAACT GCCGACCAAG CTCGAAAAAA CCGCGCTGAG CGATTACGCG
CAACCTGCTG AAAGAGCACA GCTTTCCAAG GCGACGTACG AAATCTCCAT CTTTGCGCAA
GGCATTTTAG CCATGGAGCG GACATTCATG GGGGTCATCG AACTCGACCC TAAGCAGTTG
TTGGAGGAAG GCATACGCAA ACAGCTCGTA AAACAAATCA CAGAAACGTT TCACACAGCT
CTCGTCTTTG GCGATGGCGC TAAAGACGCG CTAGGCTGGA ATAATTTCGT GGCGGCTATG
ATGAAAACGA ATCCATTTGA AGACCGTCTC AACTCGTTGG CGAATCGGAT CGAAGGCTTC
AGACGCTCGT TCGAGTACAT TCAGGATTAC GTGAATATTT ATGGGTTGCA AGTGTGGCAA
GAAGAGACCA ACCGAGTGGT GAGCTATCAC GTCGAACAAG AGTGCAATAG TTTCATCAAA
CGCAAGCAAG TCAATGATTG GGAGAGCGAA TTTCAATCCG TGGCGATACC CATCCCAGAT
TACCCAGCGC TCGATGGAGA GTCAAAGAAT TTCATGGGAC GACTGTTGCG CGAACTGATG
AGACAAACCG ATCCCAAGAC GACTCGATAC GTCTCGCCAC ACAGCGGATG GTTCGACACT
GAAGGCTCGG AAGTGGTAGG CATCCGAACA TTTTCGCTGC TGACGTCTGC CGTCGGGAAC
GTGGGTTTGG CCGGTCTTGA TCGTCTTCTG AGCTTCATGG TGACGCAAAA ATTGCAGATG
TGCATCGAAT CGTACTCCGA ACGCCTCAGA GGCGATCTTG GGGCGACGAT TCGCGCGCTC
GATAACGCGT TGCGACCGCT AGGCTCCGTT CCTGAAGGTT CGATTGAGGC GTACGAGCAA
GCCATCAGGG CGTCATCTTC CGCGTGGGAC GACATGCTCG CCGCCTTCGC AACGATTGGA
CAAGCGCAGT TGTTGCGTCG GCAGCTGAAC GCCGAGCTCG TGGCAAACAT TCGCATCGAT
TCGCACACGT TGAGTCGCGC GCTGGACACG GCGAACAAGG CGATTTTGAC GGATATTCGA
TCGCACTACA AGTCTCCAGA CACCGTGCCT TATCCCGACG AGGCCAACGC GATCGTGCCC
AAGCTCAGCG CCTACCTCTC CGCGAGTGGC ATGCAAAACC CACTGCGACA GATCTACTGC
ACCGTCGCCG CCGTGGACGA CGACTGGGGA TTGGCCGCGT TCGTCTTCAC ACTGACTCAG
CTCGAGCTCT ACCGCTTCGA CGACGTCGCG TCGACGCTCG TCCCGATCAA CCAGTCGGTG
ACGAAACTCG ACGCACACGT CCTCATTCTC GCCGTCTCCA CCACCCTACG CCAGTTCCAC
GCCGACCAAA CGACGTCGTA TCTCTCTCAT CTCGGCGCGT TCGCGCGCGC CGAAATCTCG
GCTCGACGAC CGTCTTCGTC GTCGTCCCCC GACGTCTTCG CCCCCCGCGC TCGCGCCTCC
ATCGCCTGGG CGCGCGCCTT CGCCACCGCC CACGACGTCA ACCTCACCGT CTTGGCCTCT
TTCTTCCCTC CCTTCGTCTT CTCCCACGCT TTCGTCGCCT AG
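
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the listed GC content. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGACGCGG TGGACGCGTC GCGCGCGCGC GACGGCGACG ACGCGCGCGA CGGGCGCGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the complete gene sequence to `gc_fraction` gives the value to compare with the GC content field; a single line is not representative of the whole gene.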
 
Protein sequence
MDAVDASRAR DGDDARDGRA NARASANANA KANATAGTSR TATRSACAFV ARAQALAAEL 
LRLGENAPRA LSREDSKFAR VLFDYEYFDA RERAEDAAAR DGSTYELDEE VRGSYGHVFE
RYWNAFDAVV RWHQDFVRFA EDVAEGTYVS ETWEKILSDE DGRQYVCEAI AMFGVILKIL
DAKFDWRFRE RMVVAYYRMR GASEDIANAD EIVTLCARTG YDAARKTRPE GYPELYFARF
EMPEWLIYMV IGRLRTDDVY NHAPHYPNPD HRSTALAAQG GLLYIILYWA PKILNRGTSA
MREIVDRHYA DNWVVAYGAG LTVDLLTEWE PYEAASTAMR NAVTPKAARE LIDNASTRVG
ELKTAFKTYL TEGVLTEEFV LSNEKVLMNV VRDANVVARF VLLHNLTTHK SVSSLVSYMP
SKENIIDLLL DCAELENELK KIYTSLLSGK HELWEKCKHE AGERMKELSA YFGGTAGLSR
NAKDENLRLW FANLSVEVER LSYDDAVAAG RTIQELETAL TEVEQFHQIV DNIHAKQYLL
DSRGYLGKMM MTSNVADSAL NTLTIVSDAA YAWRVLDPYT EQLQQRIRKD PFAVRKLRFT
FLKLKSILEM PLLRISQMES PDIYSVSEYY SSQLVSYVRS VVEVVPISMF EILNEIIGVQ
TNALKELPTK LEKTALSDYA QPAERAQLSK ATYEISIFAQ GILAMERTFM GVIELDPKQL
LEEGIRKQLV KQITETFHTA LVFGDGAKDA LGWNNFVAAM MKTNPFEDRL NSLANRIEGF
RRSFEYIQDY VNIYGLQVWQ EETNRVVSYH VEQECNSFIK RKQVNDWESE FQSVAIPIPD
YPALDGESKN FMGRLLRELM RQTDPKTTRY VSPHSGWFDT EGSEVVGIRT FSLLTSAVGN
VGLAGLDRLL SFMVTQKLQM CIESYSERLR GDLGATIRAL DNALRPLGSV PEGSIEAYEQ
AIRASSSAWD DMLAAFATIG QAQLLRRQLN AELVANIRID SHTLSRALDT ANKAILTDIR
SHYKSPDTVP YPDEANAIVP KLSAYLSASG MQNPLRQIYC TVAAVDDDWG LAAFVFTLTQ
LELYRFDDVA STLVPINQSV TKLDAHVLIL AVSTTLRQFH ADQTTSYLSH LGAFARAEIS
ARRPSSSSSP DVFAPRARAS IAWARAFATA HDVNLTVLAS FFPPFVFSHA FVA
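
The protein sequence can be checked against the gene sequence by translating codon by codon. The record leaves the translation table field empty, so this sketch assumes the standard genetic code; `translate` is an illustrative helper, not a function from any library.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not state the
# translation table). Codons are enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGGACGCGG TGGACGCGTC GCGCGCGCGC"))  # MDAVDASRAR
```

The output matches the first block of the protein sequence above, which is consistent with the standard-code assumption for at least these codons.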