Gene OSTLU_27868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_27868 
Symbol 
ID5005740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp346903 
End bp349977 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table 
GC content63% 
IMG OID640421161 
Productpredicted protein 
Protein accessionXP_001421631 
Protein GI145354732 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.238399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0610065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCC CCACGCTCGA CGCCGCGCTT CGTCGCAACG CACGCGAGCG CCCGAGCGCC 
ATCGCGTGCG TCCTCGTCCC GAACGCGCGC GAGAAGAACC GAAAGTGCTT CTTCACTTGG
GACGAGTTAC ACGGCGTCGT CGACGCCCTC GCGCGCGTCT TGCGCGCCGA CCGCCATCCG
CGCGTTCTGA GCTTTTGTCG CGACGGCGTC GAGGCGTTCG TGGCGCTGCA CGCGTGCGCG
CGGGCGCGGA AGAGTTTGGT GAACGTGGAC GCGGCGACGC CGGCGAGCAG GATTCGGGAC
GTCGCGCGGG ACGCGAACGC GACGACGTGT CTGTGCGCGA ACGACGGCGA TGAAGACTCG
GTAAAATCGG TGTTAGGAGA AGACGGTGTC GCGGTGCGCG TGTGGGAGGT GATTAATGCG
GGCGCGGAGG CGATCGAGGC GGAAGAGGAC GAGACGCGAG AGGACGACGA AATTTGGATT
GCGTACACGA GCGGGACGAC GGGAAAATGC AAGGGCGCCG CGGCGACGCA TCGACGCGCG
ATGGCGTACG CACGGGCGAA GTGTGAGGTG GAAGAGATCG ACGCGTGCTC GCGGGTGTGC
TTGGCGTCGA ACGCGACGTT CGATTTGTGG CACGGCGACG CGTGCGCGGC GGCGGCGGCG
GGCGCCGCCG TCGTCGTCGC GTCTCGCGCG ACTTTTCAAC ACGATTTCGC GAGAGTGTTG
CGAGAGGGCG AAGTCACGCA CGTGTGTTGC ACGCCGACGA TGTTCTCGCT GGCGCGACTT
CCGCGTGGGA TCGTGGACGC ACCGAAGTTG CGATGCGTAT CGCTGGCGGG GGAGCGCATG
CGCGAGAGCA CAATCGCACT ATGGGCCGAC GACGTGGCGT TGTACAACGT GTACGGTGCC
ACGGAAACGA CTGTGGTGCA AACGTACGCA AGAATGCGCA ATGCAGAAGA CGAAAGCACG
AATCGGGCGG GGAAAGCGTA CGAAGGATTT GCCAGCGTGT TCGTCGTGGA TGAAATGCGC
GAAGGCGAAT TGGCGGTGCA ATCCTTCGCG CAGAGACGCG GCGTCGAAGG CGAAGTTGCC
ATCGGCGGTA TTTGCGTCGC AGACGGGTAT TTGAACGACC CCGAACGCAC GGCACGTGCG
TTCGTTCACT CACCATTCGG TACCGTCTAT CTCACGGGCG ATCGCGGGTA CATCGATGAC
GCAGGCGATT TGTATTTGCG CGGTAGACTC GATCGCCAAG TCAAGATTCG CGGTCATCGC
GTCGAGTTGG ATGAAATCGA AGCCGCGCTT CGTTCGTGCC CCGCACTATG CGACGACGCC
GTCGTGTTCT ACGATGCCAA TGAGGGAGTG CTGACTTCCC ACGCACGAGC AGCTTCGCCG
ACGTTCGACG CGTCGAGCGA TGCAGATTTA TACGCGTTCG CGCTCGAACG CGTCGTCGAG
CTACGATTGC CGCGACACAT GGTGCCTCGT CGACACGTCT TCATCGAGCA CGCGCGATGG
CCTTTGACTT CGAGTGGAAA AACGAATCGT CAGATACTGC TTCAATGGTT TGCGAGCGGC
GAGACGAAGG CGTTTCGACC TAAGCGTGAG AAACCGCAAG CGGGATTGGA AACAGTCGTC
GCGAACGCTT GGGCTGTCGC GCTGGGGCAC GGCGACGCGA GCGATATCGG CGCCCTCGAC
GCGTTCGATG CGCTCGGTGG AACCTCACTC AACGTTCTCG CAGTTTCAAA AGCGCTGGCG
AGCGACGACG GAATCGTGCG CTCGCGCGAT ACCGCGCGTG GCGAAATTGT GCGCGAAGCG
CGAATTGGTT TACGCGACGG CGAAGCTGCG GCGCTGCTCA ACGACGGCGA AGAACCCGCG
GCGTGCGCGT TCGACGTCAT CGACGGCCCG TTCGCACCGT GCGAAATATT CGCGAGACCC
GTGCTTCGCG ATTACGTTGA TTATTTGCGA GCGCAAGGTA TAGACGCCGA CGTGAACGCT
CCCGCGTCCA CCGCCGTCGA GCCACAGCGA GATACATCTG CGAATCGTCG CATGCTCGCG
GCGTGCGGTC GCGGCGTCGC CCCCGTCGTT CGCGCTTTGC TCGCGACCGG CGTCGAAGCA
TCGGGCGAGC ACCTTCGAGC CGCCGCCGCG AGCCTCGCCG ACGAAGCCGT GGACGTAGTG
CAAACTTTAC TCGCCGCCGG TGCGGATGCA AGTTCATCAT CCGCCACAGC CGGTACGCTC
GCGACGCACG TCGCCGCCGC TCGCGGGAAC TCTGAAATGT TAGAAATGTT GCTCGCCGCC
GGCGCTCCAG CGGGAGCGAA GGACGCGGAT AAGCAAACGA TTTGTCACTT AGCAGTGCGG
AGCGGGGATG TGAGTACCGT GCGCGTCGCG GCAAACGCGT GTAAACACTT GAAGACACGC
AAAGGTGGCT TGGAGTCGTG GGATCGTTGG AAGCGCACGC CCGCAGCGTG GGCGCTCGTC
GCGGGATCGA GTGAAATTTT AGCCGAGTTG CGAGATGCCG GGGCGAATTT AACATCGCTT
GAAACTGACG TCTCCAACGC CTGGGTGCAC GGTTCGCTGA GCGCAAAGAG CGAGGTGCAG
TTGGCGCATC GCCCGGTTCG CAAGCGAGGT GCCGCGGCGG AGGTGTTGAG CGCGCTCGCG
GCTAGATTGG ACGCCGCAAA CGTCGAGAGC GAACGTATTG AAGCGGCGAC GGCGATTCGC
GAGCTCGTGT GCGCAAACGC AGAGAATCGA GAAAAGGCGC GAACGATCGG TTTGGTGCCA
AAACTGTGCG AACTCGCGCG CGATCGTCAC TGCGTTGAAG CCATCGGCGC GTTGAGAAAT
TTAGCCACCA ACACGAGCGC GGCAGGGGCG GCTGGCGACG CCGGAGCTAT GGAAATCCTC
GGTGATGTCA TTCGCGCTCG TGCTAAATTA AAAGACAGCG AGCTGAGTGC TGACGATAGG
CGAGTAGTGT ACGCTGCGGC GAGCGCGATG CGCGCGTTGG CGGTGAAGCA CGACGCGAAT
GCCGCGCGAT TGCGCGCGTC GAGAGACGTC GCGGACGTCG TCAGTCGATT GTGCGAAGGC
ATGCGATTAG AGTAG
 
Protein sequence
MSPPTLDAAL RRNARERPSA IACVLVPNAR EKNRKCFFTW DELHGVVDAL ARVLRADRHP 
RVLSFCRDGV EAFVALHACA RARKSLVNVD AATPASRIRD VARDANATTC LCANDGDEDS
VKSVLGEDGV AVRVWEVINA GAEAIEAEED ETREDDEIWI AYTSGTTGKC KGAAATHRRA
MAYARAKCEV EEIDACSRVC LASNATFDLW HGDACAAAAA GAAVVVASRA TFQHDFARVL
REGEVTHVCC TPTMFSLARL PRGIVDAPKL RCVSLAGERM RESTIALWAD DVALYNVYGA
TETTVVQTYA RMRNAEDEST NRAGKAYEGF ASVFVVDEMR EGELAVQSFA QRRGVEGEVA
IGGICVADGY LNDPERTARA FVHSPFGTVY LTGDRGYIDD AGDLYLRGRL DRQVKIRGHR
VELDEIEAAL RSCPALCDDA VVFYDANEGV LTSHARAASP TFDASSDADL YAFALERVVE
LRLPRHMVPR RHVFIEHARW PLTSSGKTNR QILLQWFASG ETKAFRPKRE KPQAGLETVV
ANAWAVALGH GDASDIGALD AFDALGGTSL NVLAVSKALA SDDGIVRSRD TARGEIVREA
RIGLRDGEAA ALLNDGEEPA ACAFDVIDGP FAPCEIFARP VLRDYVDYLR AQGIDADVNA
PASTAVEPQR DTSANRRMLA ACGRGVAPVV RALLATGVEA SGEHLRAAAA SLADEAVDVV
QTLLAAGADA SSSSATAGTL ATHVAAARGN SEMLEMLLAA GAPAGAKDAD KQTICHLAVR
SGDVSTVRVA ANACKHLKTR KGGLESWDRW KRTPAAWALV AGSSEILAEL RDAGANLTSL
ETDVSNAWVH GSLSAKSEVQ LAHRPVRKRG AAAEVLSALA ARLDAANVES ERIEAATAIR
ELVCANAENR EKARTIGLVP KLCELARDRH CVEAIGALRN LATNTSAAGA AGDAGAMEIL
GDVIRARAKL KDSELSADDR RVVYAAASAM RALAVKHDAN AARLRASRDV ADVVSRLCEG
MRLE