Gene OSTLU_18862 details


Gene Information

Locus tag: OSTLU_18862
Symbol:
ID: 5006479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009373
Strand:
Start bp: 45309
End bp: 47375
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table:
GC content: 65%
IMG OID: 640421900
Product: predicted protein
Protein accession: XP_001422371
Protein GI: 145356301
COG category:
COG ID:
TIGRFAM ID:
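
The listed gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Both are assumptions, not statements from the record, though they agree with the values listed.

```python
# Coordinates copied from the Gene Information table.
# Assumption: 1-based, inclusive coordinates.
start_bp = 45309
end_bp = 47375

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # codon count, including the stop codon
protein_length = codons - 1           # assumption: one terminal stop codon

print(gene_length, protein_length)    # prints: 2067 688
```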


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.000000072012
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCGCGCG ACGCGCTCGG AGAACGCTCG ACGCGGACGT TGACGGCGTA CGGACGTCCG 
TCGACGTCGC CGCTGGCGGA GAGCGCGAGA GAACTGGCGA GGACGACGAT GGCGGGACGA
CTTCGGGACG CGGCGACGAC GACGCGACGA GGACGGCGAG ACGACGAGGG CGAGGACGAG
TACGAGAGCG CGACGACGAC GAGCGAGACG ACGAGCGCGA CGGCGAGTTT GATTGGGTCG
TCGGTGATCG AATTCGGGCG CGAGCGGGAG GGCGCGAGGA GACTGCGGGG GAGGGATCGG
TTCGGAACGG CGGCGCGGCG CGAACGATTC GAAGAGTCGA AGCGAACGGC GAAGACGCGA
GGGACGTTTG ACGTGGCGGA GAACGCGAGA TTGGCGCTCG AGGCGAAGGA GGCGCAGTTG
CAGAAGATGG CGGCGCGATT GCGAGCGTTG GAGCGCGAGC GGGAGCGGGA GCGCGGGCGT
GAGATGGACG AGGAGCGGGA GACGTTTGCC GACGCGGCGA ACGATTTAGA GGGGGAGATG
GCGGCGGCGA TGGCGAGGTT ACGAGGCGAC GCTGGCGGCG CGACGACGTC GCGTCAGCAG
ACGACGAAGT CGTCGCGCGC GCGCGAGGAC GCGGGCACGT TTGCGTTCGC CCCGGATGCT
TTCGCGAGTC GCGACGGCGA CGACCCCGCG ACGCCCGAGC TGCCGCGCGG GCGAGGCAAA
GCGGAGGAAC ACGGCGATGA CGTCGAGATG ACGAAGACGC CGGAGATAAA GCCGATGCAC
CCACCGCGCG CGCCACCGTC AGAGCGCTCG GACGCCGAGA TTTCGGAGTC GAAGACGCCT
GAACCGATCA AACCAGGCGT ACGCGTGGAC GGCGGATACC TCACGCCGAG CGGTCGAGTC
GTCGGGAGCG CGTTGAAATC GGGTTCGTCG AAGCCGAAGA CGCCGAACGA ATCCAAGTCC
GTCGTATTCA CACCCGGAGA GATCGCGCAG TACTCGGACG CGGGAACGCC GTTCGACGCG
GCCGATGTAG AAGAGTTGCG CGCCGAGCTC CGCGAAGTCA GGGCGGCGCT CAAGAGCGCG
TTCAGCGTGA GTGAAAAACT CGTCGCGCAG TGCGAAAAGA AGAATAGGGA GATCAAGGAA
AAGAATACGA GCGTCATGCT CGCGCGCGAG GAGCTGGAGA ACGCGCGCGC AGCGCTCGAG
CGAGCGCAAA AAGAAGCGCT AGAGGCGAAG AAGTCCGTCG CCGGCGTCCC GGCGGGCGAT
CTCGCCGAGG CGTACGAAAA GAGCAAACGT TTGAATAGCG AACTCTTGCG CGTGATGCGC
GCGTCGATCG GGGTGAATGT CGAATTAAGT CCGGACGTTG TGTTCGCAAT TCTACCCACG
CTCGCGAGCT TAGCCGTGCG GGGAGACGTC GCCGAGGCCA TGGCGCAGGG CGGCGCCTTG
GACGCCCTCG CGGGAGCGTT GGATTTGTTC GCGCACGACC CCGCGGTTTG TCGCAAAGCC
ATCGCCGCGA TCGAATCTAT TTGTCGCGCC GGATCTCCCG ACGCTAGCGG GGCGACTCAA
GACGAAGAGA TCATCGCCAT TCGCTCGAGA TTGACGAACG AAGCCGTGGA CGCGTGCGGC
GCCGCCGTGG CGCGATGCGC GTACGATCAC GTCGCCGACG CGCGATTGAG CGAAGCCATT
TGCGCGCTCG CGCACGCGTT CGCCGAGATC GGCGATTATG ATCGCGTTCA GTTCTTGATT
GGACGTGAAT GCGCGCTGGT GCAAGCGATT TGCAAGATCT CTAAGACGCA CGAGCACGAC
GAGCGAACGC AGCGCGCGTC TTCGTTGGCG CTCGCCGCGT TCGCCGCGTG CGACGAGAAC
ACCAAGCGCG TCGTCGAAAG TCTCGGTGGC ATCGCGCTCA TCAAGCGGTG CGTGCAGGAA
CTCGGAATAG ATGACGCGCA GCGCGCGTTT CCGCACGTCA AGCGATGGAT TTCCGGGAAG
AAGACGCGCG AGGACAGAGA ACGCGGCGCC CGCGGTTTCG ACTCGGACGA CGGCGAAGCC
GACGTCGACG TCACGGGAAA TCTATGA
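
The gene sequence is printed in blocks of 10 bases, 60 per line, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on the first line of the sequence only, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "ATGTCGCGCG ACGCGCTCGG AGAACGCTCG ACGCGGACGT TGACGGCGTA CGGACGTCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block through `clean_sequence` and then `gc_fraction` should give a value comparable to the GC content listed in the Gene Information table.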
 
Protein sequence
MSRDALGERS TRTLTAYGRP STSPLAESAR ELARTTMAGR LRDAATTTRR GRRDDEGEDE 
YESATTTSET TSATASLIGS SVIEFGRERE GARRLRGRDR FGTAARRERF EESKRTAKTR
GTFDVAENAR LALEAKEAQL QKMAARLRAL ERERERERGR EMDEERETFA DAANDLEGEM
AAAMARLRGD AGGATTSRQQ TTKSSRARED AGTFAFAPDA FASRDGDDPA TPELPRGRGK
AEEHGDDVEM TKTPEIKPMH PPRAPPSERS DAEISESKTP EPIKPGVRVD GGYLTPSGRV
VGSALKSGSS KPKTPNESKS VVFTPGEIAQ YSDAGTPFDA ADVEELRAEL REVRAALKSA
FSVSEKLVAQ CEKKNREIKE KNTSVMLARE ELENARAALE RAQKEALEAK KSVAGVPAGD
LAEAYEKSKR LNSELLRVMR ASIGVNVELS PDVVFAILPT LASLAVRGDV AEAMAQGGAL
DALAGALDLF AHDPAVCRKA IAAIESICRA GSPDASGATQ DEEIIAIRSR LTNEAVDACG
AAVARCAYDH VADARLSEAI CALAHAFAEI GDYDRVQFLI GRECALVQAI CKISKTHEHD
ERTQRASSLA LAAFAACDEN TKRVVESLGG IALIKRCVQE LGIDDAQRAF PHVKRWISGK
KTREDRERGA RGFDSDDGEA DVDVTGNL
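
The protein sequence can be checked against the gene sequence by translating it codon by codon. The record leaves the translation table field empty, so this sketch assumes the standard genetic code (NCBI table 1). The `translate` helper is hypothetical, and the two fragments are the first 30 and last 27 bases of the gene sequence with spaces removed.

```python
# Standard genetic code (assumed; the record does not state a translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


head = "ATGTCGCGCGACGCGCTCGGAGAACGCTCG"   # first 30 bases of the gene
tail = "GACGTCGACGTCACGGGAAATCTATGA"      # last 27 bases, ending in TGA
print(translate(head))  # prints: MSRDALGERS
print(translate(tail))  # prints: DVDVTGNL
```

Both fragments translate to the matching ends of the protein sequence listed above.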