Gene OSTLU_32955 details

Gene Information

Locus tag: OSTLU_32955
Symbol:
ID: 5003346
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 135887
End bp: 139074
Gene Length: 3188 bp
Protein Length: 1049 aa
Translation table:
GC content: 59%
IMG OID: 640418767
Product: predicted protein
Protein accession: XP_001419080
Protein GI: 145349311
COG category:
COG ID:
TIGRFAM ID:
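
The gene length listed above can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp copied from the Gene Information table.
length = gene_length(135887, 139074)
print(length)  # 3188
```

Under that assumption the result agrees with the reported Gene Length of 3188 bp.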


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000297669
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.336011
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGTT GGGGACGTCG CTCGTGAGCG TGAGCGCGGA TGGGGAGGTT TATATATCGA 
GGGCGAGCGC GACGACGCGG AGCGACGGCG AGTTTGAGGG AACGCTGGAG ATCGAACGCG
TCGTGACGTT GGACGACGGC GAGCGGGCGA CGTGTTGCGC GAGCGATGAC GATGAAATCG
TGGTCGGGAC GGATAGGGGG CGGGTGGTGT TTTTGTGCTG GGAGGAGGGG TCGGAGGGGC
GAAGCGCGAC GTGCGCGCGG AGCGAACGAG ACGGCGGCGC GACGAGCGTG GCGTGGTGCC
GCGATACCGG TACGGTGGTC GTGCGGTTCG ATGCGGGAGC GGTGTGCGCG CTGGAGATTG
ATGCGGCGGG CGACGTGCGG CGAAGGAACT GGTTCGATGC GTTCCCGTGT CCGGCGACGT
GCGCCGCGTT TCATCGAGGC TCGCGGCGGC TCGCGCTCGG CACAGCCGAC GGCGAAATCC
GCGTGTACGA CGACGCGATG ACGTCGGAGG CCGCGAAACC GCGACATGTT TTCGGGTTAA
GTCCGTGGGG CTTCGGCGTG GAAGACACTG GCGCGTTAGC GCACGCTTCT TGGTCGAACG
ACGGGCGAGC GCTCGCCGTC GGTTGGCGTC GGCGCGGCGT GAGCGTGTGG AGCGAATCGG
GATGTTTGCT CATGTGTACT TTACATCACG GCGGCGGCGA TAGCGCGGTG GGAACATCGT
CTTCGCCGCG CGCGACGTTC ACGGGTGACG AGGAAGTGCC AGAGATGGGC GCTTGTCTGG
CCCCGCCCGC GTGGGGCGTC GGCGACTACG CGTTGTTCGT CCCGGTGCGG TCCGAGTCTG
GTTCGAAAGT TTTAGAGTAC GCGCTCGCGA AGAGCGTGTC GAACAGTCGC GTCGCCCCTC
GATCATACGA TGGCGGTTGC GATCACGATG ATGCATCACT GCTTCTTGGT GACGATCGTA
TTTTCATCGT CGCCTCGAGC GCGACGAGCA CCAGAGTTCA CGCGCGACAG GAAGTGTGTC
CGAGCGAGTA CGTCCAGCGT CAGTGGCCCA TGTGCGTGGC GGGAATGAGC CCGAGCGGCG
ATCGAGTCGC CGTCGGAGGC GTGCGAGGAT GCGTCGTGTT TGACACGCGC GGTGAATGCT
GGAGTCAGTT AGGAGACGTG GAAGAGGAGA ACTCCTTCGA GGCAATCGCG TTCGATTGGA
TGCAACCGGC ACCGCCGACA TCGGGGCGGC AGCGTAGCGT GTTGCAGCCG GTACTCGCCA
TCGTCGCTCG ATTAGGTAGA ACGAGGATGT TTACAAAGTC TATCAAGTTG TCGTACGGCA
TTAGCTTTTA CGCCGACGGC GGAAAGGGTG ACCTTTTGAT GACGATGCCG CTACCATCCG
AGGCGACGGG CGCATACGCG TGCGGAGAAT TCTTACTCGT GAGCTTCTCG AATGGCGAAA
TAGCAGTGTA CGAGGTGGAA GAAATGTCAT CGAACGTGGG AGCGATTTCC GCACACCACG
TTCGCGAGGA TGCGGGACAG CGACGGAAAA CAACCTTAAA CGCGGGCGGG CGCGTGCAAG
GAATGTGCGC GGTGCCGCCG GCGGCGGCGC CCGAGCGAGC GCCGAGCGAG TGCGTAGTTT
TGACAGAAGC TGGCGAACTC TTCGTCGTAG ACTTGACGGA TGAGTACGAT CAGGTGAAGC
TCTTTGATGA CGTCGCGGAG TTCTGGGTCG TTGGAAGCGC GAACGCACCG CAAGAAATGA
TGATGCAAGG GGACGAAAGC AGCGACTTTG AGAGCGACGC GAGAGATTCG CTTTCTGTGG
ACGGGGGATG CGTATTTGCC TATGGTGCCG AAGGGATGCG CATATGTTAC TTTCCGAATG
GCGACTTACG ACAAATCCTC ATCAACGGCG CAACGTCGTG TGACGTTGAG CGCGCGGCGA
ACAATCCAGA ACTCGAGTTT GATCGAGAAT TGTATCCGAT GAGCGTGAGC CTGAATATGA
ATCGCATCAT CGGAGTGACG CAAAAGTTTT CCTTCGCAGA TGCGGTAGAC ATGCCATACT
TCACAATCGC ACCCAAATCG CACACAATTG TGCCTTACAT TTTGCGCAAG CTTTTGAGTT
CTGGCCAGCA CGACGCCGCG TTGCGATATG CGCGTGCCGC TCGACGACAG ACGCCGCACT
TCATGCATGC GCTCGAATGG TTGCTCTTCA CAGCGCTGGA ACGCTCGAAT CGCGAAATCA
CGTCACAAAC AGTACTCAAG CAATCGATTG CACTGCTGTC GGAGTTGCCA AATTATCTTG
ACGTCATCGT GAGCGTGGCG CGGAAGACAG ACAACACGCG ATGGGAATCG CTCTTCAAAT
ATGCCGGTAA ACCAAGCGAG CTTTGTGTCA AGGCGTTGAA ATTAAAGCGC ATTCGTATCG
CGGCGTGCTA CATTCTCGTC GTCGATAAAC TTGAGGGCGA AACAATGGGA CGTGAAATCG
CCGTGCGCGT GATGAGAGCC GCGCTGGAAG CCCGCGAGTA CAAGCTCGTC GAAGACCTCA
TCAAGTTTTT ACTGCAGCCA GCAGACGAAG CGGCGAAGGA AAATCAAAAG CCTGGAATCT
TCAAGCGCGT GCTTGAAGTC ATCGCTCCGC CGCCAAACAG CGTAATTGCG TTAGGCGGGC
GCGCAGATCG TGAGCTTGCG CTTGGTGAAC CAGAACAACT GTTATTGAAG TCGCACGTCG
ACTCTCTGGG CCGCGAACGC GATGTCGCGG CGATGGGAGC TTTTATGAGC GAAACGTCCT
TTGACGGCGT CGCTTACTTG AAACACGAGA CGGACGAGAA TGGTGAAGCG TACATCTCCG
ATTTCGCGGG CAGTATCGAG TTAGCAGCGC GGCGATTACG AGAAGGAAAA TTACGACGAG
CGGCGTCGAG TCAAAGTTCT CGCACAGAGA GTTTATTTCT CGTCGATCCC ACGCGCGCCG
TCGGATCGAA AGTCGAAAGC GACGCCACTT ACGTCACCTC GCTTCTCGCC ACCGCGCGCG
AGGCGGGTTG TACCGACTGG TCTTTACTAC TCGCTACTCT TTTAGGACGC GCAGACGTCC
TAAACGAGTT TTTCACGAAC GAACCGGCGC TTCGAGAACC TTGGATGAAC ATCGCAAAGC
GCGTCGCGAC AAACACGAGC GACGCGACGT TGAAGAATCA CCTCACGGCG CTCGTCTCGG
ATATTTGA
 
Protein sequence
MNGWGRRSAS ATTRSDGEFE GTLEIERVVT LDDGERATCC ASDDDEIVVG TDRGRVVFLC 
WEEGSEGRSA TCARSERDGG ATSVAWCRDT GTVVVRFDAG AVCALEIDAA GDVRRRNWFD
AFPCPATCAA FHRGSRRLAL GTADGEIRVY DDAMTSEAAK PRHVFGLSPW GFGVEDTGAL
AHASWSNDGR ALAVGWRRRG VSVWSESGCL LMCTLHHGGG DSAVGTSSSP RATFTGDEEV
PEMGACLAPP AWGVGDYALF VPVRSESGSK VLEYALAKSV SNSRVAPRSY DGGCDHDDAS
LLLGDDRIFI VASSATSTRV HARQEVCPSE YVQRQWPMCV AGMSPSGDRV AVGGVRGCVV
FDTRGECWSQ LGDVEEENSF EAIAFDWMQP APPTSGRQRS VLQPVLAIVA RLGRTRMFTK
SIKLSYGISF YADGGKGDLL MTMPLPSEAT GAYACGEFLL VSFSNGEIAV YEVEEMSSNV
GAISAHHVRE DAGQRRKTTL NAGGRVQGMC AVPPAAAPER APSECVVLTE AGELFVVDLT
DEYDQVKLFD DVAEFWVVGS ANAPQEMMMQ GDESSDFESD ARDSLSVDGG CVFAYGAEGM
RICYFPNGDL RQILINGATS CDVERAANNP ELEFDRELYP MSVSLNMNRI IGVTQKFSFA
DAVDMPYFTI APKSHTIVPY ILRKLLSSGQ HDAALRYARA ARRQTPHFMH ALEWLLFTAL
ERSNREITSQ TVLKQSIALL SELPNYLDVI VSVARKTDNT RWESLFKYAG KPSELCVKAL
KLKRIRIAAC YILVVDKLEG ETMGREIAVR VMRAALEARE YKLVEDLIKF LLQPADEAAK
ENQKPGIFKR VLEVIAPPPN SVIALGGRAD RELALGEPEQ LLLKSHVDSL GRERDVAAMG
AFMSETSFDG VAYLKHETDE NGEAYISDFA GSIELAARRL REGKLRRAAS SQSSRTESLF
LVDPTRAVGS KVESDATYVT SLLATAREAG CTDWSLLLAT LLGRADVLNE FFTNEPALRE
PWMNIAKRVA TNTSDATLKN HLTALVSDI