Gene OSTLU_17102 details

Gene Information

Locus tag: OSTLU_17102
Symbol:
ID: 5004024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 378838
End bp: 381804
Gene Length: 2967 bp
Protein Length: 988 aa
Translation table:
GC content: 64%
IMG OID: 640419445
Product: predicted protein
Protein accession: XP_001420157
Protein GI: 145351596
COG category:
COG ID:
TIGRFAM ID:
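The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 378838
end_bp = 381804
gene_length_bp = 2967
protein_length_aa = 988


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count of an unspliced CDS minus one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: 381804 - 378838 + 1 = 2967 bp, and 2967 / 3 - 1 = 988 aa.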


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.121633
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.237025
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGG CCAAGGCGCG CGCCGTCGCG CGGTCGTCCG CCGCGTCGTC CGCGCCGGCG 
ACCAAGCCGA GGCGCGGCGT CGGGTTCACG AGCGGGCCGA CGAAGCATCG CACCGGGAAG
ACGCTCGGCG GCGGCGTGGA GTTCGCGAAG AAACGCCTGA AAGTCGGTCG AAAGGTGGCG
AAACACGCGA ACGAGACGGA CAGCGCGGTG CGGAGCAAGC GCATACGGCT CGCGGCGCAG
AATTTGCAGA GCGCGGGGGC GGGGACGAGC GGTGAGGGGA CGCGGGACGA GGACGCGGCG
AGCGCGCGAG GGACGCCGCT GAACGAACTG TTGAATCAGT GCGGACACTA CGCGGCGAAG
ACGCGATGCG ATGGGTTGAA TGGGTTGTTG GAGGTGTGCG AGAGGTATCC GGGCGCGGTG
CGAGCGAGGG CGGGGGACGC GATCGAACGC GTGGGCGAAC GGTTGGCGGA CGAAGCGAGA
GAGGCGAGAC GAGCGGCGAG GGAGTGCCTG GGGCGAGGCG TGGTGCCGGC GTTAGGGGTG
GAGGGGCTGG CGCCGTTTGC GAAAACGTTG ATTTTGTACG CGGGGGCGGC GCTGACGCAC
GTGGCGGACG ATGTACGGAG GGACGCGCCC GCGGCGCTGG ACGCGCTGTT GGAGGCGGCG
CCGACGCTCG TGGCGGCGCA CGCGCCGGCG AGCACGCTGG GGCACTTGGG CGAGCTCTTG
CGGCGCGGAG ACGACGGTGG CGTCGGTTCG GGGTCGTCCG TGCAGCGTGG CGTCGGATCG
CAAAAGCCCG CGACGCGATT GGCGCTATTG CGAAGTTGTA GGCGCTTTTT GGAGACGCTC
GCAGAGGGCG TAGGCACGAC GGACGGCGAT CGCGGTGGTC TTTCGTCGAC CTCGTCGATG
ACGACGACGT TCGTTTGGGG AGAAGAGGCG CGCCGCGGCG CCGCCACGCG CTCTATCGGA
TCGATGTATG CGGAAAGATC GCGCCAAGCG CCGCCGAGCG TGCTCGCGGC GACGACGGCG
TCCAACGACG AATCTGAAGA TTCCGGGGGA CGAATCACCG GTGAAAGTCG TGCTGCGGTG
CGCGTGAACG CCAAACGCCT TGTAGAGTTG GCGATGTGCG TCTGGGACGA CGCAGCGCAG
ACGTTTACGG ATGAGAGAGG CGTCGACGTT GATCGCGTTC GAGTAATGGC GCAATCCATG
GCGTGCGCTC GGTTAGCGCT CAGCCTCGCC GACGGTGCGG AAGAATCGGA GATGCTCGAA
AACACTGATA GTGCATCGAC TGCGGTGAGT GTCGTTCCGG AAATCGCGCG GCGTACTCTC
GGCATGTTCC CATCGACGTC GCCGGCGAGC GTGGCGGAGA AGCAAGACAT CTCGCGAACG
CGCGAGGCGA TGGTGGATTT GAATTTCGAG ACGTGTCGGT TCTTGCTCGA CGCGTCCTCA
TCGGTTGCTC ACGAAGCACT CGCGCAGCAT TTAGCGCCAG GAGTGATGGA TGCGCTTCCG
CACGTTCTCG CGCGCGCTTT GCAATACGTC ACGTCGACGC TGCGGGGCGT CGCCCTCGAC
GGCGGCGCCA TGGGCGAAGA CGTGCCGACG CCGGATGACG CGTACGGGGA CGTCTTGGCG
CTCGCGCGTG ACGCACTTAC TTTACCGGCG TGGTGTTTCA GTGCGAGCAT CACGGGTAGC
GCGTGTGCAG ATCTTCTCGT CGCCGTGACG GAGACTTGGG AGCGCGCCGT GGCGGATGAA
GACATCGAAC GCATCACGCA ATGCGTCGCT TTGTTGACGG AAACTTTGCC TGAAGAAGCG
CGACAAGGGT ACTTTCGCGT GCCGATCGAA ACCGCTGCGG GATGGGTGCG TCACATTCCT
CGCGTACTGT GGGCGTTCAA GCATGAAAAC CCCTCTGCGA CGCAAAAGTT GCTGTCGTTG
CTACACGACG TCGCGGCGAG AAATCCTCCG GGATCGCCGT TGGCTGACGT CTTATCGACG
TGCGAGGCAG AAATGGCGGT GCTTTTCTTC ATGGTTCCAC CCGCGGGGTC ACCCGAAGGC
GCGAAATCGA GACCCGGTCC GTTCGCCCGC CTTCCGTTCC CGTCGCAGTG TGCGGCGGTG
CGTCTCGTAG GCGTCTTGCC GACGCTCACG CCGCCGATGA TTCGGGCGTT GGCCAAGATG
TGTCTGGACG TTGACCGCGT GAATGAAGAA CTTTGCGTCA TCGCTATCGA GGCCATGCAA
GCCAACGCGC TGGCGGCGCC TTTAGAGTTA ACGATGTCGT TCTACGCCAC TCTACTCGTC
GGCGCCGCCG GGGTGAAATT TCTCGACAAG TCGAGCAAGC GCGACGTCGC GACGGTGGAA
CAGAAATCCT GGCTCATCGC TCGTCGAGCG ATTCCGAGCG CGGCGGCCGC TTTAGTAGCG
CTTAGTGATG CCGACGCGCC GTGGACCGGG GCATCTCTCG CCAGTGTGAC GCTCAAGCAC
ATGTGGAGCA GTCGAGTGGA AAAGGGCGAC GTCGACGGCG CCACGCGAAC GGCGAGCGGG
TTCGTCGCGC TCATCGCGAG CACGGCTGAA TTCGCAAGAT CGGTGTCGGG TCAATCGGGT
CAATCGATCG ATGACGATAA CGTTTCCGGC GCCATTCCAG AGATGTTCGC TTGGTTCATC
TTGCGCGCTG AAGACGGCGA TGGCGTTGAC GTCGACGTCG CGTGGCGAGC GTTGCGCGCC
GCGCCTTCGA CGACGCCTGG TCCCGTCGCG AGCGCGGTCG TCGCGTCTTC CGAATCTTCC
GCCGCGCTCA CCGATCGCGC GTTGGCTTTC GTGAGCAAGT TGATCACGGA AACCGCCGCC
GGTTCGATTG AAATTTCCAA AGACGAGTTG CGCGACGTCG TCCGATCGAT TCAAAACAAA
GCGTCGGCGC TCGAGGCGAA CGAGTCGACG AAACGCGCGC GGGCGCTCGA CGTTCATTGG
AACGTCGCGT TCGGAGAGGC GATATAG
 
Protein sequence
MGKAKARAVA RSSAASSAPA TKPRRGVGFT SGPTKHRTGK TLGGGVEFAK KRLKVGRKVA 
KHANETDSAV RSKRIRLAAQ NLQSAGAGTS GEGTRDEDAA SARGTPLNEL LNQCGHYAAK
TRCDGLNGLL EVCERYPGAV RARAGDAIER VGERLADEAR EARRAARECL GRGVVPALGV
EGLAPFAKTL ILYAGAALTH VADDVRRDAP AALDALLEAA PTLVAAHAPA STLGHLGELL
RRGDDGGVGS GSSVQRGVGS QKPATRLALL RSCRRFLETL AEGVGTTDGD RGGLSSTSSM
TTTFVWGEEA RRGAATRSIG SMYAERSRQA PPSVLAATTA SNDESEDSGG RITGESRAAV
RVNAKRLVEL AMCVWDDAAQ TFTDERGVDV DRVRVMAQSM ACARLALSLA DGAEESEMLE
NTDSASTAVS VVPEIARRTL GMFPSTSPAS VAEKQDISRT REAMVDLNFE TCRFLLDASS
SVAHEALAQH LAPGVMDALP HVLARALQYV TSTLRGVALD GGAMGEDVPT PDDAYGDVLA
LARDALTLPA WCFSASITGS ACADLLVAVT ETWERAVADE DIERITQCVA LLTETLPEEA
RQGYFRVPIE TAAGWVRHIP RVLWAFKHEN PSATQKLLSL LHDVAARNPP GSPLADVLST
CEAEMAVLFF MVPPAGSPEG AKSRPGPFAR LPFPSQCAAV RLVGVLPTLT PPMIRALAKM
CLDVDRVNEE LCVIAIEAMQ ANALAAPLEL TMSFYATLLV GAAGVKFLDK SSKRDVATVE
QKSWLIARRA IPSAAAALVA LSDADAPWTG ASLASVTLKH MWSSRVEKGD VDGATRTASG
FVALIASTAE FARSVSGQSG QSIDDDNVSG AIPEMFAWFI LRAEDGDGVD VDVAWRALRA
APSTTPGPVA SAVVASSESS AALTDRALAF VSKLITETAA GSIEISKDEL RDVVRSIQNK
ASALEANEST KRARALDVHW NVAFGEAI
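The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two ten-base blocks of the gene sequence, used as a small sample.
sample = "ATGGGCAAGG CCAAGGCGCG"
dna = clean_sequence(sample)
print(len(dna), gc_fraction(dna))
```

Pasting the full gene sequence into `sample` would allow the resulting length and GC fraction to be compared with the Gene Length and GC content fields reported in the record.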