Gene OSTLU_92478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_92478 
Symbol 
ID5000945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp691814 
End bp694924 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table 
GC content49% 
IMG OID640416366 
Productpredicted protein 
Protein accessionXP_001417023 
Protein GI145345023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC GTTCTGAAGA CCCGGCGTGG GTACCACCGC CGCGAATCGT CATGGTGAAG 
GCTTTCCCGG AGGAGGACGT CGTGCTCGTG AAGCCCGAGC TCTTCGCGGA GTGCTCGCGA
AGACGCTACC TCGCCCTGAG CGATGGTGTA AAGTCCCGCT TCGCCCAAGC GCACGCGAGC
TCGCGAACGA AGTATCGTAA GCTCTGGCGG CGGTTGTTGG TTCACATGCA AACGTCGCTG
GCGATGGCGA AGGCGCTGAC AAAGTTGGAG TTGTTGCGAA CGAATACGGC GCGCGGACAC
AGATTGGACG TAGCTGGATT GGCTCTCATT GCGATGTACG ATAATTACAC GAAGAAGATT
GATCAGATGT TGAGCAAGCA GATGCAGGAG TCGCTACAAA AGGCGATGGA TTCGAAAATG
GAGAAAGATA AAGTGATTGC ACAATCGCTG CTCGAACTCG AAACGAACGA TGAGAAGAGT
GGCGAAGATT TCAATCCGAT GGCTATCATG CGAGCTTTGC GTAAGCATAG CGCGCAGAAA
CCAACAAAAA AACAGTTTCA AGGCTCTGAA GAAGAGATTG GCGTTGAGCA CACGCGTTTG
TTGTTCCTCG TGTCGGTGTA CACGGCGGAC GCGAACAGAG ATGCGTTTCC AGGCTTGGCC
GAAGACAGCG AGCTTTGGGT ACGAAAGACC CAGCTCTTGG TTTTGATTTA TGAATGTATT
CGTGCTGGTG CGCTGAATTA CGATTACGCA CCTTTGGCCG AGACCATGGG ATCTAAGCGC
GTTTGGTTGA ATATTTCTCA AGAAGGCGTC GACGACCTCG ACGATATGTG TCAAGTTGGA
TTCTTATCGA GCATGAAGAT GAGTTCGACG AAGTATAGCA CCTCTACGGC TTACCGATTG
ACAAAAGAGG GATATTTACA CCTCAAAACG CACCTCCGTC GTCGTGATAG AGCCGCCATC
GAAGAAGTCG TGTACTCCGA TAAATTGCAG CCTTCACCAA GAAACCTGTT TGTCGCAAAA
TGGGACGCCA AAGCGGATAC TTTTTATCTC CAAAGCGCTT CAGGTCACAC GAAATCAAGC
GATGTCACCG ATATCGAAGA GGTTTCGTAC GTTTCATCTC CGTTCGTTCC GAAGTCGATG
CGAAAGTGGG GGCGTGAGTG CACTAGTAAC AAACACAAGA CCGCCGCACT CGTCAAAGCA
ACGGGCACCA TCAGAGACGA GCTGGATGAG CAGTTGTCAT TCGATAGATT GCGATTGATG
GTTGGTGAAT GGATTCCTAT GGGCGCGAAT CAGGTGTTGA GTCTGAATGA CAAGCTAGGT
TCGACGGACC GCGTTGCTGG CGGCTACTTT ACGAGCGAAA TGGACAAAGA TCCCAACAAC
CCATGCTTCC AAGGCAAAGT CGATGGTTTG ACGCGGGTCA ACGTGCTAGA TTTCGAAGAA
ACGTCGTACG TGAACTTCGA AGCCGAGGTA CAGTACGAAG AAGAACCTGG AATTGTGCAA
ATTGAAAACT TCGGCATCCA CGTGAGTGAA GAAGGCTTCA TGCTTTACGG CTTGACGCTC
GACGGTATGA TGAAAGTGAC CGATGGCAAC AACTTTTCGC TCGATCACCT CGCACGTTTG
CTTCGTGATA TCGGCACGGA TTCAAGCGAA GTCATTGGTA ATTTACTCAC CGACCATCAG
CGACACTTGC TGGATTTAGT GCACATGGGC GATGCGATGA ATCGCGAAAA GTTCAACGTG
TTCTTCACGA GTAGAATCAA CAAGAGAGGT CAAGAAGAGA TGCCCATGGC GCATGAGCTG
CTTGATATGG AAGACATGGA AAATGAGATT CGGCAGATCA TCGGTGAAGT AGAGTGTGGT
TTTCAACTAT CGCGAGATGA TGAATTGATC ATCATCGGCT CAACCGGTAT GATTCTCTGC
TCAAAGAATA CCGAAAAATT TGAACCCTTG GTATTACAAT ACATGTCCAT GATGTCACGA
AATATGTTCA TTCAGGCGCT CTATAGGCAA ACATTTGTCA CCGTGGACAC GCTCGGAGAA
ATCGATCATC TCATACGCAA TCATGACGCC GATCCAAACA ATATCTTCAA AATTCGCGAA
CTCATGTCAA ATGTATCGGC AGATATCATT CTCATGCGGG AAATTCATAG CTATCTCCTT
GAATCTCTCA CTGAGACGGC ACCAATGGCA ATAAACGATC AAGTATTGAA GCGCTTGTCT
AAAATACTGC AGCTCGATGA TACAAACTTT CGCCTTGAGC GACGCATTCG AGATATTCGC
AAAAGCTTAG ACGGCGCGAG CGGTGAGTTG CAAGCGTTGA AAAGTGCCGC CGATGTCATT
CAAGAAAACA AAGAGTTCAA GGTGAACGAG GCGGTGTCAA ACAACACGCA GAACCTTGAA
GAAGTTTTCC GCGCAAATGA GCGCGCATCG ACGTCCCTTG AGATCATGCA AGTAGTGTTG
GCTGGATCTC TCGCTTTTGC AATTCTTGAT CGATTGCACG GTTTGTATCT CGGCGTTGCG
GCTGACATCG ACTGGTCGGT CAAGGCTTTC GATTGGTACG TACAGACTCC AATGGTCATG
TTCATCTTAA ATATGCTCTG GTGGTTCGCA TTGGGTGCGT CATTCAATCG ATTGATCAAG
TATGCCGGAT CAAAATCAGC GGGAATCTTG TCCATTCGGT ACACGATGAA CTGTCGATTT
AATCAGAAAG CCATGACTGC GTTCTTGCAT GTGGCGAACC CAGAAATGGA GGATGGTGAA
GCTGATGCGA GGACAAATCT GAAAAAATTC ACTTGGGACG AGACAGATGA GATCCGATGG
AAAGGATGCC CTCCAAAGAT TGAAATGATC GTCGACATGA AGAATGGTTT CTTGCTCAGC
GTTTTCATTC AAATCGCCAC CAGGCGAAGT AAATGTACGC AGTCTGACGC AAAGAGACAT
TTTTTTGCTC GACTCCGAGA ATTAGGTCTC ATTTCTGGTC CCGTGCCTGG GTTAGAGACG
GCAAAGGATG CCGAATACGT CTACCGTAAG CCATTTCTCT CACGTGGAGC GAAATTCAAA
CTATGGTTGA AGAAGACACG TGAGAATGTT CACTACTTCT TCACGTTCTA G
 
Protein sequence
MARRSEDPAW VPPPRIVMVK AFPEEDVVLV KPELFAECSR RRYLALSDGV KSRFAQAHAS 
SRTKYRKLWR RLLVHMQTSL AMAKALTKLE LLRTNTARGH RLDVAGLALI AMYDNYTKKI
DQMLSKQMQE SLQKAMDSKM EKDKVIAQSL LELETNDEKS GEDFNPMAIM RALRKHSAQK
PTKKQFQGSE EEIGVEHTRL LFLVSVYTAD ANRDAFPGLA EDSELWVRKT QLLVLIYECI
RAGALNYDYA PLAETMGSKR VWLNISQEGV DDLDDMCQVG FLSSMKMSST KYSTSTAYRL
TKEGYLHLKT HLRRRDRAAI EEVVYSDKLQ PSPRNLFVAK WDAKADTFYL QSASGHTKSS
DVTDIEEVSY VSSPFVPKSM RKWGRECTSN KHKTAALVKA TGTIRDELDE QLSFDRLRLM
VGEWIPMGAN QVLSLNDKLG STDRVAGGYF TSEMDKDPNN PCFQGKVDGL TRVNVLDFEE
TSYVNFEAEV QYEEEPGIVQ IENFGIHVSE EGFMLYGLTL DGMMKVTDGN NFSLDHLARL
LRDIGTDSSE VIGNLLTDHQ RHLLDLVHMG DAMNREKFNV FFTSRINKRG QEEMPMAHEL
LDMEDMENEI RQIIGEVECG FQLSRDDELI IIGSTGMILC SKNTEKFEPL VLQYMSMMSR
NMFIQALYRQ TFVTVDTLGE IDHLIRNHDA DPNNIFKIRE LMSNVSADII LMREIHSYLL
ESLTETAPMA INDQVLKRLS KILQLDDTNF RLERRIRDIR KSLDGASGEL QALKSAADVI
QENKEFKVNE AVSNNTQNLE EVFRANERAS TSLEIMQVVL AGSLAFAILD RLHGLYLGVA
ADIDWSVKAF DWYVQTPMVM FILNMLWWFA LGASFNRLIK YAGSKSAGIL SIRYTMNCRF
NQKAMTAFLH VANPEMEDGE ADARTNLKKF TWDETDEIRW KGCPPKIEMI VDMKNGFLLS
VFIQIATRRS KCTQSDAKRH FFARLRELGL ISGPVPGLET AKDAEYVYRK PFLSRGAKFK
LWLKKTRENV HYFFTF