Gene OSTLU_26100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26100 
Symbol 
ID5003983 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp225677 
End bp230967 
Gene Length5291 bp 
Protein Length1646 aa 
Translation table 
GC content56% 
IMG OID640419404 
Productpredicted protein 
Protein accessionXP_001419944 
Protein GI145351142 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.490738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.061037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTCGCGCTTT CCAACTCGCG TCTTCGGCGA TCGAGCGACG CAACACTTCC TCCCGCCGCG 
TCTCCGCGCG CGCGCTCGCC CCGAGCGCGC GCACCGCGCG CGGAAAGTGT CGTCTTCTCG
AACGATCGCG CCGAAGCACG AAATCTTGCG CGCGCATTTT CTCACGAAAT GAGCGTTCAC
GATCGGTTCC CGCACTCGAC CGCGGAGCTG TGCCGCGTAA ACGCGGTGCA GTTTGGGGTG
TTGTCGCCCG ATGAGATCGT ACGTGCATCG AACGCGCGCG CGCGCGAAGC GAAGGCGCGA
AAGAGCGCCG GCGCGGGAGA TCTTCGAGAG GGCGAGGCGG CGAGGGCGAA CGCGCGACGC
GGGCGACGAA GGCGAGCGAG CGCGCACTGA CCGAACGTGA AACTTTCGCG CGATAGCGTA
AAATGTCCGT GTGCGCGATC GAGACGAGCG AGACGTACGA GAAGGGTAGA CCCAAGCGAG
GGGGTTTGTC CGATCCGCGC ATGGGGACGA TGGATCGCGG CGCGGTGTGC GAGACGGACG
GGTGCGACAG CGTGCAGACG CCGGGTTACT TTGGGCATCT TGAGTTGGCG AAACCGGTGT
ACCATCATGG GTTCATCAAC GTCGTGTTGA GAATTCTGAG ATGCGTCGGA TACTCGTCTT
CGCAATTGTT GCTGCACAAG GACGAGGACC CAAAGTTTGC GCAGTTGATG CGAATCAAGA
ACCCGAAAAA CCGATTGAAG AAACTCACCG ACGCGTGTAA AACCAAGAGC GTGTGTCCGT
ACACCGGGAC GGCGCAACCG CAGTATCGTT TGGAGGGAAT GAAGATCACC GCCGAGTTCA
AGCACCGCGA CGAGGAGCTG TTGCCGGAAG GTGGCGAACG CAAGGTGATC GTCACCGCGG
AGCGCGCGTT ACAGATCTTG AAGGCGATTT CTGACGAGGA TTGCCGTGCG CTGGGTCTCG
ACCCCGAGCA CGCGCGTCCG GATTGGTTCA TCTTGCAAGT GATGCCCGTG CCGCCGCCGC
CGGTTCGTCC GTCGGTGTCG TTCGACGCTT CCACGCGCTC GGAGGACGAT TTGACGGTCA
AGCTCATGGA AATCGTGAGA ACGAATAAAA ATTTAGAGAG ACAAGAACAA AATGGAGCGC
CGCAGCACGT CATTTTAGAA TTTACAGAGC TGCTTCAGTA TCACATCATG ACCTTCATGG
ACAACACGGT GGCTGGCATG CCGCGAGCGC TCACGCGCAG CGGCCGAAGC ATTAAATCCA
TTTCAGAGCG TTTGAAGGGC AAGGCGGGTA GAATTCGAGG CAACTTGATG GGTAAGCGCG
TGGATTTCTC TGCGCGTACC GTCATTACGC CCGATCCAAA CTTGATGCTC GACGAGCTCG
GAGTTCCGTG GTCCATCGCG TTGAACATGA CGTATCCGGA AACCGTGACG CCGTACAACA
TCGATCGCCT TCAACGATTG GTCGAGAACG GCCCGCATCC ACCTCCTGGT GAGACCGGTG
CGCGTTACAT CATTCGTGAG GATGGTCAGC GACTCGATCT TCGCTTCTTG AAGAAGTCGA
GCGAGAAGCG TCTCGAGTAC GGTTACAAAG TTGAACGTCA CATGGTGAAC GGAGACGTCG
TCCTGTTCAA TCGTCAACCT TCTTTGCACA AGATGTCCAT CATGGGTCAT CGCGTCCGAA
TCATGCCTTA CTCGACGTTC AGATTGAACT TGTCTGTCAC GCCGCCATAC AACGCTGATT
TTGATGGGGA TGAAATGAAC ATGCACCTTC CGCAGTCTTA CGAGACACGC GCGGAGGTGA
AGGAGTTGAT GATGGTACCG AAAATGATCG TATCGCCCCA GGCCAACAAG CCCGTGATGG
CTATCGTGCA AGACACCTTG TTGGGTTGCC GCTTGATCAC GAAGCGCGAC ACCTTCATCA
CAAAGGACGT CTTCATGAAC ATCATCATGT GGTTGGAGGA TTGGGACGGG AAGGTGCCGA
AGCCGGCTAT CCTCAAACCC CAACCTTTGT GGACTGGTAA ACAAGTGTTT TCGATGATGT
TACCGAAGGT GAACTTGCTT CGCACGAGTG CGTGGGCCAA GGACTCCGAC GACATGGCTT
TCAGCGTGGA CGACACCGGT GTCCGAATTG AACAAGGCGA ATTGCTCACC GGGACGCTGT
GCAAGAAGAC TATGGGTAGC GGCGGTGGCG GTTTGATCCA CGTGACCTGG GAAGAGTGGG
GTCCCATCGC CGCACGCGGG CTCATCTCTC AGACGCAGAC GCTCGTCAAC TACTGGGTGC
TCCACCACGG TTTCACCGTC GGCATTGCGG ATACCATTGC GGACGACGAG ACAATGTTCT
CCATTAACAA CACCATCACC AAGGCTAAAG CTGACGTGAA GGAAGTCATC AAACTCGCAC
AAAACAATGA ACTCGAGTTG CAACCGGGTA TGACGATGCA ACAATCGTTC GAACAGAAGG
TGAACCAAAT CTTGAACAAG GCGCGTGATA ACGCTGGTAA CTCGGCGCAA AACTCGCTTA
TGGATACCAA CAATGTCAAG ATGATGGTGA CGGCCGGTTC GAAGGGTTCG TTCTTGAACA
TTTCGCAGAT GATTGCGTGT GTAGGACAAC AAAACGTAGA AGGAAAACGT ATTCCGTACG
GTTTCAAAGG CAGAACATTG CCGCACTTTA GCAAGGATGA TTTCGGTCCC GAGTCGCGTG
GTTTCGTCGA GAACTCGTAT TTGCGTGGTT TGACTCCTCA GGAGTTCTTT TTCCACGCCA
TGGGTGGCCG AGAAGGTTTG ATCGATACTG CAGTTAAGAC GTCTGAAACT GGTTACATCC
AACGTCGTCT TGTCAAGGCT ATGGAGGACG TGGTTGTCAA GTACGACGGC ACTGTGCGTA
ACAGCGTCGG CGACGTCATC CAGTTCTTGT ACGGCGAGGA CGGCATGGAC GCGACCATGA
TTGAAAGCCA ATCCATCGAT ACGTTGCGTT TCAGCGTGAA GGAGTTCGCG GCGAAGTTCC
ACATCGATCC AGATGTCCCG GGCTTTAACA ATGGCTGGTT GAGCGAGGAG CAAGCGAGCG
AACTCGCGCA CACTGCAAAG GTGCGCGAGA TTCTTGACGC AGAATGGGAT CGACTTCAGC
GCGACCGAGT CGAACTGCGA ACGATTTGTC CCACAGGGGA TCCTCACGTG CACTTACCAG
TGAACATGAA GCGTATCTTG TGGAATGCAC AAAAGCAGTA CGGCCTGTAC CGACCGGATT
CTGGCGAAGA AGAGGTCAAG GTGACGCACA TCATCGAGAG CGTCGCTGAA CTTTTGCCGA
AGCTCATCGT GGTGCCTGGT ACAGATCCGT TATCCGTCGA AGCGCAAAGA AACGGTACGA
TGCTATTTTT CGCGCACGTG CGCGCCAATT TGGCAGCCAA GCGCGTGCTG AAAGAACACA
AGCTCACGCG CGCCGGCTTC GACTGGGTCA TCGGCGAAAT CGAATCACGC TTCAAGATGG
CACTTGCGCC GCCCGGTGAT GGTATTGGCA CGGTTGCCGC GCAATCAATC GGTGAGCCGG
CGACGCAGAT GACGTTGAAT ACGTTCCACT TCGCCGGTGT GTCCGCCAAG AACGTGACCC
TTGGTGTGCC GCGCTTGAAA GAAATCATCA ACATCGCGAA GAGCATCAAG ACGCCAAGCT
TGACGATCGC ACTCAAGCCA GAGTTAGCTG GTGACAGAAC GAGAGCGAAG GACTGTCAAG
CGAGCCTAGA GTACACCACG CTCCACAGCG TTGCCGCAGT CACCGAGGTT CACTACGATC
CAGATCCCAC GGACACTGTC ATCGAGGAAG ACCGCGAATT CGTTCGCGCG TACTACGAAA
TGCCAGACGA AGACGTTGAC CCGTCGCGCA TGTCCCCATG GTTGTTGCGT ATCGAGCTTA
ACCGTGAGAT GATGGTAGAC AAGAAGCTCC TCATGGCGGA TGTTGCGGAG AGAATCAACG
AAGATTTCGC CGGCGACTTG AGCTGCATCT TCAACGACGA CAACTCTGAA AAGTTGGTGT
TGAGAATTAG AATCATGAAC CCCGAAGGCG TGAAATACGA GGACGCCTCG ACCGAGGACG
AAGTATTCTT GAAGCGTCTC GAGACGCAGA TGTTGAGCAA CCTTGCATTG CGCGGTATTC
CAGACATTAA GAAGGTTTTC ATCCGCGAAG CCAAACAAAA CGCCATCAAC AGCAAGACGG
AGCTCTTCGA GAAAACCACC GAGTGGATGT TGGACACCGA AGGCGTCAAC TTGCTCGAAG
TCATGGCACA TCCGGACGTC GACTTCACGA GAACTACATC TAACCACTTG ATTGAAGTGA
TTCAAGTTCT TGGCATCGAA GCGGTGCGCA ATACGCTCTT GCGTGAGTTG CGTGGTGTGA
TTGAGTTCGA CGGTTCGTAC GTGAACTACA GACATCTCGC GATTCTCGTC GAAGTGATGA
CATACCGTGG TCACTTGATG TCGATTACGC GACACGGCAT CAACCGCGTG GAGACCGGTC
CGCTCATGCG CTGCAGCTTC GAAGAAACTG TCGACATTCT CATGGAGGCG TCGGCGTTCT
CTGAGCGAGA CAATATGACT GGGCCGAGCG AGAATATCAT GCTCGGGCAG TTCTGTCCGA
TCGGTACCGG GGAGTTCAAG CTGCACTTGA ACGACGAGTT GTTACAAGAT GCAGTTGAGT
TAGAACTGTT CGGGGATGGA TCTAGGACCC CTGGACACGG CGGAATGATG ACGCCGGGGC
GCGATGGAAC GCCCGGTTGG GGATCAAAGT CGCCGAGTTT CTTGTTGAGT CCAGGTGGAC
ATCAAAGTCC GTTCGATACA AGCATGGCAT TCTCGCCGTA CTCCGATGGC ATGGCGTTCT
CACCGGGAAT GTCCCCTGGT GGTTTTGGAG GTTACTCCCC CACGAGTCCA GCGTATTCTC
CGACGAGCCC AGCATACTCC CCCACGAGTC CGGCGTACTC GCCCACGAGC CCAGCGTACT
CGCCCACGAG TCCAGCGTAC TCGCCCACGA GCCCGGCTTA CTCGCCCACG AGTCCAGCGT
ACTCCCCCAC GAGTCCAGCG TACTCCCCCA CGAGTCCGGC GTACTCGCCC ACGAGCCCAG
CGTACTCGCC CACGAGTCCA GCGTACTCGC CCACGAGCCC GGCTTACTCG CCCACGAGTC
CAGCGTACTC CCCCACGAGT CCAGCGTACT CCCCCACGAG TCCGGCGTAC TCGCCCACGA
GCCCAGCGTA CTCCCCGAAA GAGGAAGAAG AAAACGAAGA AGAATAGAGG AAAGGAGACA
ACTGTTTGTT T
 
Protein sequence
MSVHDRFPHS TAELCRVNAV QFGVLSPDEI RKMSVCAIET SETYEKGRPK RGGLSDPRMG 
TMDRGAVCET DGCDSVQTPG YFGHLELAKP VYHHGFINVV LRILRCVGYS SSQLLLHKDE
DPKFAQLMRI KNPKNRLKKL TDACKTKSVC PYTGTAQPQY RLEGMKITAE FKHRDEELLP
EGGERKVIVT AERALQILKA ISDEDCRALG LDPEHARPDW FILQVMPVPP PPVRPSVSFD
ASTRSEDDLT VKLMEIVRTN KNLERQEQNG APQHVILEFT ELLQYHIMTF MDNTVAGMPR
ALTRSGRSIK SISERLKGKA GRIRGNLMGK RVDFSARTVI TPDPNLMLDE LGVPWSIALN
MTYPETVTPY NIDRLQRLVE NGPHPPPGET GARYIIREDG QRLDLRFLKK SSEKRLEYGY
KVERHMVNGD VVLFNRQPSL HKMSIMGHRV RIMPYSTFRL NLSVTPPYNA DFDGDEMNMH
LPQSYETRAE VKELMMVPKM IVSPQANKPV MAIVQDTLLG CRLITKRDTF ITKDVFMNII
MWLEDWDGKV PKPAILKPQP LWTGKQVFSM MLPKVNLLRT SAWAKDSDDM AFSVDDTGVR
IEQGELLTGT LCKKTMGSGG GGLIHVTWEE WGPIAARGLI SQTQTLVNYW VLHHGFTVGI
ADTIADDETM FSINNTITKA KADVKEVIKL AQNNELELQP GMTMQQSFEQ KVNQILNKAR
DNAGNSAQNS LMDTNNVKMM VTAGSKGSFL NISQMIACVG QQNVEGKRIP YGFKGRTLPH
FSKDDFGPES RGFVENSYLR GLTPQEFFFH AMGGREGLID TAVKTSETGY IQRRLVKAME
DVVVKYDGTV RNSVGDVIQF LYGEDGMDAT MIESQSIDTL RFSVKEFAAK FHIDPDVPGF
NNGWLSEEQA SELAHTAKVR EILDAEWDRL QRDRVELRTI CPTGDPHVHL PVNMKRILWN
AQKQYGLYRP DSGEEEVKVT HIIESVAELL PKLIVVPGTD PLSVEAQRNG TMLFFAHVRA
NLAAKRVLKE HKLTRAGFDW VIGEIESRFK MALAPPGDGI GTVAAQSIGE PATQMTLNTF
HFAGVSAKNV TLGVPRLKEI INIAKSIKTP SLTIALKPEL AGDRTRAKDC QASLEYTTLH
SVAAVTEVHY DPDPTDTVIE EDREFVRAYY EMPDEDVDPS RMSPWLLRIE LNREMMVDKK
LLMADVAERI NEDFAGDLSC IFNDDNSEKL VLRIRIMNPE GVKYEDASTE DEVFLKRLET
QMLSNLALRG IPDIKKVFIR EAKQNAINSK TELFEKTTEW MLDTEGVNLL EVMAHPDVDF
TRTTSNHLIE VIQVLGIEAV RNTLLRELRG VIEFDGSYVN YRHLAILVEV MTYRGHLMSI
TRHGINRVET GPLMRCSFEE TVDILMEASA FSERDNMTGP SENIMLGQFC PIGTGEFKLH
LNDELLQDAV ELELFGDGSR TPGHGGMMTP GRDGTPGWGS KSPSFLLSPG GHQSPFDTSM
AFSPYSDGMA FSPGMSPGGF GGYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP
TSPAYSPTSP AYSPTSPAYS PTSPAYSPTS PAYSPTSPAY SPTSPAYSPT SPAYSPTSPA
YSPTSPAYSP TSPAYSPKEE EENEEE