Gene OSTLU_24022 details


Gene Information

Locus tag: OSTLU_24022
Symbol:
ID: 5000072
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 175983
End bp: 180551
Gene Length: 4569 bp
Protein Length: 1395 aa
Translation table:
GC content: 54%
IMG OID: 640415493
Product: predicted protein
Protein accession: XP_001416093
Protein GI: 145342014
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID:
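
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state the convention) and uses hypothetical helper names. It also shows that a 1395 aa protein plus a stop codon needs fewer bases than the full gene span, which is compatible with the gene being flagged as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


# Values copied from the Gene Information fields above.
gene_length = span_length(175983, 180551)
cds_length = coding_bases(1395)

print(gene_length)               # 4569, matching "Gene Length"
print(cds_length)                # 4188
print(gene_length - cds_length)  # 381 bases in the span that are not coding
```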


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000888169
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCGCCGCGA GGAAGCGCGC GTCTCGCGCC CATCGACGCG AAACCCTCCA TTGTCGCCAT 
CACCGCGCGC GCGCGTTCTC GCGGCGCGTC GTCGACGCGC GCGCGCCGTC CGAGTCGCCG
AAAAGCGATC GCCGCCGCCG GTGCAGACGC GCGCGGTCGC GCCTAAAACG CGCGTCGAAC
GCCATGCCGA CGTTCCGGGA CGACATGGCG GGGTCATCGC ACGACACCGC GGCGATTGAT
TTCACGAAGA CGCCGTTCGC GCCGGCGTCG ACGCCGAGAA AGATTAGCGC GGTGCGGTTT
TCGCTGATGA GCTCGAGCGA AATCGGACGC GCGGGGGTGT TTCACGTGTT CGAACGGAAT
TTGTATCAGA TGCCGCAGCG AACGCCGATG CCGAACGGGA TATTAGATCC GAGAATGGGG
ACGACGGATA AGCGCGGTGG GGAGTGCGCG ACGTGTCGAG GGAACATGGT GGATTGCGCG
GGGCACTTTG GGTACATCAA GCTGGAGCTG CCGGTGTTTC ACATCGGATA CTTTAAAAAC
ATCATCAGCG TGTTGCAGTG CGTGTGCAAG AATTGCTCGC GGATTTTACT GAACGAGGAT
GATCAAGATA TGTTTTTGAA GCGCTATCGG AATCCAAGAC TTGAGATTAC GCAGAGGCGT
CTGCTGTACA AAAAGTTGAC GGAAAAGTGC AAACGATGCC GAACGTGTCC GCATTGCGGA
GATTACAACG GCGTGACCAA GCGTGCGGGG CAGACTTTGA AGATCGTTCA CGAAAAGTAT
AGTAAAAACC CAGCGTTGTT GGAGGAGTTT ATGAAGGAGT TTGAAACAGC TTTGAAGTAC
AACGAGCAGC TTCGAGCGTC ACTGCCAAAG GTGCAAGACG ATTTGAATCC CATTCGTGTG
TTGGGCATAT TACAGCGCGT CACTTCGGCC GATTGCGAAT TACTAGACAT CGCAGATCGA
CCGGAACACT TGCTCTTGAC GCACCTGCCC GTTCCGCCGT GTTGCATCAG GCCTTCGGTG
GAGATGGACG GCGTGTCAGG GTCAAACGAA GACGATATCA CGATGAAGCT CATACAAATC
ATCGAAGTCA ACAATGTTTT GCGGCAAGGC TTAGAGAAAG GGCTCGCTGT TAACAATATG
ATGGAGAATT GGGATTTTTT ACAGATTCAA TGCGCCATGT ACATCAACAG CGAGCTTCCC
GGTTTATCTC TGCAGTACCA GGGCCCTGGT AAGCCGCTGC GCGGCTTTGT GCAAAGACTC
AAGGGTAAGC AAGGGCGCTT CCGCGGTAAC CTCAGCGGTA AGCGCGTAGA CTTCAGTGCG
CGCACTGTGA TCTCACCAGA TCCGAATCTT CGGATCGACG AGGTTGGTGT ACCAATACAT
ATAGCGAAGA CAATGACATA CCCAGAAGTC GTCAACAAAC ACAACATGGT GATGCTGAGA
GAGCGCGTGC GCAATGGTAT GGCGAAGCAT CCTGGAGCGA ACTTTGTCAA GTTTGCGTCT
GGTGGTATGC AGTACTTGAA GTACGGCGAT CGTCGCAAGA TTGCCAGCGA GTTGAAATTC
GGAGACATTG TTGAGCGACA TTTGCACGAC GGCGACATTC TGTTGTTCAA TCGTCAGCCG
AGTTTACACA AGATGTCAAT CATGGCGCAC AAGGCACGTA TCATGGAGTG CCGAACTCTA
CGTTTTAACG AGTGCGTGTG TACGCCGTAC AACGCCGATT TCGACGGTGA CGAGATGAAT
ATTCATTGCC CTCAAACGGA AGAAGCGCGC GCCGAAGCGA TGCAACTTAT GGGAGTGCAA
CACAACCTGT GCACACCGAA AAACGGCGAG ATTCTCATCG CAGCGACGCA GGACTTCCTA
ACTGCAGCGT TTCTTCTCAC CATTAAGGAT GCGTTTTTTG ATCGCTCTCA GTTTGGCTCG
CTCGTCGCGT ACATGGGTGA TGCGCTCCTT TCCGTGGATC TTCCGACGCC AGCAATCTTG
AAACCCATTG AGCTGTGGAC TGGAAAGCAA GTGTTCAGCG TTCTCATTCG ACCGAAAGCG
TCCGATCACA TATATGTTAA TCTTGAAGTC GCTGAAAAGC TGTACAACAA GAAGGACAAA
ACAATGTGTC CCGACGACGG GTACGTGTGC ATTCAGAATA GTGAAATAAT CTCAGGGCAA
CTCGGCAAGG CGACACTCGG TAGTGGAAAT AAAAGTGGAC TATTCTACGT GCTCAACATA
GAATACGGCG CCGAAGCCGC CGCAAATGCG ATGAATAGAC TCGCCAAGCT GAGCGCTCGA
TGGCTGGGTA CGCGCGGGTT TAGCATTGGC ATCGACGACG TCTCACCGGC CGCGGAGTTG
TCAGCCGAAA AGGGCCGACG TATCGAGGAC GGCTACAGAA CTTGCGACGA ACGAATCGCT
TCGTATGAGA AAGGTACTTT ACCTCTTCAA GCGGGCTGCA ACGCGGAGGA AACGTTGGAG
GCTGAAGTTC TCGGGGTCTT GTCTGCGGTG CGCGAAGCCG CGGGGAACGC GTGTTTAAAG
GCGCTGCCGC GTCGCAACGC ACCACTTATC ATGGCTCTTT GCGGCAGTAA GGGTAGCACG
ATCAACATTT CTCAGATGAT CGCGTGTGTT GGTCAACAAG CTGTGGGAGG CTCGCGACCT
CCTGACGGTT TCGCCGAGCG ATCGCTCCCG CATTTCAGGC GCGGAGAAAA GACACCCGCC
GCAAAGGGCT TTGTCGCCAA CTCTTTCTTT TCTGGCATGC GACCGACGGA ATTCTTCTTC
CATACCATGG CTGGTCGTGA AGGTCTCGTC GATACTGCCG TAAAGACTGC AGAGACTGGA
TACATGTCTC GTCGTCTCAT GAAGGCGCTA GAGGACCTAT CGCTTCAGTA CGACGGCACT
GTGCGCAACT CGATGGGTGG CATTGTGCAG CTGCGCTACG GCGACGACGG CATGGAACCG
ACTATGATGG AGAGTGAAGA CGCCCAACCG ATCGAGTTTA AGCGGTGTTT GATGAATGTG
CGAGGTCTGA ACTCGGCACG AGGCGAACCG CCAGCGACTC GCTCGGCGTT AGATGCGGCG
CTGGACCAGT TCAAGCGAAA CAACAGAATT ATCAATGTCG ACGAAGAAGG CGAAAGCGCC
GAGACGCGCG AACTCGTGTA CAGCGCTGGT ATCTCGCAAT TATTCTACGA AAACATGAGA
ACATTCATCA TCAACGAAGT GGAGTCGCCT TCGGATGGCG TATACATGTC GGATCGACAG
ATGAAGGCAT TTTTGGACGC GTGCGCGCGA AAATACACTG AAAAACGAAT TGAGCCCGGC
ACCGCAGTTG GCGCCATTGG CGCACAATCC ATCGGCGAGC CAGGCACACA AATGACGCTG
AAGACGTTCC ATTTCGCCGG TGTGGCGTCG ATGAACATCA CCCTCGGCGT TCCTCGAATT
AAGGAAATCA TCAACGCGTC CAAGAATATC AGTACGCCAA TCATCACGGC GTCGCTCATG
AGCGACAGGG ACGTCAAGGC GGCGCGCGTG GTGAAGGGTA GAGTGGAGAA GACGACGTTG
GGCGAAATCT GTTCCGAAAT CAATATTGTC GTTCGTCCGC ACGATCTTTA CCTCGAGCTT
ATGTTAGATA TGGAGGCGAT CAATCAACTT CAGTTGGACG TCACCATTCA CTCCGTGCGC
CTGGCTGTGC TCGGGGCGCC AAAGCTCAAG CTCAAGCCTC AACACGTGCT CATCGCCGGC
GAGAATATCT TGCATGTCTT ACCCTCGGAA GAAGCACTGA TAGAAAAACG CGCGTTGTTC
ACTCTGCAGC ACATGCGACT GGCCATTCCC GCCGTCATCG TGCAAGGCAT CCCATCCGTC
GGTCGAGCCG TCATCAACGA CAAGGGCGAC GGAACGTACA ACCTCATCGT GGAAGGCGTG
AACTTGCAGT CCGTCATGGG TATCGAGGGC ATCAAGGGAA CAGAGACACG AACAAACCAT
GTCATGGAGT GTGAACGCAC GCTAGGCATC GAAGCGGCGC GCGCGTGCAT CATTGAAGAG
ATTGACAGCA CGATGGGCGC TCACGGCATG TCAATCGACA ACCGACACTC GATGTTGCTC
GCCGACGTCA TGACGTACAA GGGCGAGGTT CTCGGGATCA CGCGTTTCGG CATGGCAAAG
ATGAAAGACT CTGTACTCAT GTTGGCTTCG TTCGAGAAAA CTACGGATCA TCTGTTCGAC
GCAGCGCTGC ACGGGCGCAC GGATTACATC GACGGCGTCT CCGAGTGCAT CATCATGGGC
ATTCCAATGC CCATCGGCAC GGGGATGTTC CGCTTGCAGC ACCGCGCGAT GAAACTCGAA
GTTGACATCA ACTTGGTAGA AGAAGACGGA AGAGAAACGT TGATGACGAC TGTCACCCCC
GAACAACGCG AAGAATTGCC GAAAAGACCG CCCTTGCTCT TGCAAGGCGG ATACTGCAAA
CCATGCAGAC CTATCGAAGC TTAATACGTT GCGGTCCGCT CGAAATTTTA GAAATTTTAG
TAGAATAGAC GGATTTTTAC AATTCAATTC GAGGGCGTTA GCCAAATGTC GCGATCGTGC
GAGCGATGA
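
The gene sequence above is displayed in space-separated blocks of 10 bases, 60 per line. A minimal Python sketch for stripping that display formatting and computing a GC fraction follows; the function names are hypothetical, and the example runs on the first displayed line only, so its result is not expected to equal the 54% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a non-empty sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied as shown.
first_line = "CGCGCCGCGA GGAAGCGCGC GTCTCGCGCC CATCGACGCG AAACCCTCCA TTGTCGCCAT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases per displayed line
print(round(gc_fraction(seq), 3))  # GC fraction of this one line only
```

To reproduce the record's GC content figure, the full 4569-base listing would need to be passed through `clean_sequence` first.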
 
Protein sequence
MPTFRDDMAG SSHDTAAIDF TKTPFAPAST PRKISAVRFS LMSSSEIGRA GVFHVFERNL 
YQMPQRTPMP NGILDPRMGT TDKRGGECAT CRGNMVDCAG HFGYIKLELP VFHIGYFKNI
ISVLQCVCKN CSRILLNEDD QDMFLKRYRN PRLEITQRRL LYKKLTEKCK RCRTCPHCGD
YNGVTKRAGQ TLKIVHEKYS KNPALLEEFM KEFETALKYN EQLRASLPKV QDDLNPIRVL
GILQRVTSAD CELLDIADRP EHLLLTHLPV PPCCIRPSVE MDGVSGSNED DITMKLIQII
EVNNVLRQGL EKGLAVNNMM ENWDFLQIQC AMYINSELPG LSLQYQGPGK PLRGFVQRLK
GKQGRFRGNL SGKRVDFSAR TVISPDPNLR IDEVGVPIHI AKTMTYPEVV NKHNMVMLRE
RVRNGMAKHP GANFVKFASG GMQYLKYGDR RKIASELKFG DIVERHLHDG DILLFNRQPS
LHKMSIMAHK ARIMECRTLR FNECVCTPYN ADFDGDEMNI HCPQTEEARA EAMQLMGVQH
NLCTPKNGEI LIAATQDFLT AAFLLTIKDA FFDRSQFGSL VAYMGDALLS VDLPTPAILK
PIELWTGKQV FSVLIRPKAS DHIYVNLEVA EKLYNKKDKT MCPDDGYVCI QNSEIISGQL
GKATLGSGNK SGLFYVLNIE YGAEAAANAM NRLAKLSARW LGTRGFSIGI DDVSPAAELS
AEKGRRIEDG YRTCDERIAS YEKGTLPLQA GCNAEETLEA EVLGVLSAVR EAAGNACLKA
LPRRNAPLIM ALCGSKGSTI NISQMIACVG QQAVGGSRPP DGFAERSLPH FRRGEKTPAA
KGFVANSFFS GMRPTEFFFH TMAGREGLVD TAVKTAETGY MSRRLMKALE DLSLQYDGTV
RNSMGGIVQL RYGDDGMEPT MMESEDAQPI EFKRCLMNVR GLNSARGEPP ATRSALDAAL
DQFKRNNRII NVDEEGESAE TRELVYSAGI SQLFYENMRT FIINEVESPS DGVYMSDRQM
KAFLDACARK YTEKRIEPGT AVGAIGAQSI GEPGTQMTLK TFHFAGVASM NITLGVPRIK
EIINASKNIS TPIITASLMS DRDVKAARVV KGRVEKTTLG EICSEINIVV RPHDLYLELM
LDMEAINQLQ LDVTIHSVRL AVLGAPKLKL KPQHVLIAGE NILHVLPSEE ALIEKRALFT
LQHMRLAIPA VIVQGIPSVG RAVINDKGDG TYNLIVEGVN LQSVMGIEGI KGTETRTNHV
MECERTLGIE AARACIIEEI DSTMGAHGMS IDNRHSMLLA DVMTYKGEVL GITRFGMAKM
KDSVLMLASF EKTTDHLFDA ALHGRTDYID GVSECIIMGI PMPIGTGMFR LQHRAMKLEV
DINLGVSQMS RSCER
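
The start of the protein can be matched to the gene sequence: the ATG that begins at the fourth base of the fourth gene-sequence line translates to the first ten residues listed above. The sketch below is an illustration only. It assumes the standard genetic code, since the Translation table field in this record is blank, and `translate` is a hypothetical helper that does not handle introns or ambiguous bases.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; trailing bases that do not fill a codon are ignored.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Thirty bases copied from the gene sequence, starting at the first ATG of the protein.
start_of_cds = "ATGCCGA CGTTCCGGGA CGACATGGCG GGG"
print(translate(start_of_cds))  # MPTFRDDMAG
```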