Gene Information |
Locus tag | OSTLU_26100 |
Symbol | |
ID | 5003983 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 225677 |
End bp | 230967 |
Gene Length | 5291 bp |
Protein Length | 1646 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419404 |
Product | predicted protein |
Protein accession | XP_001419944 |
Protein GI | 145351142 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | |
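The coordinate fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with one stop codon; neither is stated in the record. The function names are hypothetical, introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 225677
END_BP = 230967
PROTEIN_LENGTH_AA = 1646


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein (3 per residue).

    Counting one trailing stop codon is an assumption, not a record field.
    """
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)
non_coding = span - coding

print(f"gene span:      {span} bp")
print(f"coding bases:   {coding} bp")
print(f"remaining span: {non_coding} bp")
```

Under these assumptions the span matches the listed Gene Length of 5291 bp. The bases not accounted for by the protein would be introns and/or untranslated sequence, which is consistent with "Is gene spliced | Yes".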
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.490738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.061037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCGCGCTTT CCAACTCGCG TCTTCGGCGA TCGAGCGACG CAACACTTCC TCCCGCCGCG TCTCCGCGCG CGCGCTCGCC CCGAGCGCGC GCACCGCGCG CGGAAAGTGT CGTCTTCTCG AACGATCGCG CCGAAGCACG AAATCTTGCG CGCGCATTTT CTCACGAAAT GAGCGTTCAC GATCGGTTCC CGCACTCGAC CGCGGAGCTG TGCCGCGTAA ACGCGGTGCA GTTTGGGGTG TTGTCGCCCG ATGAGATCGT ACGTGCATCG AACGCGCGCG CGCGCGAAGC GAAGGCGCGA AAGAGCGCCG GCGCGGGAGA TCTTCGAGAG GGCGAGGCGG CGAGGGCGAA CGCGCGACGC GGGCGACGAA GGCGAGCGAG CGCGCACTGA CCGAACGTGA AACTTTCGCG CGATAGCGTA AAATGTCCGT GTGCGCGATC GAGACGAGCG AGACGTACGA GAAGGGTAGA CCCAAGCGAG GGGGTTTGTC CGATCCGCGC ATGGGGACGA TGGATCGCGG CGCGGTGTGC GAGACGGACG GGTGCGACAG CGTGCAGACG CCGGGTTACT TTGGGCATCT TGAGTTGGCG AAACCGGTGT ACCATCATGG GTTCATCAAC GTCGTGTTGA GAATTCTGAG ATGCGTCGGA TACTCGTCTT CGCAATTGTT GCTGCACAAG GACGAGGACC CAAAGTTTGC GCAGTTGATG CGAATCAAGA ACCCGAAAAA CCGATTGAAG AAACTCACCG ACGCGTGTAA AACCAAGAGC GTGTGTCCGT ACACCGGGAC GGCGCAACCG CAGTATCGTT TGGAGGGAAT GAAGATCACC GCCGAGTTCA AGCACCGCGA CGAGGAGCTG TTGCCGGAAG GTGGCGAACG CAAGGTGATC GTCACCGCGG AGCGCGCGTT ACAGATCTTG AAGGCGATTT CTGACGAGGA TTGCCGTGCG CTGGGTCTCG ACCCCGAGCA CGCGCGTCCG GATTGGTTCA TCTTGCAAGT GATGCCCGTG CCGCCGCCGC CGGTTCGTCC GTCGGTGTCG TTCGACGCTT CCACGCGCTC GGAGGACGAT TTGACGGTCA AGCTCATGGA AATCGTGAGA ACGAATAAAA ATTTAGAGAG ACAAGAACAA AATGGAGCGC CGCAGCACGT CATTTTAGAA TTTACAGAGC TGCTTCAGTA TCACATCATG ACCTTCATGG ACAACACGGT GGCTGGCATG CCGCGAGCGC TCACGCGCAG CGGCCGAAGC ATTAAATCCA TTTCAGAGCG TTTGAAGGGC AAGGCGGGTA GAATTCGAGG CAACTTGATG GGTAAGCGCG TGGATTTCTC TGCGCGTACC GTCATTACGC CCGATCCAAA CTTGATGCTC GACGAGCTCG GAGTTCCGTG GTCCATCGCG TTGAACATGA CGTATCCGGA AACCGTGACG CCGTACAACA TCGATCGCCT TCAACGATTG GTCGAGAACG GCCCGCATCC ACCTCCTGGT GAGACCGGTG CGCGTTACAT CATTCGTGAG GATGGTCAGC GACTCGATCT TCGCTTCTTG AAGAAGTCGA GCGAGAAGCG TCTCGAGTAC GGTTACAAAG TTGAACGTCA CATGGTGAAC GGAGACGTCG TCCTGTTCAA TCGTCAACCT TCTTTGCACA AGATGTCCAT CATGGGTCAT CGCGTCCGAA TCATGCCTTA CTCGACGTTC AGATTGAACT TGTCTGTCAC GCCGCCATAC AACGCTGATT TTGATGGGGA TGAAATGAAC ATGCACCTTC CGCAGTCTTA CGAGACACGC GCGGAGGTGA 
AGGAGTTGAT GATGGTACCG AAAATGATCG TATCGCCCCA GGCCAACAAG CCCGTGATGG CTATCGTGCA AGACACCTTG TTGGGTTGCC GCTTGATCAC GAAGCGCGAC ACCTTCATCA CAAAGGACGT CTTCATGAAC ATCATCATGT GGTTGGAGGA TTGGGACGGG AAGGTGCCGA AGCCGGCTAT CCTCAAACCC CAACCTTTGT GGACTGGTAA ACAAGTGTTT TCGATGATGT TACCGAAGGT GAACTTGCTT CGCACGAGTG CGTGGGCCAA GGACTCCGAC GACATGGCTT TCAGCGTGGA CGACACCGGT GTCCGAATTG AACAAGGCGA ATTGCTCACC GGGACGCTGT GCAAGAAGAC TATGGGTAGC GGCGGTGGCG GTTTGATCCA CGTGACCTGG GAAGAGTGGG GTCCCATCGC CGCACGCGGG CTCATCTCTC AGACGCAGAC GCTCGTCAAC TACTGGGTGC TCCACCACGG TTTCACCGTC GGCATTGCGG ATACCATTGC GGACGACGAG ACAATGTTCT CCATTAACAA CACCATCACC AAGGCTAAAG CTGACGTGAA GGAAGTCATC AAACTCGCAC AAAACAATGA ACTCGAGTTG CAACCGGGTA TGACGATGCA ACAATCGTTC GAACAGAAGG TGAACCAAAT CTTGAACAAG GCGCGTGATA ACGCTGGTAA CTCGGCGCAA AACTCGCTTA TGGATACCAA CAATGTCAAG ATGATGGTGA CGGCCGGTTC GAAGGGTTCG TTCTTGAACA TTTCGCAGAT GATTGCGTGT GTAGGACAAC AAAACGTAGA AGGAAAACGT ATTCCGTACG GTTTCAAAGG CAGAACATTG CCGCACTTTA GCAAGGATGA TTTCGGTCCC GAGTCGCGTG GTTTCGTCGA GAACTCGTAT TTGCGTGGTT TGACTCCTCA GGAGTTCTTT TTCCACGCCA TGGGTGGCCG AGAAGGTTTG ATCGATACTG CAGTTAAGAC GTCTGAAACT GGTTACATCC AACGTCGTCT TGTCAAGGCT ATGGAGGACG TGGTTGTCAA GTACGACGGC ACTGTGCGTA ACAGCGTCGG CGACGTCATC CAGTTCTTGT ACGGCGAGGA CGGCATGGAC GCGACCATGA TTGAAAGCCA ATCCATCGAT ACGTTGCGTT TCAGCGTGAA GGAGTTCGCG GCGAAGTTCC ACATCGATCC AGATGTCCCG GGCTTTAACA ATGGCTGGTT GAGCGAGGAG CAAGCGAGCG AACTCGCGCA CACTGCAAAG GTGCGCGAGA TTCTTGACGC AGAATGGGAT CGACTTCAGC GCGACCGAGT CGAACTGCGA ACGATTTGTC CCACAGGGGA TCCTCACGTG CACTTACCAG TGAACATGAA GCGTATCTTG TGGAATGCAC AAAAGCAGTA CGGCCTGTAC CGACCGGATT CTGGCGAAGA AGAGGTCAAG GTGACGCACA TCATCGAGAG CGTCGCTGAA CTTTTGCCGA AGCTCATCGT GGTGCCTGGT ACAGATCCGT TATCCGTCGA AGCGCAAAGA AACGGTACGA TGCTATTTTT CGCGCACGTG CGCGCCAATT TGGCAGCCAA GCGCGTGCTG AAAGAACACA AGCTCACGCG CGCCGGCTTC GACTGGGTCA TCGGCGAAAT CGAATCACGC TTCAAGATGG CACTTGCGCC GCCCGGTGAT GGTATTGGCA CGGTTGCCGC GCAATCAATC GGTGAGCCGG CGACGCAGAT GACGTTGAAT ACGTTCCACT TCGCCGGTGT GTCCGCCAAG AACGTGACCC TTGGTGTGCC 
GCGCTTGAAA GAAATCATCA ACATCGCGAA GAGCATCAAG ACGCCAAGCT TGACGATCGC ACTCAAGCCA GAGTTAGCTG GTGACAGAAC GAGAGCGAAG GACTGTCAAG CGAGCCTAGA GTACACCACG CTCCACAGCG TTGCCGCAGT CACCGAGGTT CACTACGATC CAGATCCCAC GGACACTGTC ATCGAGGAAG ACCGCGAATT CGTTCGCGCG TACTACGAAA TGCCAGACGA AGACGTTGAC CCGTCGCGCA TGTCCCCATG GTTGTTGCGT ATCGAGCTTA ACCGTGAGAT GATGGTAGAC AAGAAGCTCC TCATGGCGGA TGTTGCGGAG AGAATCAACG AAGATTTCGC CGGCGACTTG AGCTGCATCT TCAACGACGA CAACTCTGAA AAGTTGGTGT TGAGAATTAG AATCATGAAC CCCGAAGGCG TGAAATACGA GGACGCCTCG ACCGAGGACG AAGTATTCTT GAAGCGTCTC GAGACGCAGA TGTTGAGCAA CCTTGCATTG CGCGGTATTC CAGACATTAA GAAGGTTTTC ATCCGCGAAG CCAAACAAAA CGCCATCAAC AGCAAGACGG AGCTCTTCGA GAAAACCACC GAGTGGATGT TGGACACCGA AGGCGTCAAC TTGCTCGAAG TCATGGCACA TCCGGACGTC GACTTCACGA GAACTACATC TAACCACTTG ATTGAAGTGA TTCAAGTTCT TGGCATCGAA GCGGTGCGCA ATACGCTCTT GCGTGAGTTG CGTGGTGTGA TTGAGTTCGA CGGTTCGTAC GTGAACTACA GACATCTCGC GATTCTCGTC GAAGTGATGA CATACCGTGG TCACTTGATG TCGATTACGC GACACGGCAT CAACCGCGTG GAGACCGGTC CGCTCATGCG CTGCAGCTTC GAAGAAACTG TCGACATTCT CATGGAGGCG TCGGCGTTCT CTGAGCGAGA CAATATGACT GGGCCGAGCG AGAATATCAT GCTCGGGCAG TTCTGTCCGA TCGGTACCGG GGAGTTCAAG CTGCACTTGA ACGACGAGTT GTTACAAGAT GCAGTTGAGT TAGAACTGTT CGGGGATGGA TCTAGGACCC CTGGACACGG CGGAATGATG ACGCCGGGGC GCGATGGAAC GCCCGGTTGG GGATCAAAGT CGCCGAGTTT CTTGTTGAGT CCAGGTGGAC ATCAAAGTCC GTTCGATACA AGCATGGCAT TCTCGCCGTA CTCCGATGGC ATGGCGTTCT CACCGGGAAT GTCCCCTGGT GGTTTTGGAG GTTACTCCCC CACGAGTCCA GCGTATTCTC CGACGAGCCC AGCATACTCC CCCACGAGTC CGGCGTACTC GCCCACGAGC CCAGCGTACT CGCCCACGAG TCCAGCGTAC TCGCCCACGA GCCCGGCTTA CTCGCCCACG AGTCCAGCGT ACTCCCCCAC GAGTCCAGCG TACTCCCCCA CGAGTCCGGC GTACTCGCCC ACGAGCCCAG CGTACTCGCC CACGAGTCCA GCGTACTCGC CCACGAGCCC GGCTTACTCG CCCACGAGTC CAGCGTACTC CCCCACGAGT CCAGCGTACT CCCCCACGAG TCCGGCGTAC TCGCCCACGA GCCCAGCGTA CTCCCCGAAA GAGGAAGAAG AAAACGAAGA AGAATAGAGG AAAGGAGACA ACTGTTTGTT T
|
Protein sequence | MSVHDRFPHS TAELCRVNAV QFGVLSPDEI RKMSVCAIET SETYEKGRPK RGGLSDPRMG TMDRGAVCET DGCDSVQTPG YFGHLELAKP VYHHGFINVV LRILRCVGYS SSQLLLHKDE DPKFAQLMRI KNPKNRLKKL TDACKTKSVC PYTGTAQPQY RLEGMKITAE FKHRDEELLP EGGERKVIVT AERALQILKA ISDEDCRALG LDPEHARPDW FILQVMPVPP PPVRPSVSFD ASTRSEDDLT VKLMEIVRTN KNLERQEQNG APQHVILEFT ELLQYHIMTF MDNTVAGMPR ALTRSGRSIK SISERLKGKA GRIRGNLMGK RVDFSARTVI TPDPNLMLDE LGVPWSIALN MTYPETVTPY NIDRLQRLVE NGPHPPPGET GARYIIREDG QRLDLRFLKK SSEKRLEYGY KVERHMVNGD VVLFNRQPSL HKMSIMGHRV RIMPYSTFRL NLSVTPPYNA DFDGDEMNMH LPQSYETRAE VKELMMVPKM IVSPQANKPV MAIVQDTLLG CRLITKRDTF ITKDVFMNII MWLEDWDGKV PKPAILKPQP LWTGKQVFSM MLPKVNLLRT SAWAKDSDDM AFSVDDTGVR IEQGELLTGT LCKKTMGSGG GGLIHVTWEE WGPIAARGLI SQTQTLVNYW VLHHGFTVGI ADTIADDETM FSINNTITKA KADVKEVIKL AQNNELELQP GMTMQQSFEQ KVNQILNKAR DNAGNSAQNS LMDTNNVKMM VTAGSKGSFL NISQMIACVG QQNVEGKRIP YGFKGRTLPH FSKDDFGPES RGFVENSYLR GLTPQEFFFH AMGGREGLID TAVKTSETGY IQRRLVKAME DVVVKYDGTV RNSVGDVIQF LYGEDGMDAT MIESQSIDTL RFSVKEFAAK FHIDPDVPGF NNGWLSEEQA SELAHTAKVR EILDAEWDRL QRDRVELRTI CPTGDPHVHL PVNMKRILWN AQKQYGLYRP DSGEEEVKVT HIIESVAELL PKLIVVPGTD PLSVEAQRNG TMLFFAHVRA NLAAKRVLKE HKLTRAGFDW VIGEIESRFK MALAPPGDGI GTVAAQSIGE PATQMTLNTF HFAGVSAKNV TLGVPRLKEI INIAKSIKTP SLTIALKPEL AGDRTRAKDC QASLEYTTLH SVAAVTEVHY DPDPTDTVIE EDREFVRAYY EMPDEDVDPS RMSPWLLRIE LNREMMVDKK LLMADVAERI NEDFAGDLSC IFNDDNSEKL VLRIRIMNPE GVKYEDASTE DEVFLKRLET QMLSNLALRG IPDIKKVFIR EAKQNAINSK TELFEKTTEW MLDTEGVNLL EVMAHPDVDF TRTTSNHLIE VIQVLGIEAV RNTLLRELRG VIEFDGSYVN YRHLAILVEV MTYRGHLMSI TRHGINRVET GPLMRCSFEE TVDILMEASA FSERDNMTGP SENIMLGQFC PIGTGEFKLH LNDELLQDAV ELELFGDGSR TPGHGGMMTP GRDGTPGWGS KSPSFLLSPG GHQSPFDTSM AFSPYSDGMA FSPGMSPGGF GGYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP TSPAYSPTSP AYSPTSPAYS PTSPAYSPTS PAYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP TSPAYSPKEE EENEEE
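The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, along with a GC fraction and a non-overlapping count of the seven-residue block YSPTSPA that repeats near the end of the protein sequence. The helper names are hypothetical, and the short strings are excerpts copied from the record, not the full sequences.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def count_motif(protein: str, motif: str) -> int:
    """Non-overlapping occurrences of a motif, scanning left to right."""
    return clean_sequence(protein).count(motif)


# First two blocks of the gene sequence, as displayed in the record.
dna_excerpt = "CTCGCGCTTT CCAACTCGCG"
# Three consecutive repeat units, regrouped the way the record displays them.
protein_excerpt = "YSPTSPAYSP TSPAYSPTSP A"

print(clean_sequence(dna_excerpt))
print(round(gc_fraction(dna_excerpt), 2))
print(count_motif(protein_excerpt, "YSPTSPA"))
```

Applying `gc_fraction` to the complete gene sequence gives a value to compare with the listed GC content of 56%. Whether that field was computed over exactly the displayed span is not stated in the record.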
|
| |