Gene OSTLU_34669 details

Gene Information

Locus tag: OSTLU_34669
Symbol:
ID: 5003515
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 591667
End bp: 594522
Gene Length: 2856 bp
Protein Length: 922 aa
Translation table:
GC content: 57%
IMG OID: 640418936
Product: predicted protein
Protein accession: XP_001419848
Protein GI: 145350935
COG category: [K] Transcription
COG ID: [COG5108] Mitochondrial DNA-directed RNA polymerase
TIGRFAM ID:
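
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention), and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 591667
end_bp = 594522
gene_length_bp = 2856
protein_length_aa = 922

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1
print(span_bp == gene_length_bp)  # True

# If the whole span were coding, it would hold span/3 codons, one of them a stop.
max_unspliced_aa = span_bp // 3 - 1
print(max_unspliced_aa)  # 951

# The recorded protein is shorter than that upper bound.
print(max_unspliced_aa - protein_length_aa)  # 29
```

The 29-residue difference is consistent with the record marking the gene as spliced, since intron bases count toward the gene length but not toward the protein.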


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00224083
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.282079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCGA CGGCGTTCGG GGACGTGAAG GATCGTATTT TTGAAACTAC GGACGAGCGG 
AGGGCGAAGA TCAAGCTCGC GCTGCAGTCG CTGCACGCGC GAAGACGACA GCTCAGACGC
GCGGGGGGGC ACGAGGCGCG AGAGCGCGCG GAGGACGCGG ATGAGGGGGA GAGACGGGCG
CAAACGCGGA AGCAGCGCGA GGCGGAAAAT TTGGCGGTGG AAAGAGAGGT GATGCGGTAC
AAGGAGTTGG CGAGGAAGAC GTTCGCGGCG GGGATTGGGG CGCAGTTACC CGTGGTGCAA
AAGTTGCTGG CGTCGTTTTA CGTGCCTCTG GTGGAGGCGC TGACGGAAGA ACAAGAGAAG
ATACGGGGCA ACGTGCCGGG CGTCGACAGA CGCGTGTACG GGCACTACTT GGGCTTGCTC
GAACCGGATA AACTCGCGGT GTTGACGCTT CACGCGACGC TCAGCACGCT GATGAAGGGT
GACGGTAGGA ATAATGGCGC GTGGTCGTTC ATAAGGGGCG ACCAGACCGC GGCGGGGAGC
GCGAAGTTTA TCAGCGTCGC CGACCAAGTT GGAAGCGCGG TGCAAGCTGA GGTGAACTTG
GAGCGCATGC GGTCGGCGGA AAAGGCGGCC AAGGAGGCAT TTAGACGATC TAGACTAGTG
AACACCGAAG GCAACGTCAA GGAAATAGAC GACGCGGCGG AGCATAGGAT TAATTTGGCG
TTGGATACGT CAAAGATGAA CACCATCAAG TCGGTGTCCA AGCACGCTCG GCAAGCGCTC
GAGAACGCGG AGTGGGGTCG TGAGATTCGC CTTAAAATCG GAACAGTGCT GCTGACCGCG
CTCATGAACA CGGCAAAGAT TGGCGTACCG GACGAAAAAG GTGAATTATT GACGCTTCCA
GCCTTTTATC ACGACTACAA AGAGGCTGGG TACGGAATGC TGCACTGGCA CGATAGCATT
TACAGATTCA TCAACACGGA GACGATGACT CGCGCAGCGC TCGTACCGGT GCGACACTTT
CCCATGGTTA TCCCGCCGCG GTATTGGGAG AGGTACAATA AGGGAGGATA CTTGCGCGCT
GATAATCTAT GCATGCGAGG GAAGTACTCG AACGAAGGGC CGAGTCGAGC GCAAATCGCG
GCGTTGGAGG AGAAAGCGCG CGAGGCGGAC GCATCGGGGG AGCCCGTGCA ATACCAACCC
GTGTTAGACG CTCTCAATGC TTTGGGGCAA ACCGCGTGGC AAATCAATAC GGATGTGTTA
CCGATCGTGG AAGAAGTCTG GGCTCGAGGT GGTGGCGTGG CTGAGGTTCC GCTTCGCGCT
GAGTTGCAAC TCCCACGCTG GCCCGGCGGG TCATATGCGC TTCGCAGCGA CAAAAAGCGT
TTGCAGCTGC TCGCTAGTGG ACTTCCGGGC AAAGGTGAGG TTATCGACTT TTTGCAAAGC
GTGCGAAAGA CGAAAAAGTC AAACATGGAG CTCCACTCTC AGCGCTGCGA CTTTCTCATT
AAATTGCAAG TTGCGCGAGA GATGAAAAAC GAACCAAATA TTTATTTCCC GCACAATTTG
GATTTTAGAG GGCGCGCGTA CACGATGCAC GTGCACTTGA ATCACATAGG AAGTGATTTG
TGTCGCGGTT TGCTTCGATT TAACGAAAAG AAGCCGCTCG GCGAGCGCGG CCTGCGTTGG
ATGCACATCC AGTGCGCGAC GCTGTTTGGT AACGGCGCCG ACAAGCTTCC GATGGATGAG
AGAGTGCAGT TCATCAAAGA TCGCATAGAA GACGTGCGAG CGTCGGCTCA AGACCCGCTC
GCGAAGGATG CGTGGTGGCA AGAAGCGGAA GAGCCCTGGC AGTGTTTGGC GACGTGCATC
GAGCTCGACA AAGCGCTCGA GCTCTCGGAC CCGACGCAGT TCATGAGTAA TTTACCCGTG
CATCAAGATG GGTCGTGCAA CGGGTTGCAA CACTACGCCG CGCTCGGTCG CGACTTACAC
GGTGGTGAAG CCGTGAACTT GGTCCCCGCA GACAGAGGTG CGGACGTCTA CACCGGCATC
GCGAATGTGT TGAAGCGCAT CGTTGCTGAG GATATTAAAC TCATCGACAG CGAAGACGAG
GAAGACGTTA ACAACGCCAA GCTCGCGATG TCACTCGCTC AGCACATCGA CCGTAAGCTT
GTGAAGCAGA CGGTGATGAC GTCCGTGTAC GGCGTTACTT TCATCGGGGC GCGAGCGCAA
ATATATAGCC GTCTCCGTGA GCGCGAGGCG ATGGAGGACA ACGAACTTCT TCGCTATCGT
GTGTCCAACT ACGCCGCGAA AAGGACGCTC GACGCGTTGA ATAATATGTT TTCAAACGCC
CGAGATGTCA TGGGATGGCT CACGACCTGC GCCACGATCG CTACCTCAGC GGGCGAGCCC
GTGCGTTGGA CCACGCCTCT GGGATTGCCC GTCGTGCAGC CGTATCACAG TCAGCGAACC
AAGCGCGTGC GGACGATTTT GCAGTCATTC TCGCTAAAAG TTCACGACGA ACAACAGCCG
GTTATGAAAG TGAAGCAAAG GAGCGCGTTC CCGCCGAATT ATATTCACAG TATCGACAGT
TCTCATATGA TGAGGACGGC GATCGCGTGC GTGGACGCCG GATTGACGTT CGCCGGCGTT
CACGATTCCT TTTGGACGCA CGCGACGGAC GTGGACACCA TGAATGTCAT CCTGCGTGAA
AAGTTCATCG AGGTTCACAA AGAGCCTCTT CTCGAAAATC TTTATCACGA GTTCCGCGCG
AATTACCCAG ACGTCGCGGA CGAGTTCCCT CAGCCGCCCG CACCTGGCGA TTTGGATTTA
GACGTCGTTC AGGACTCGGT GTACTTTTTC AGCTAG
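
The GC content reported for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is pasted as displayed here (10-base groups separated by spaces and line breaks). The helper name `gc_percent` is hypothetical, and the record does not say exactly which region its GC figure was computed over.

```python
def gc_percent(sequence_block: str) -> float:
    """Return the GC percentage of a sequence given as whitespace-grouped text."""
    # Drop the spaces and line breaks used for display, and normalize case.
    bases = "".join(sequence_block.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First displayed line of the gene sequence, kept in its 10-base grouping.
first_line = "ATGGCGGCGA CGGCGTTCGG GGACGTGAAG GATCGTATTT TTGAAACTAC GGACGAGCGG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence block instead of `first_line` gives a value to compare against the GC content field.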
 
Protein sequence
MAATAFGDVK DRIFETTDER RAKIKLALQS LHARRRQLRR AGGHEARERA EDADEGERRA 
QTRKQREAEN LAVEREVMRY KELARKTFAA GIGAQLPVVQ KLLASFYVPL VEALTEEQEK
IRGNVPGVDR RVYGHYLGLL EPDKLAVLTL HATLSTLMKG DGRNNGAAKF ISVADQVGSA
VQAEVNLERM RSAEKAAKEA FRRSRLVNTE GNVKEIDDAA EHRINLALDT SKMNTIKSVS
KHARQALENA EWGREIRLKI GTVLLTALMN TAKIGVPDEK GELLTLPAFY HDYKEAGYGM
LHWHDSIYRF INTETMTRAA LVPVRHFPMV IPPRYWERYN KGGYLRADNL CMRGKYSNEG
PSRAQIAALE EKAREADASG EPVQYQPVLD ALNALGQTAW QINTDVLPIV EEVWARGGGV
AEVPLRAELQ LPRWPGGGLP GKGEVIDFLQ SVRKTKKSNM ELHSQRCDFL IKLQVAREMK
NEPNIYFPHN LDFRGRAYTM HVHLNHIGSD LCRGLLRFNE KKPLGERGLR WMHIQCATLF
GNGADKLPMD ERVQFIKDRI EDVRASAQDP LAKDAWWQEA EEPWQCLATC IELDKALELS
DPTQFMSNLP VHQDGSCNGL QHYAALGRDL HGGEAVNLVP ADRGADVYTG IANVLKRIVA
EDIKLIDSED EEDVNNAKLA MSLAQHIDRK LVKQTVMTSV YGVTFIGARA QIYSRLRERE
AMEDNELLRY RVSNYAAKRT LDALNNMFSN ARDVMGWLTT CATIATSAGE PVRWTTPLGL
PVVQPYHSQR TKRVRTILQS FSLKVHDEQQ PVMKVKQRSA FPPNYIHSID SSHMMRTAIA
CVDAGLTFAG VHDSFWTHAT DVDTMNVILR EKFIEVHKEP LLENLYHEFR ANYPDVADEF
PQPPAPGDLD LDVVQDSVYF FS