Gene OSTLU_44713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_44713 
Symbol 
ID5000123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp259705 
End bp262882 
Gene Length3178 bp 
Protein Length982 aa 
Translation table 
GC content55% 
IMG OID640415544 
Productpredicted protein 
Protein accessionXP_001416112 
Protein GI145342057 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID[TIGR00592] DNA polymerase (pol2) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCGATGTGC GAGCGGCGGA TCGCGCGAAT TGGGTGCGAC GACCGCTGGC GCGGGCGGTG 
GACGCGACGC GGGACGCGCT GCTGTTTCAA CAGTTGGACA TCGATTACGC GGTGGAGAAG
CCGCACGAGG GACTGGGGGC GCGACGGGCG AAAGGTGACG CCGCGGTGGT GCGGATGTTC
GGGGTGACGA AAGAGGGACA CAGCGTGTGC GCGCACGTGC ACGGGTTCGA GCCGTATTTT
TACGCGTCGG TGCCGGAAAA TTTCGGCGAG GCGGATTGCG CGGCGTTCAG GCGACGTTTG
AACGAGGAGG TGAGCGCGGC GAGGAAGAAC GCGCCGGGAG TGCACGTCGT GGACGTGTCG
CTCGAGAGGA AGCAGAGTTT GATGCATTAT AGCGACGTGA AGGATCGCTT GTTCGCCAAG
ATCACGATGG GGCTGCCGAA TATGGTGAGC GCGGCGCGAG GGATTTTGGA AAAGGGATTC
AGCGTGCCGG GGGTTCGGGA CGGGGCGTTC ACGACGTACC CGACGTTTGA GTCCAACATC
GTGTACGCCT TGCGGTTTAT GGTGGACTGC GCCGTGGTCG GTGGGAACTG GATCGAGTTT
CCGGTGAATT CGTACACGGT TCGTGCGAAG AAGGCGTCGC ACTGTCAAAT CGAAGTGGAC
ATCATGTACG ACAAGCTCAT CTCGCACCCG GCGGAGGGTG AGTACTCCAA GCTCGCCCCG
TTTCGCATCT TATCCGTGGA TATCGAGTGT GCGGGGCGTA AAGGCCACTT TCCCGACGCA
CAATTGGATC CGGTGATTCA GATCGCGACT CTAGTCACCG AGCAAGGGAG CGATAAACCG
ATTATTCGAG CCGTGTGGAC GCTCGATACG TGCGCTCCGA TCGTCGGCGC TGACGTCTTG
AGTTTTAAGG ATGAGCGGGA ACTGCTTCGG AACTGGGGTA ATTTTCTTCG CGGGTGCGAC
CCGGATTTGT TGATCGGCTA TAACATCGTC AATTTCGATT TCCCGTACTT GTTGGAGCGC
GCCGAGAAGC TCGGCGTCGC CGACTTTCCG TACTGGGGAC GCTTGATCGG CACCAAGGTG
CGCATGCGCG ACACGGTTTT CTCGTCCAAG GCGTACGGTA CGCACGAAAG TAAAGAAATA
TCTTGCGAGG GACGCGTGCA ATTCGACATG CTTCGCGCTA TTCAGCGCGA TTACAAGTTG
TCTTCATATT CGCTCAACGC AGTTTCTGCG CATTTTTTGG GCGAGCAAAA AGAAGACGTG
CACTACAGCG CTATCTCGGA GCTGCAAAAC GGTACGCCCG AGACGAGACG GCGTTTAGCG
GTGTACTGTT TGAAGGATGC GTATCTTCCG CAGAGATTGC TGGACAAGTT GATGTACATG
TACAACTACA TCGAAATGGC TCGCGTCACC GGCGTGCCGC TGAATTACCT GTTGACTCGC
GGTCAGTCCA TCAAGGTCAT GTCGCAGCTT TTGCGCAAGG CGAGGCAGCG AAACATGCTG
ATTCCTCACC ATGTGAAGCA GGGCGGTGGC AACGTTCAAG CGGAGGGTGG TGTCGCGTAC
GAAGGTGCGA CGGTCTTAGA CGCCAAGGCT GGTTACTACG AGATGCCCAT CGCGACGCTC
GATTTCGCTT CGCTGTATCC ATCCATCATG ATGGCTCACA ATCTGTGCTA CTCGACTTTA
GTGCCAAAGG ATAAGGTGGG TAAGCTCAAC CCGGATGATT ACGGGACATC GCCATCGGGA
GATACGTTCG TGAAGGCATC CAAGGTGAAG GGGATTTTGC CAGAAATTTT AGAGGAACTC
TTGGGGGCGC GTAAACGCGC CAAGGCTGAT CTTAAAAAAG CCACTGATCC GTTGCAAAAA
GCGGTCTTAG ACGGTCGCCA GCTCGCGCTG AAAGTCTCAG CAAACTCTGT GTACGGTTTC
ACCGGTGCCA TGGTCGGACA GCTGCCTTGC CTCGAAATTT CTTCTTCAAC GACGGCGTAC
GGCCGTACGA TGATTGATCA CACGAAGAAA ATGGTGGAAG AGAAGTATCG AACCGTGAAT
GGATACACTG GAAATGCCGA GGTCATTTAT GGTGATACCG ACTCCGTGAT GATCAAGTTC
AATACGCCGG ATTTGGCGGA GTCTATGAAA CTCGGTGAAG AAGCCGCGGT GTACGTCTCT
GAGACGTTCT TGAAGCCTAT CAAACTCGAA TTCGAAAAGG TGTACTGGCC ATATCTCTTG
ATTAGCAAGA AGCGCTACGC TGGTTTACTC TGGACGAACA CTGAAAATTA CGACAAGATG
GACACCAAAG GTATCGAGAC CGTGCGTCGC GATAACTGTT TGCTCGTGCG ACAAGTCATC
GAGACGGTTT TGGAGAAGAT TTTGAAGCAG CGAGATGTGC AGGGCGCGGT GAAATACGTG
CAGAGTACGA TCGCAGACTT GCTCATGAAT AGATTAGATT TCTCACAGCT CGTGATCACG
AAGGGTTTCA CAAAAGAAGC AGACTCGTAT GGGGTCAAGA TGGCACACAT AGAGCTCGCG
CAGCGCATGC GCCAGCGCGA TCCCGCCACC GCGCCGGTAG TCGGCGATCG TATCGCGTAC
GTCATCATTA AAGCGGCGAA GAACGCAAAG GCGTATGAAA AGTCAGAGGA TCCGATTTTT
GCTTTGGACA ACAACTTACC CATCGATACG AAGCACTATT TGGATCACTA CTTGACCAAG
CCTCTGCTGC GTATTTTCGA GCCCATCTTG CACAACGCGC AATCCGTACT GTTGCACGGC
GAGCACACGC GTCGCATCGC GCAGCCGACG CCGACTGCGA AAGCTGGCGG AATCATGCAG
TTTGCCAAGA TTCGTTTAAG CTGTATCGGT TGCCGCGCAC CGATATCGGA TGAGAAAATG
TCCAAGTCGC TGTGCAAGAA TTGCCTCAAC GACGAATCGC AGCACCTGAG AAAAGCGCTC
GCCTCCGTGA ACAACCTCGA GGGAGACTTC AACCGGCTTT GGACGCAGTG TCAGCGGTGC
CAAGGCTCAC TTCACCAGGA CGTCCTCTGC ACATCGCGGG ATTGTCCCAT TTTCTACCGT
CGCAAAAAAG TCCAAAAAGA TTTAACCGAA GCCACGGCGC AGCTGCGACG TTTCGATTGG
TGAGCGCGAC GAAGATTCCC GACGCGATAG TCGAGTCGAT TCCCGACGCG ATAGCCAA
 
Protein sequence
MFGVTKEGHS VCAHVHGFEP YFYASVPENF GEADCAAFRR RLNEEVSAAR KNAPGVHVVD 
VSLERKQSLM HYSDVKDRLF AKITMGLPNM VSAARGILEK GFSVPGVRDG AFTTYPTFES
NIVYALRFMV DCAVVGGNWI EFPVNSYTVR AKKASHCQIE VDIMYDKLIS HPAEGEYSKL
APFRILSVDI ECAGRKGHFP DAQLDPVIQI ATLVTEQGSD KPIIRAVWTL DTCAPIVGAD
VLSFKDEREL LRNWGNFLRG CDPDLLIGYN IVNFDFPYLL ERAEKLGVAD FPYWGRLIGT
KVRMRDTVFS SKAYGTHESK EISCEGRVQF DMLRAIQRDY KLSSYSLNAV SAHFLGEQKE
DVHYSAISEL QNGTPETRRR LAVYCLKDAY LPQRLLDKLM YMYNYIEMAR VTGVPLNYLL
TRGQSIKVMS QLLRKARQRN MLIPHHVKQG GGNVQAEGGV AYEGATVLDA KAGYYEMPIA
TLDFASLYPS IMMAHNLCYS TLVPKDKVGK LNPDDYGTSP SGDTFVKASK VKGILPEILE
ELLGARKRAK ADLKKATDPL QKAVLDGRQL ALKVSANSVY GFTGAMVGQL PCLEISSSTT
AYGRTMIDHT KKMVEEKYRT VNGYTGNAEV IYGDTDSVMI KFNTPDLAES MKLGEEAAVY
VSETFLKPIK LEFEKVYWPY LLISKKRYAG LLWTNTENYD KMDTKGIETV RRDNCLLVRQ
VIETVLEKIL KQRDVQGAVK YVQSTIADLL MNRLDFSQLV ITKGFTKEAD SYGVKMAHIE
LAQRMRQRDP ATAPVVGDRI AYVIIKAAKN AKAYEKSEDP IFALDNNLPI DTKHYLDHYL
TKPLLRIFEP ILHNAQSVLL HGEHTRRIAQ PTPTAKAGGI MQFAKIRLSC IGCRAPISDE
KMSKSLCKNC LNDESQHLRK ALASVNNLEG DFNRLWTQCQ RCQGSLHQDV LCTSRDCPIF
YRRKKVQKDL TEATAQLRRF DW