Gene OSTLU_297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_297 
Symbol 
ID5003311 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp97144 
End bp100200 
Gene Length3057 bp 
Protein Length990 aa 
Translation table 
GC content55% 
IMG OID640418732 
Productpredicted protein 
Protein accessionXP_001419281 
Protein GI145349730 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID[TIGR00592] DNA polymerase (pol2) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0400359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.313206 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACGGTTCGA CGCCCTTCTT CTTCATGGAT GCGCAAGAAG AGCGCGAATC CCCCGGCACA 
GTGTTCCTCT TCGGTCGTGT GCCGGTGTCG AGCGATCCGC AGTCGGAGAC CATCAGTGCG
TGTGCGGTGG TGCAGAACAT GCAAAGATGC ATGTACATCG TTCCGACCGC GTCTACGTTC
GCTGATCCTG ATGGAGAGTT GGAAGCGCTG GGGCAAATGA TGGAGGAAAC GCGGCGCGAA
TTCAAGTCGT GCGCGGATTC GGACGACAAG AAGGAGGAAA AGAGAATTGC GGCTCAAAAG
GCGAAATCTG ATCTCATGAA AAAACTCGTG CCTTTGTCCG GTGACCTTCG CGCCGAAGTC
AAAGAAGTCT TGAAGGCGCG AAAGATTGAG AACTCGAAGA TCACCATCGT CCGCCGACGC
TATTGCTTTG AGCGCAAGGA CATTCCGCAA GGTCCGTTGT TCGTGCTCAA AGTGAAGATC
CCGGCGACTT ACGCAGCGTT TCCGAGCGAT ATCAAGGGTA AGCACTTCGT CGCCGCTTTG
GGCACGCAGG CTCCCATGTT GGAACTTCTC ACGCTCAAGT CTAAGCTCAA GGGCCCGAGT
TGGATCGCGC TTCACGGCGC CGCCATCGTT CCGACGGAGA AGCAAAAGTC TTGGTGCAAA
CTCGAGCTCA CACTGCCGAA CGCACACAAG AGTGTGCGCC CTTGTCTCGA AGCCATCGCA
TCGCGCCCAG CGCCGCGTCT CACCGTTGCT TCGTTGAACC TGCAGACGAT CGTCAATCAG
CAAACCAACG TGAACGAAAT CGCAGTCGCC TCGGTGCAGT ACATTCGCGA CGTAAACTGC
GAAGGATCCA CAACGGCGGC GCAACTGAAG ACCGGTTTAC GTCATTTTAC CGTCGTGCGC
AAGCTCGATG GACTGGAGAT GCCTCCGGGA TGGCAAAATG CAGTCGCGCA CGAGAACAAC
ACCAATGTCA TCGCCAAGCG TACAGGCTCC GTCGTCCTCG CCGCGCAGAA CAACGAACTC
GGTTTGCTCA ACTTTCTTTT GGCAAAGTTG CACCAACTCG ACCCGGATGT CATCGTCGGT
CACAACATCG GCGGTTTCAA TCTCGAAGTG TTGCTTCGCC GATTTCAGGC GAACAAGATT
GGCCATTGGA GCCGCATCGG TCGCATGAAA CGTACGCGCA TGCCAAACAT CAACGGCTCT
GGAGGTGCTT ACGGAGGAGG TGCGTCCATG GGCGCGTTGC AGTGTCTCGC TGGAAGACTT
TTAGCGGATA CGTACTTGAG TGCGAAAGAT TTGTTGGGCA AGGAGGTGTC TTACACGCTT
ACGTCGCTCT CTGAAACGCA ACTTGGCGTA CGACGAGAAG AAGTGCCGAG CGCCGAAATC
CCCAACCGTT ATCAAGACAC GAACGCTCTC ATGCATTTGA TCAAGTGCAC GGAAATCGAC
GCAAAGTTGA GTTTGCATCT GATGTTTAAG ATGGAAGTCG TCCCGTTGAC GAAGCAGCTG
TCGAACATCG CCGGTAACCT TTGGAGTAAG ACGCTCGGGC ACACGCGCGC TCAGCGCGTC
GAGTACCTCT TGCTTCACGA ATTCCACAGT CGCAAGCACA TCGTCCCCGA TCGTTTAAGT
GCCAAGGAAC GTCGCCGCGT CGCCGCGGCG AGCGGTGAAG AAGAGGATGA TGGCGGTAAG
AAGGGTCCGT CTTACGCGGG TGGCCGGGTG CTTGAGCCGA AGAAGGGCCT GTACGACACC
TTTGTCCTCG TCCTCGATTA CCAGTCGTTG TACCCGTCGA TTATTCAAGA GTACAACATT
TGCTACACCA CGGTGCGGCG ACATTTCGAC GCGGGCGAAG AAAACACCGA AATTGAACTA
CCCGCGCCGA TCTTAAGCGA CAAGGACTTC GCCGTCCTGC CGAAAGTCAT CGCGAACATT
GTGCAAAGCC GTAGAGAAGT GAAAGGATTG ATGGCTCGCG AAAAAGATCC GGCGCGCGCA
AAGCAGTATG ATCTTCGCCA GCTGGCTCTC AAGCTCACCG CAAACTCCAT GTACGGTTGC
TTGGGTTTTA GTCAGTCGCG CTTTTTTGCC GAGCCCATTG CGGCGTTGAT CACGGCGCAA
GGTCGTAAGA TTCTTCAGCG CACGGAGGAT CTCGCGAAAG CGAAGTGCGA GCTGGACGTC
ATTTACGGCG ATACGGATTC CATTATGGTG AACACCAAGT CTCACGATTT GAATCATTCT
AGGGCGCTCG GAAACAAACT CATTCGTTTC GTCAACAAAG AATATCGGAA ATTGGTTTTG
GAGGAAGATT ACATCTTCCG ATCGATGTTG CTGTTGAAGA AGAAGAAGTA CGCCGCCATG
AAGGTTGTCA ACGGACCGAA TGGAACCAAG GCGACGAAGC TCGAGATGAA GGGTCTCGAT
ATCGTTCGTC GTGATTGGGC GCCGCTGGTG AAGGACGTTG GTAAACAAAC TCTCGAAGAA
CTTCTTGATG TGGATGGTGA ACGCGAAGAG CGCGTGAACG CGATTCACGA CGGTTTACGT
ACGATCCGAA AGGACATGGT TGAAAACCGC GTGCAGTTGT CCAAGTACAT CATCACGAAG
CAACTCACGA AGGCGGTTGA AGAGTACCCC GACGCAAAGC ATCAGCCGCA CGTCATGGTA
GCCAAACGTC GATTGGAGGC TGGTAAACAA GATGGCGTCA AGGCGGGAGA GACTGTGCCG
TACATCATCG CGCTGGAAAG CGAGCTGCCG CTCGAGGACA TCGCCGCGGG AAAAGCCGGC
GCCTCTGGTG GAAAGGGCTT GGCCGAGCGG GCCTATCATC CCGACGAAAT CCTCGAGAAG
GGTTTGAAGG TTGATTTGCA CTACTACTTG TCTCAGCAAG TCCATCCCGT GATAACTCGT
TTGTGCGCCC CGATTGAGGA AACCGACGGC GCCGCGATGG CGGAATGCCT CGGGTTGGAC
TCGAACAAGT TTAAAACGCA AACACGCGAT GAGGACGAGT ACGACGACAC GTTTGGCGGT
GGTAGATTCG CGTTGGATGA CGAAGAGCGC TTTGCCAAGT GTAAGCCTCT GAAGCTT
 
Protein sequence
DGSTPFFFMD AQEERESPGT VFLFGRVPVS SDPQSETISA CAVVQNMQRC MYIVPTASTF 
ADPDGEIAAQ KAKSDLMKKL VPLSGDLRAE VKEVLKARKI ENSKITIVRR RYCFERKDIP
QGPLFVLKVK IPATYAAFPS DIKGKHFVAA LGTQAPMLEL LTLKSKLKGP SWIALHGAAI
VPTEKQKSWC KLELTLPNAH KSVRPCLEAI ASRPAPRLTV ASLNLQTIVN QQTNVNEIAV
ASVQYIRDVN CEGSTTAAQL KTGLRHFTVV RKLDGLEMPP GWQNAVAHEN NTNVIAKRTG
SVVLAAQNNE LGLLNFLLAK LHQLDPDVIV GHNIGGFNLE VLLRRFQANK IGHWSRIGRM
KRTRMPNING SGGAYGGGAS MGALQCLAGR LLADTYLSAK DLLGKEVSYT LTSLSETQLG
VRREEVPSAE IPNRYQDTNA LMHLIKCTEI DAKLSLHLMF KMEVVPLTKQ LSNIAGNLWS
KTLGHTRAQR VEYLLLHEFH SRKHIVPDRL SAKERRRVAA ASGEEEDDGG KKGPSYAGGR
VLEPKKGLYD TFVLVLDYQS LYPSIIQEYN ICYTTVRRHF DAGEENTEIE LPAPILSDKD
FAVLPKVIAN IVQSRREVKG LMAREKDPAR AKQYDLRQLA LKLTANSMYG CLGFSQSRFF
AEPIAALITA QGRKILQRTE DLAKAKCELD VIYGDTDSIM VNTKSHDLNH SRALGNKLIR
FVNKEYRKLV LEEDYIFRSM LLLKKKKYAA MKVVNGPNGT KATKLEMKGL DIVRRDWAPL
VKDVGKQTLE ELLDVDGERE ERVNAIHDGL RTIRKDMVEN RVQLSKYIIT KQLTKAVEEY
PDAKHQPHVM VAKRRLEAGK QDGVKAGETV PYIIALESEL PLEDIAAGKA GASGGKGLAE
RAYHPDEILE KGLKVDLHYY LSQQVHPVIT RLCAPIEETD GAAMAECLGL DSNKFKTQTR
DEDEYDDTFG GGRFALDDEE RFAKCKPLKL