Gene Information |
Locus tag | OSTLU_297 |
Symbol | |
ID | 5003311 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 97144 |
End bp | 100200 |
Gene Length | 3057 bp |
Protein Length | 990 aa |
Translation table | |
GC content | 55% |
IMG OID | 640418732 |
Product | predicted protein |
Protein accession | XP_001419281 |
Protein GI | 145349730 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | [TIGR00592] DNA polymerase (pol2) |
|
|
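The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed gene length). The helper name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp, end_bp = 97144, 100200
protein_aa = 990

length_bp = gene_length(start_bp, end_bp)
coding_bp = protein_aa * 3          # bases needed to encode 990 residues
non_coding_bp = length_bp - coding_bp

print(length_bp)      # 3057
print(coding_bp)      # 2970
print(non_coding_bp)  # 87
```

The 87 bp that the 990 codons do not account for fit the record's "Is gene spliced | Yes" entry, since a spliced gene spans more bases than its protein requires.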
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0400359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.313206 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GACGGTTCGA CGCCCTTCTT CTTCATGGAT GCGCAAGAAG AGCGCGAATC CCCCGGCACA GTGTTCCTCT TCGGTCGTGT GCCGGTGTCG AGCGATCCGC AGTCGGAGAC CATCAGTGCG TGTGCGGTGG TGCAGAACAT GCAAAGATGC ATGTACATCG TTCCGACCGC GTCTACGTTC GCTGATCCTG ATGGAGAGTT GGAAGCGCTG GGGCAAATGA TGGAGGAAAC GCGGCGCGAA TTCAAGTCGT GCGCGGATTC GGACGACAAG AAGGAGGAAA AGAGAATTGC GGCTCAAAAG GCGAAATCTG ATCTCATGAA AAAACTCGTG CCTTTGTCCG GTGACCTTCG CGCCGAAGTC AAAGAAGTCT TGAAGGCGCG AAAGATTGAG AACTCGAAGA TCACCATCGT CCGCCGACGC TATTGCTTTG AGCGCAAGGA CATTCCGCAA GGTCCGTTGT TCGTGCTCAA AGTGAAGATC CCGGCGACTT ACGCAGCGTT TCCGAGCGAT ATCAAGGGTA AGCACTTCGT CGCCGCTTTG GGCACGCAGG CTCCCATGTT GGAACTTCTC ACGCTCAAGT CTAAGCTCAA GGGCCCGAGT TGGATCGCGC TTCACGGCGC CGCCATCGTT CCGACGGAGA AGCAAAAGTC TTGGTGCAAA CTCGAGCTCA CACTGCCGAA CGCACACAAG AGTGTGCGCC CTTGTCTCGA AGCCATCGCA TCGCGCCCAG CGCCGCGTCT CACCGTTGCT TCGTTGAACC TGCAGACGAT CGTCAATCAG CAAACCAACG TGAACGAAAT CGCAGTCGCC TCGGTGCAGT ACATTCGCGA CGTAAACTGC GAAGGATCCA CAACGGCGGC GCAACTGAAG ACCGGTTTAC GTCATTTTAC CGTCGTGCGC AAGCTCGATG GACTGGAGAT GCCTCCGGGA TGGCAAAATG CAGTCGCGCA CGAGAACAAC ACCAATGTCA TCGCCAAGCG TACAGGCTCC GTCGTCCTCG CCGCGCAGAA CAACGAACTC GGTTTGCTCA ACTTTCTTTT GGCAAAGTTG CACCAACTCG ACCCGGATGT CATCGTCGGT CACAACATCG GCGGTTTCAA TCTCGAAGTG TTGCTTCGCC GATTTCAGGC GAACAAGATT GGCCATTGGA GCCGCATCGG TCGCATGAAA CGTACGCGCA TGCCAAACAT CAACGGCTCT GGAGGTGCTT ACGGAGGAGG TGCGTCCATG GGCGCGTTGC AGTGTCTCGC TGGAAGACTT TTAGCGGATA CGTACTTGAG TGCGAAAGAT TTGTTGGGCA AGGAGGTGTC TTACACGCTT ACGTCGCTCT CTGAAACGCA ACTTGGCGTA CGACGAGAAG AAGTGCCGAG CGCCGAAATC CCCAACCGTT ATCAAGACAC GAACGCTCTC ATGCATTTGA TCAAGTGCAC GGAAATCGAC GCAAAGTTGA GTTTGCATCT GATGTTTAAG ATGGAAGTCG TCCCGTTGAC GAAGCAGCTG TCGAACATCG CCGGTAACCT TTGGAGTAAG ACGCTCGGGC ACACGCGCGC TCAGCGCGTC GAGTACCTCT TGCTTCACGA ATTCCACAGT CGCAAGCACA TCGTCCCCGA TCGTTTAAGT GCCAAGGAAC GTCGCCGCGT CGCCGCGGCG AGCGGTGAAG AAGAGGATGA TGGCGGTAAG AAGGGTCCGT CTTACGCGGG TGGCCGGGTG CTTGAGCCGA AGAAGGGCCT GTACGACACC TTTGTCCTCG TCCTCGATTA CCAGTCGTTG TACCCGTCGA TTATTCAAGA GTACAACATT 
TGCTACACCA CGGTGCGGCG ACATTTCGAC GCGGGCGAAG AAAACACCGA AATTGAACTA CCCGCGCCGA TCTTAAGCGA CAAGGACTTC GCCGTCCTGC CGAAAGTCAT CGCGAACATT GTGCAAAGCC GTAGAGAAGT GAAAGGATTG ATGGCTCGCG AAAAAGATCC GGCGCGCGCA AAGCAGTATG ATCTTCGCCA GCTGGCTCTC AAGCTCACCG CAAACTCCAT GTACGGTTGC TTGGGTTTTA GTCAGTCGCG CTTTTTTGCC GAGCCCATTG CGGCGTTGAT CACGGCGCAA GGTCGTAAGA TTCTTCAGCG CACGGAGGAT CTCGCGAAAG CGAAGTGCGA GCTGGACGTC ATTTACGGCG ATACGGATTC CATTATGGTG AACACCAAGT CTCACGATTT GAATCATTCT AGGGCGCTCG GAAACAAACT CATTCGTTTC GTCAACAAAG AATATCGGAA ATTGGTTTTG GAGGAAGATT ACATCTTCCG ATCGATGTTG CTGTTGAAGA AGAAGAAGTA CGCCGCCATG AAGGTTGTCA ACGGACCGAA TGGAACCAAG GCGACGAAGC TCGAGATGAA GGGTCTCGAT ATCGTTCGTC GTGATTGGGC GCCGCTGGTG AAGGACGTTG GTAAACAAAC TCTCGAAGAA CTTCTTGATG TGGATGGTGA ACGCGAAGAG CGCGTGAACG CGATTCACGA CGGTTTACGT ACGATCCGAA AGGACATGGT TGAAAACCGC GTGCAGTTGT CCAAGTACAT CATCACGAAG CAACTCACGA AGGCGGTTGA AGAGTACCCC GACGCAAAGC ATCAGCCGCA CGTCATGGTA GCCAAACGTC GATTGGAGGC TGGTAAACAA GATGGCGTCA AGGCGGGAGA GACTGTGCCG TACATCATCG CGCTGGAAAG CGAGCTGCCG CTCGAGGACA TCGCCGCGGG AAAAGCCGGC GCCTCTGGTG GAAAGGGCTT GGCCGAGCGG GCCTATCATC CCGACGAAAT CCTCGAGAAG GGTTTGAAGG TTGATTTGCA CTACTACTTG TCTCAGCAAG TCCATCCCGT GATAACTCGT TTGTGCGCCC CGATTGAGGA AACCGACGGC GCCGCGATGG CGGAATGCCT CGGGTTGGAC TCGAACAAGT TTAAAACGCA AACACGCGAT GAGGACGAGT ACGACGACAC GTTTGGCGGT GGTAGATTCG CGTTGGATGA CGAAGAGCGC TTTGCCAAGT GTAAGCCTCT GAAGCTT
|
Protein sequence | DGSTPFFFMD AQEERESPGT VFLFGRVPVS SDPQSETISA CAVVQNMQRC MYIVPTASTF ADPDGEIAAQ KAKSDLMKKL VPLSGDLRAE VKEVLKARKI ENSKITIVRR RYCFERKDIP QGPLFVLKVK IPATYAAFPS DIKGKHFVAA LGTQAPMLEL LTLKSKLKGP SWIALHGAAI VPTEKQKSWC KLELTLPNAH KSVRPCLEAI ASRPAPRLTV ASLNLQTIVN QQTNVNEIAV ASVQYIRDVN CEGSTTAAQL KTGLRHFTVV RKLDGLEMPP GWQNAVAHEN NTNVIAKRTG SVVLAAQNNE LGLLNFLLAK LHQLDPDVIV GHNIGGFNLE VLLRRFQANK IGHWSRIGRM KRTRMPNING SGGAYGGGAS MGALQCLAGR LLADTYLSAK DLLGKEVSYT LTSLSETQLG VRREEVPSAE IPNRYQDTNA LMHLIKCTEI DAKLSLHLMF KMEVVPLTKQ LSNIAGNLWS KTLGHTRAQR VEYLLLHEFH SRKHIVPDRL SAKERRRVAA ASGEEEDDGG KKGPSYAGGR VLEPKKGLYD TFVLVLDYQS LYPSIIQEYN ICYTTVRRHF DAGEENTEIE LPAPILSDKD FAVLPKVIAN IVQSRREVKG LMAREKDPAR AKQYDLRQLA LKLTANSMYG CLGFSQSRFF AEPIAALITA QGRKILQRTE DLAKAKCELD VIYGDTDSIM VNTKSHDLNH SRALGNKLIR FVNKEYRKLV LEEDYIFRSM LLLKKKKYAA MKVVNGPNGT KATKLEMKGL DIVRRDWAPL VKDVGKQTLE ELLDVDGERE ERVNAIHDGL RTIRKDMVEN RVQLSKYIIT KQLTKAVEEY PDAKHQPHVM VAKRRLEAGK QDGVKAGETV PYIIALESEL PLEDIAAGKA GASGGKGLAE RAYHPDEILE KGLKVDLHYY LSQQVHPVIT RLCAPIEETD GAAMAECLGL DSNKFKTQTR DEDEYDDTFG GGRFALDDEE RFAKCKPLKL
|
| |
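As a quick consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), because the record's "Translation table" field is empty. It also assumes the displayed gene sequence is already in coding orientation, even though the strand is listed as "-". The `translate` helper is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record leaves the "Translation table" field blank.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and a trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence and first 10 residues of the protein,
# copied from the Sequence table.
gene_prefix = "GACGGTTCGA CGCCCTTCTT CTTCATGGAT"
protein_prefix = "DGSTPFFFMD"

print(translate(gene_prefix))                    # DGSTPFFFMD
print(translate(gene_prefix) == protein_prefix)  # True
```

An in-frame translation of the full gene sequence is not expected to match the protein end to end, because the record marks the gene as spliced.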