Gene Information |
Locus tag | OSTLU_94724 |
Symbol | |
ID | 5003802 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 411356 |
End bp | 413296 |
Gene Length | 1941 bp |
Protein Length | 633 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419223 |
Product | predicted protein |
Protein accession | XP_001419800 |
Protein GI | 145350831 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5260] DNA polymerase sigma |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.127729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000571267 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATCAGCA CGCGGTTCGA GGGCGTGCGG GTGGCGCCGT TTGGGTCGTA CGTGAGCGCG TTTCACTCGG CGGGGAGCGA TATAGATATT TCGCTGCAGA TAGACAAGAA TGGACCGTGG TACGATGAAA AGGAAGAAGC GCAGGCGCGC CGATCGCAGC GGGGCGGGGT GCGCGCGCGT CGACAACAGC GCCAGGGTCG AACGAAACGC GCGCAATTGT TGCGCAAAGT CGCCTCGGAG TTGCGGTATC GCAATTACCG CGACGTCCAA CTCATCTCCA AGGCTCGGGT GCCTTTGATC AAGTTCAAAG ACCCGCAGAC GGGCGTCGCG TGCGATGTGT GCATCGAGAA CGACGGGGTG TACAAGAGCG CCGTGCTCGG CGTCGTCGCG GATATCGATC AACGTTATCG CGACTTGGTG TTCTTGATAA AGCTTTGGGC CAAGCATTAC GATGTGAACA ACGCCATGGA GGGATCGTTC AACTCGTACT CGTTGTGTTT GCTTTGTATG CATCATTTGC AGCGCCGACC AGTGCCAATT CTGCCGCCGA CGATGCTGCT CACGCTTCCT CGTCCCGATT TGGTGGAATC GGAAAAGCGC GAACTCGAGG AGCATTTGAA AAGTGAGGAC GACCAGTTCG ATACTTGGAA AGTTAGTAAA GCTCGCGTCG TGAGTGATGC ATCGAGGGAC ATCGCGGCGG TAAAGTACCG CGCCGATAAG TTCGCGGGTT TCGGAAAGGA AAACACCGAG ACGCTCGCGG AACTCTTTGT GAGCTTCTTC GCGCACTTGT GCGCCATCAA AGATTTGTTT CGGAACGCGG TGAACGCGTC CACGTATCAT GGTACGTTCA TCGTCGGTAG CTCTTGGCAA GCGTTCAAAT ATCCACTCGG TGTGGAAGAT CCGTTTGCCG CCGGCGACAA CGTCGCTCGA GCGGTTCAAA TGCGCACGAG AGATTACGTG TTGAACGCTT TTCCTGCGGC GTGTGCAGAT ATATCCAAGA TGCTGCACGC CACGGACAAC GTACAGTTCA TGCGCTCGTT ACTGTGCTTG CTCGGTGATA AGAGCGTACC ATCCGAAGTC TTGGCGCGTC TCAGACCGAC GCTCCCCGGC ATGGGCGGCG CGCCGCAGCC GCCGGGGTTG CCGGGAGCAC CGCGACCTCC TCAAGGCCCA CCTGTGATGC TTCAACAGCC AGCAAAGTCG CTCAATGAAC ACACGTTGGA TATGCTCGGC AGACAAGTCG CGCCAGGAGC GTCGGCGGAG GAGATTTTGG CGATGTTGAC GCGTCAACGA CAGGTGCAAG CGGAGGCGCA GCGAGACCAG TCGCAACCGA GCGAGCAGCA GATGTTGTTG CTGCGACGGC AGCAAGAGCT TTTGCGCATG GAACAAGCAA AAATCCAGCA GCACATGCAA CAGGGACAGC CGCCACCGCA GCCAGGTCGC ACGACGCAGA TCCCGGTGGC GAGTTTGTTT GGGCAGCCGC AGCAACAGCC GCAGCAACAG CGCGGCCTAC CGCCCGGTTT CGGTCCGACT TCGCAGCTGC AGATGCCACC ACCGCAGCCA CAGATGCCGA TGGCAACGGC GCTACCTTCG TTTGGCGCAC CCCCTCCGGC CAACGGTGGC TTTTCGAACG GCGGCCTCGG CGGCGGCGTC TTCTCGAGCA TCGCCTCCGG CGGCGGCGGC TTATTTTTCG ACGCCCCGGC GCGCCAATCC AATCCGCCGC CGCCGACCGC GCACGCCGTC GACGAAATCT CCCAGCATTT TGCCACCGGC ATGTCCATGT TCGGCTCAAA CGTCCACGCA 
CCTCCACCCG ATCTCGGCGT CCGCTCGCCG CCCGATCTCG CCGCCGGCGG TTCGCCGCCG GCGTCGCGCG TCGTCCCTCA GTCCGAGCTT CCTCGCACTC GCAGCGGCGT CGCCATCCCC AAACCGCGCA ACGCGCGTTA G
|
Protein sequence | MISTRFEGVR VAPFGSYVSA FHSAGSDIDI SLQIDKNGPW YDEKEEAQAR RSQRGGVRAR RQQRQGRTKR AQLLRKVASE LRYRNYRDVQ LISKARVPLI KFKDPQTGVA CDVCIENDGV YKSAVLGVVA DIDQRYRDLV FLIKLWAKHY DVNNAMEGSF NSYSLCLLCM HHLQRRPVPI LPPTMLLTLP RPDLVESEKR ELEEHLKSED DQFDTWKVSK ARVVSDASRD IAAENTETLA ELFVSFFAHL CAIKDLFRNA VNASTYHGTF IVGSSWQAFK YPLGVEDPFA AGDNVARAVQ MRTRDYVLNA FPAACADISK MLHATDNVQF MRSLLCLLGD KSVPSEVLAR LRPTLPGMGG APQPPGLPGA PRPPQGPPVM LQQPAKSLNE HTLDMLGRQV APGASAEEIL AMLTRQRQVQ AEAQRDQSQP SEQQMLLLRR QQELLRMEQA KIQQHMQQGQ PPPQPGRTTQ IPVASLFGQP QQQPQQQRGL PPGFGPTSQL QMPPPQPQMP MATALPSFGA PPPANGGFSN GGLGGGVFSS IASGGGGLFF DAPARQSNPP PPTAHAVDEI SQHFATGMSM FGSNVHAPPP DLGVRSPPDL AAGGSPPASR VVPQSELPRT RSGVAIPKPR NAR
|
| |
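The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the annotated span includes the stop codon (the gene sequence shown ends in TAG). The function names `gene_length` and `noncoding_bases` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 411356
END_BP = 413296
PROTEIN_LENGTH_AA = 633


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def noncoding_bases(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not accounted for by codons.

    Assumes the span includes one stop codon, hence protein_len + 1 codons.
    """
    coding_bases = (protein_len + 1) * 3
    return gene_len - coding_bases


length = gene_length(START_BP, END_BP)
print(length)                                      # 1941
print(noncoding_bases(length, PROTEIN_LENGTH_AA))  # 39
```

Under these assumptions the span works out to 1941 bp, matching the Gene Length field, and 39 bp are left over after accounting for 633 codons plus a stop codon. That remainder is consistent with the record marking the gene as spliced.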