Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30367 |
Symbol | |
ID | 5001059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 70436 |
End bp | 72229 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416480 |
Product | predicted protein |
Protein accession | XP_001416548 |
Protein GI | 145344042 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00270274 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0884002 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAATG GTAAAGGGCG TTCGCCGCGC GCGCGCGCGG CGCGAGCATT GACGACGTCG TCGCACGCGC GCAGGCTCCT CGTGAAGCGC GCGTTCAACA GACGCGCGTT GGAGACGCAC CCAGACAAAG GAGGCGACGC GGACGCGTTC CAAGCGATGT ACGAGGCGTT TCAAGAGCTC AAACAAGCGT TTCTCGATGG CTTCTCGGAA TTCGCGAAGT GCATCGAATC GGCGCCGGCG AAGGCGAAGG CGAAGGCGAA GGCGAAACGA ACGATGGTAG ACGCGCCAGA ACTCATGGGT AAGCGACGGA CGATTCCACC GTACGAGTTC TTTCAAACCG CCGCTGAACG AGACATTCCG TTCTGGAAAA TTGAGTTGGC GGCGTCGGGC AGAGCTCGCT GCACCGGCTC GCGCAAGTCC CTCGGCAGCG TCGGTCGCTG TCGAAACACC AATGAAGGAT CGAAGGTGGA TCAGCCGAGA AGCGTGGGTC CCGGTCCCGG TCCCGGTCCC GGGTCATCGA GAGCGCTCGT CGCCGACGTG TTCGGAATCA TCAAGAAGAA CGCTGTGCGC GTGGGCTTAA TGGATCCAGA ATCTGGCGCG TACTGCCGAT TCGTTCATTT GCGATGTTGG CGTGTGCCTC AGGCGGTGTG GTACAATCTT CCGCAGTACC CCGACGATAC GACTGAGTAT AGCGTCGATG AGTTCAAAAG ACGCTTGGTC GATATGGACG GCGTCGTCTT CGTCGGGTTG AACAAACTTT GCGATGAAGA CATCACGCTC GTCGCCGAAC ACGCGATGAA TCAGGAGGCG TGGGCGGCGT TCAATCAGAA GAAATATGTC GCCGCTCGAG CTGCTGGCCT TCCGTCATCG CGGGTTCAAC TTCTTTCGAA AGACGAAATC AAAGTACTGC AGAAACTAAT GTCGAAGGCG ACGTCTTTAG TGCAAGAACC AGAGCAAGAC GAACCGCTCG CCGCGGTGAC GAAGCAGGAA TCAAAACCGG AACCAAAACG AACGGACCAC GATGAGTACG ACGTCGTGAC TGATGACGAC GAACACGAAG ATGGCTTGCA AGTGGCGCCG AATGTCGGCA CCGCGATCGC CGTGCAGAAA CCAAAAGAGG AGAAGCCACC ACCGAAGATT TTGATTCCCG TCCCTGGCGA AAACGGCGCA ATCGCCAACT CACTTCTTCG CGACAATTCC AAACCCATGA CGTTCGTGCT GACGGGAATT TTTCCGGATC TCGGAGGCGA TCGAATCGGT CTGATGCAAG GCAAAGACAT GGCGGAAAAG CTGATTCAGC GGTTTGGAGG CGTCGTGCGG AGCGCGGTTT CAGGCGCAAC TGATGTTTTG CTCGTCGGCG AAGAGCCAGG CGTTTCGAAA CTATCAGCTG CGCGCACGAA AGCGAACTGC AAAGTTCTCA ATCTCGAACA CATCCTCGAC ATGATTCACG GGCGCTCCAT TGAAGGGAAA CGGTTGCAAA TTGAAAGATT CAGCACAGGT TTTCATGGAA ATTCGTTAGC GTACGATATG ACGCGAGCCG AATTGGAAGA GCTACGGACG GGAGCCAAAG CGCTTCCTTC GACGATGAAA ACAGAGGAGC CACAACTTCT ACCATCGTCC AAGTCGACAA CGCGTAAGCG CAAAGCTCTT CCTTTGTCCA CGAAAGACGA AGCGCAAAAG AATTTGACGT TACCCGCACC TGATAAGCCG CCGCGAAATA AGAAGCCAAA ACGCTCGCCG AAAGCGCCCG TGCGAAGCTC GCGTCGTCTC GCCGCACTGA CGAACGCCGC GTGA
|
Protein sequence | MLNGKGRSPR ARAARALTTS SHARRLLVKR AFNRRALETH PDKGGDADAF QAMYEAFQEL KQAFLDGFSE FAKCIESAPA KAKAKAKAKR TMVDAPELMG KRRTIPPYEF FQTAAERDIP FWKIELAASG RARCTGSRKS LGSVGRCRNT NEGSKVDQPR SVGPGPGPGP GSSRALVADV FGIIKKNAVR VGLMDPESGA YCRFVHLRCW RVPQAVWYNL PQYPDDTTEY SVDEFKRRLV DMDGVVFVGL NKLCDEDITL VAEHAMNQEA WAAFNQKKYV AARAAGLPSS RVQLLSKDEI KVLQKLMSKA TSLVQEPEQD EPLAAVTKQE SKPEPKRTDH DEYDVVTDDD EHEDGLQVAP NVGTAIAVQK PKEEKPPPKI LIPVPGENGA IANSLLRDNS KPMTFVLTGI FPDLGGDRIG LMQGKDMAEK LIQRFGGVVR SAVSGATDVL LVGEEPGVSK LSAARTKANC KVLNLEHILD MIHGRSIEGK RLQIERFSTG FHGNSLAYDM TRAELEELRT GAKALPSTMK TEEPQLLPSS KSTTRKRKAL PLSTKDEAQK NLTLPAPDKP PRNKKPKRSP KAPVRSSRRL AALTNAA
|
| |