Gene Information |
Locus tag | OSTLU_17621 |
Symbol | |
ID | 5004776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 328612 |
End bp | 330087 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 50% |
IMG OID | 640420197 |
Product | predicted protein |
Protein accession | XP_001420818 |
Protein GI | 145352995 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3569] Topoisomerase IB |
TIGRFAM ID | |
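The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinate span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1  # the final codon encodes no amino acid
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(328612, 330087)
print(gene_len, protein_len)
```

Under these assumptions the result can be compared directly with the Gene Length and Protein Length fields of the record.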
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0110477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000723215 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGAAAGACT CTGACTACGC GACCAAACCG GTCTTCGTGA AAAACTTCAT GGATGGTTTC AAGGCGGCTT TGAAGAACGG CCCGCACGCG TTCATCACGG ACTTCTCCAA GTGTGACTTT ACGAAGATGT ACATGCACTT CTTGGCGATC AGAGAGAAGA AGAAGGAGAT GACGAGCGAA GAGAAGAAAC GCATCAAGGC GCAAAAAGAT ATCGATGAGG AACCTTACAC GTGGGCCACT ATCGACGGTC GCCGCGAAAA GGTTGGTAAC TTCCGCGTCG AACCGCCTGG CTTGTTCCGC GGGCGCGGCG AGCACCCGAA GATGGGGAAA ATCAAGCGTC GCGTCGTGCC GGAAGACATC ACCATCAATA TAGGAAAAGA TGCTAAGGTT CCCGAACCTC CGGCTGGACA CTCGTGGAAA GCGGTGATTC ATAACGACAC CGCCACGTGG TTGGCGGGAT GGAACGATGT CATCAACGTG AAGGACTGGA AATACGTCCA ATTCGGCGCC ACATCTACGG TCAAGGCAGA GAGCGATCAA AAGAAGTACG AAAAGGCTCG CTCGTTGCAT AAGTATATCG AAAAGATCCG GAAGGATTAC AAGAGAAACA TGATGAGTGA ATCCAAGGAA ATGGCTCAGT TAGCGGTGGC GACTTACTTG GTGGATAAAC TCGCCCTTCG TGCTGGTGGC GAGAAGGATG AAGATCTCGC CGACACCGTC GGCGTGTGTA CCTTGCGCGC CGGTCACATC AAGTTTATGG ATGATAACGT CATCGAGTTT GATTTCTTGG GTAAGGACTC CATCCAATAT CTTCAACAAC ACAAGATCGA TGAGGTTGCA TACAAGTGCT TGCAACGGTT CGTTCAGGGC AAGGGTCCGG ATGTGGACAT CTTCGACCAC GTCGACCCGC AAAAAGTAAA CGCGCATCTT CAAACTCTCA TGCCCGGTCT CACGATCAAG GTGTTCCGTA CGTACAACGC CTCTATTACT TTGGATCGTT TGCTGAAAGA CACGAAAAAG AATGATACGA CGCTTCAAAA GAAAGCAACT TACGACGCGG CGAACAAGGA GGTGGCGATT TTGTGTAATC ACCAGAAGGG TGTGAGCAAA GCGCACGACG CGCAAATGGA AAAACTCGCG GAAAAGAAGA AGGATCTCGC GAAGCAGGTG GCCGAGATGA AAAAGAAGAC GGACGATAAG AATCAAAAGA AGAAGCTCGC GGCGCTGAAA GAAAGGCAAA GCAAGTTGGC GATTCAAATG AACATGAAGG AAGAATTGAA AACGGTGTCG CTCGGCACGT CGAAAATCAA CTACCTCGAT CCCAGAATCA CGCTCTCTTG GTGCAAACGT CACGAAGTGC CGCCAAACGT GGTATTTACC AAGGCGTTGA TAGATAAGTT CCATTGGGCG ATGGATTGTG AGATGGAGTT CAGTTTCGTT CAAGAAGACG CCGTCGAAGT CAAGCCGGCC GAATAA
Protein sequence | MKDSDYATKP VFVKNFMDGF KAALKNGPHA FITDFSKCDF TKMYMHFLAI REKKKEMTSE EKKRIKAQKD IDEEPYTWAT IDGRREKVGN FRVEPPGLFR GRGEHPKMGK IKRRVVPEDI TINIGKDAKV PEPPAGHSWK AVIHNDTATW LAGWNDVINV KDWKYVQFGA TSTVKAESDQ KKYEKARSLH KYIEKIRKDY KRNMMSESKE MAQLAVATYL VDKLALRAGG EKDEDLADTV GVCTLRAGHI KFMDDNVIEF DFLGKDSIQY LQQHKIDEVA YKCLQRFVQG KGPDVDIFDH VDPQKVNAHL QTLMPGLTIK VFRTYNASIT LDRLLKDTKK NDTTLQKKAT YDAANKEVAI LCNHQKGVSK AHDAQMEKLA EKKKDLAKQV AEMKKKTDDK NQKKKLAALK ERQSKLAIQM NMKEELKTVS LGTSKINYLD PRITLSWCKR HEVPPNVVFT KALIDKFHWA MDCEMEFSFV QEDAVEVKPA E
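The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a field could be cleaned and sanity-checked before further analysis, the helpers below strip the grouping, compute the GC fraction, and test for a start and stop codon. All function names are hypothetical, and the stop codon set assumes the standard genetic code, since the Translation table field of this record is empty.

```python
STANDARD_STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length divisible by three, an ATG start, and a terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STANDARD_STOP_CODONS
    )


# Only the first three blocks of the gene sequence are used here for brevity.
prefix = clean_sequence("ATGAAAGACT CTGACTACGC GACCAAACCG")
print(len(prefix), gc_fraction(prefix))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow a comparison with the GC content and Gene Length fields of the record.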