Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37445 |
Symbol | |
ID | 5001150 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 113310 |
End bp | 116225 |
Gene Length | 2916 bp |
Protein Length | 938 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416571 |
Product | predicted protein |
Protein accession | XP_001417148 |
Protein GI | 145345288 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00604] DNA repair helicase (rad3) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.258887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGCG ACGAAGATTG CGACCTGAGT CCGCGCGAGA TCGCGTCGCC GCGCGACGCC GACGACGAGG GCGATGATGT TCGCGTCGTG TGCATCGACG AAGGCGACGA CGCGACGCCG CGCGCGCGCG CGTCCGCGGA CGCGCCGGAC GCGACGACGT CGACGCGAGC ATTCCCAGGC GCGTCCACGG CGCCGGACGC GCGCGCGGCG CGGTTCAAAC CGGAACTGAG CGTCAAGTCG TACTTTAAGA AGATTGAACC AAACGACGCG ACGCGCGACG GCGACGCGAC GACGCCGGAG GACGCCTCGC CGAAGCGTCG GCGAGGGTTG GTGAAGATTG AATTCGAAAA GGACAGGCGC GAATTCACGA CGAAGGCGAT CGGGGGATGT AAGGTGAAGT TTCCGGAGGG ACTGAATCCA CATCCGGCGC AGACGATGAC GATGTCGAGC ATCATTCGAG CGTTGACGAA ACGCGAGCAC GCGATGATTG AGTCGCCCAC GGGGACGGGG AAAACGTTAG CGCTGTTGTG CGGGGCGTTG GCGTGGCAGG AACGAGAGGT GGCGCTTTCG ATGGAGAAGA ACAAGGGCTA TTGGTCGGAA AAGATGAAGT ATCAAACGGC GCGGAACGCG TACAAAGACG CGGTGGAGAA CGGGAAGACG CCGACGGTGG AGCGCGGCGT CGGTTGGGCC TCGAATGATC CGTATTTTCC GAATCCGCAC GGTTCGACGC TCCGACCGAA GATTTTCATC TGTTCGCGAA CGCATTCGCA GATTAATCAA ATTTTGCGTG AACTCAAGCG AACGGGGTAT TCGCCGCGGT ACAGCGTGCT GTCCTCGAGA CAACGGATGT GCCCGATGGA AAAGAACGAC GCGCAGTGCA AAGAATTACT CGGCACTGGA GTGGCTCAAC AGAGCGGTCG CACGGCGTGC GGGTTTTTCA ATCGACACAA GCACGTGTCG TCGAACATGG AGCGATACCC GAAGGCGGGC GAAGAAGGCA TGTTTCCGAG CGCGTGGGAC ATGGAAGACT TTGAGCGCGT CGTGAATGAA ATCGAAGGTT GTTCGTTTTA CGCGCTTCGC GAGATGGTGA AGACCGCGGA TTTGATCTTT TGCCCGTACA ATTATCTGTT TGACGTGAAC ATTCGACGAA AGATGAAGAT AGATCTGAAG GACGCCGCCG TCATCATCGA TGAAGGGCAC AATCTTGAAG ACGTGTGCAG AGAAGGTTCG TCGATTGAGT TTTCGCTCGA CACCATCGGA AAAGGTATGG ATGAGTTGTC GAAGACTTGG GGGAAAATCA ACAACGAAAC GAGTTTGATC GCCAGATTCT TTCGAGCGAT TTCCGCGTTC ATGGAGGGTC TTTTCTGCAC CAACACCGCG CCGACGACGG TGATTCAATC GAACCAGCTC GAGTACTTCG TAAACGACAT GCTGAAAACA TTCAACGCGG TGGGTGAGTA CACGCAGGAC ATCATAGACG TCGTAGAAAA GTTACTCACT GAGGACAGCA ATTTGATGTC GCCTATCATC GCGCCGCACA TTACGTTCGC CGGAGATCTA GCCGAAGTTT TGCAAAACGC CGTCAAGCAC GCGGCTGCGT ACAACATTTT CATCGGCTCG CGGGTTGAAA TCGACGGCGA TGAATGTCCA GGTATGGTGA TTCAGTGCAT GAAGCCGTCA GTCGCGTTTC ACGCCGTTGC GGAGAAAGCG CGCTCGGTAA TCATCACCTC TGGTACGTTG TCGCCGATGA ATACGTTCGA GGCGGAGCTC GCCGAGAAGT TTCCGACGAA AATCGAAGCG CCCCACGTGG TGCCAAACGA TCACGTGTAC GTCGAAGTGA CGAGCGCGAT TGGCGAGGTG ACGTACAAGG CGACAGATGG ACACGTCCAA GGACCGAAGT TCGCGAAAAA GTTGGGCGAA TACTTGTTGA AATACGCTCA AGTCATCCCG GGAGGGATGC TCGTGTTTCT TCCAAAGTAT AGTCTCATCG ACCGCGTCTT GCGCGAATGG CACGTGACTG GTTTGTTTGG GAAGATGAAC GATTACAAAC GCGTCGTCGT GGAAACGCGG GGCGCGCGAG GTTTCCAGGA CACCCTGAAC GAATTCAATC TCGGAAACAC GAACGGTAAA GGATCGTTGA TGCTTGCCGT GTACCGCGGT AAAGTGTCCG AAGGTATCGA TTTCAAAGAC GACAGCGCGC GGGCTGTGTT TTGCGTCGGC ATTCCGTTTC CGAGCGTCTA CGACATCAAG GTGAAGGCGA AGAAAGAATT TAACGATTTA CCCGTTTCTC GAGCGCAAGG CATGCTATCG GGCGGCGAGT GGTACCGCGC GCAGGCGTAC CGCGCGTACA ACCAAGCGCT CGGTCGTTGC ATTCGCCATC CGAAGGATTA CGCCGCGTTG TTCCTCGTCG ACTCGCGTTT TCGCGAAAGT CGTTGGATGC TGAACAACAT CTCTAAATGG ATTCGCAACA ACGCGCAGGC GTCCGACGAC GTCAATCAAA GCGTGCGAGT GGTGGATGGA TTCTTTAAGC GCCTTCGCGG CGCATCCGAC GGAAACGCGC CGGCGGGGTC GACGGCAAAG CACGAGGAAA CTCAACAAAA AGAAAATCTC CTCTGCGGCC TGGGGTGCGA GAAAAACGCC GCGCTCTGCG CCGCGCTCGA GAGACTCCAC GAAGCGAGTA TGCTTTCGCC CGAGGAGCCG ATGAAAGTGA ACGCGTATGC GAAAGCGTTG ACGTCCATTC GCGCGTTGAA GTACGAAGTC ACGAGTGGCG CGAAGATGAG TAAAGCGGGT CCCGACAAAG TGCAGCACGT CGGACCGTCC ATGGGCGCGC AAATCGATTT CTTTCTCAAG CACGGCGTCT TCGAACGAAT GGAGTATTAC GAGCGCAAAG AGCTTCCCCC GTCGTCGGCG AAGTAG
|
Protein sequence | MSRDEDCDLS PREIASPRDA DDEGDDVRVV CIDEGDDATP RARASADAPD ATTSTRAFPG ASTAPDARAA RFKPELSVKS YFKKIEPNDA TRDGDATTPE DASPKRRRGL VKIEFEKDRR EFTTKAIGGC KVKFPEGLNP HPAQTMTMSS IIRALTKREH AMIESPTGTG KTLALLCGAL AWQEREVALS MEKNKGYWSE KMKYQTARNA YKDAIFICSR THSQINQILR ELKRTGYSPR YSVLSSRQRM CPMEKNDAQC KELLGTGVAQ QSGRTACGFF NRHKHVSSNM ERYPKAGEEG MFPSAWDMED FERVVNEIEG CSFYALREMV KTADLIFCPY NYLFDVNIRR KMKIDLKDAA VIIDEGHNLE DVCREGSSIE FSLDTIGKGM DELSKTWGKI NNETSLIARF FRAISAFMEG LFCTNTAPTT VIQSNQLEYF VNDMLKTFNA VGEYTQDIID VVEKLLTEDS NLMSPIIAPH ITFAGDLAEV LQNAVKHAAA YNIFIGSRVE IDGDECPGMV IQCMKPSVAF HAVAEKARSV IITSGTLSPM NTFEAELAEK FPTKIEAPHV VPNDHVYVEV TSAIGEVTYK ATDGHVQGPK FAKKLGEYLL KYAQVIPGGM LVFLPKYSLI DRVLREWHVT GLFGKMNDYK RVVVETRGAR GFQDTLNEFN LGNTNGKGSL MLAVYRGKVS EGIDFKDDSA RAVFCVGIPF PSVYDIKVKA KKEFNDLPVS RAQGMLSGGE WYRAQAYRAY NQALGRCIRH PKDYAALFLV DSRFRESRWM LNNISKWIRN NAQASDDVNQ SVRVVDGFFK RLRGASDGNA PAGSTAKHEE TQQKENLLCG LGCEKNAALC AALERLHEAS MLSPEEPMKV NAYAKALTSI RALKYEVTSG AKMSKAGPDK VQHVGPSMGA QIDFFLKHGV FERMEYYERK ELPPSSAK
|
| |