Gene Information |
Locus tag | OSTLU_36189 |
Symbol | |
ID | 5000476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 124230 |
End bp | 127226 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | |
GC content | 62% |
IMG OID | 640415897 |
Product | predicted protein |
Protein accession | XP_001416316 |
Protein GI | 145343362 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.804187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGGTC GACGCGAGGG CGAAGAGGCG ACGAACGCGC GAGCGGCGCG CTTGCGGGCG AAGATCGCGA GCGATGCGTC GTTGGCGGCG ATACAGACGA AACGCGAACA GTTACCGGTG CGGGAGTTTA AGGATGCGAT ATTGAACGCG GTACGGGCGA ATCAAGTCGT GCTCGTCGCC GGGTCGACGG GTTGCGGGAA GACGACGCAG GTGCCGCAGT ACGTCTTGGA CGATGCGTGG GCGAACGGGC GCGGGGCGTC GATCGTGTGC ACGCAACCGC GAAGGATTAG CGCGATGACG GTTTCCGAGC GCATCGCGAA CGAGCGCGGG GAGAGCATCG GGCAGAGCAC GGTCGGTTAC CAGATTCGAT TGGAAAGCCG GGTCTCGGCG GATTGTTCGT TGTTGTTTTG CACGTCCGGC GTGCTGTTGC GACGACTCAC GAGCGAGGCG TCGGATAAGC TGTGCGAGTC ATTGACGCAT ATCATCATCG ACGAGCTGCA CGAGCGAGAT TTGTTTGCGG ATTTCCTAAC CATCATTTTG AAGGGCGTGA TTCCGAAGCA TCCGCACCTA AAGCTCGTGC TGATGTCGGC GACGATGCGC GAAGATTTGT TTAGCGAATA CTTTGGTGGG TGTCCGGTGA TTTCAGTGCC AGGTTATACG CATCCGGTGA ATGAGTATCA CCTGGAAGAT ATCTTGCCCA TGATCGGATG GGGCGGCGTG CATCACACGT CGAAGAAGGC GAGCGGAGGC GGCGGCGGCG AACCGAGAGT GCGCGCACCC ACTTCGGGCG CGAGCGTGGA CGTCATGCGC GAGGCAATCA TGCGAGCATT TTTAGAGGAC ACCGACGAGT CGTTCGATTG GCTCATGCAG TGCGCGCGCG AGACAGATTC TGCGAGCGGG TTGTCGCACG TAAACGTCGC GCACTCCACG GGCGCCACCG CGCTCATGGC GGCGGCGGGT AAGGGAAGAC AGATGGAAGT GTCGCAGCTT TTAGGTTTAG GAGCGTCGCC CGCGATGCGA AGCACCGACG GGAGCAACGC CGCGGATTGG GCGGATAAGT TTGGACACGT CGAGCTCGCG GACGCGTTGC GAAGCGTGGA CGACGAAAAC GAAGACGCGG GAAGTCACGA GCAGTCGGCG CTTCTATTGA GCGATTACCA GCTCTCCGTG GATCCAGACG AGGTGGACGT GGACTTAATC CACAATTTGA TCGTTTGGAT CATGAAAGAG CGCGCGATCG ACGAAGGATC CGAGGGCGCG ATTTTAGTCT TTTTGCCCGG CTGGGACGAA ATCTCCAAAC TTCGCGACTC GTTGACGGCG GATTACAACG TCTGTCACTC GGCGAGCGTC CTACCTTTGC ACTCCATGGT CGCCCCGGCG GATCAACGAA AAGTCTTCCA ACGTCCACCT AAAGGGTTGC GCAAAATCGT CCTCTCCACC AACATCGCGG AGACGGCGGT GACGATTGAC GACGTCGTCT TCGTCATCGA CAGCGGGCGG TTGAAGGAAA AGAGTTACGA CGCGTACTCT GCGGTCTCTA CGCTCCAGGC GGCTTGGATC TCGCAAGCGA GTGCGAAACA GCGACGCGGT CGCGCCGGTC GCGTGCGTCC CGGCGAGTGC TATCGCGTGT ACTCCACCTC ACGGTACGAC TCGTTCGCGC AGTACCAGTT GCCCGAGATG CAGCGGTCGC CGCTCGAGGA GCTGTGCTTG CAGGTGCGCG TGTTGGCCGA AAGCGGCGCG GGCGTCGTGG ACGATGGGCC GGGAAGCACG GCTGGGTTTC TCGCGCGCGC GGTCGAGCCC 
CCTGTGGCGC AAGCGACGGA CAATGCGGTG CAATTGCTCA AGGACATCGG CGCTTTGACG GAGGAGGAGC GCCTCACGCG ACTCGGCCGC CATCTCGGCG AGCTTCCGTT GCACCCGCGC GTGGGGAAGA TGATCTTGTA CGCCGCTCTG TTTGGCGTTC TCGATCCGAT TCTCACCGTC GCGTGCGCTG CGGCGTATCG TCCGCCCTTC ATCATCTCCG CCGACGGTCG AAAATCGGGC GACGCCAGTC GCGCGGCGTT TTCCAACGAA GCCGGCGGCG GGAGCGATCA CTTGGCGGTG ACCAAGGCGT ACATGGCGTG GGAGCAAGTT CAGCGCGATG GGCGTCAAAA TGAAAGGTAC TTTTTGAACG CGAATTCTTT GTCGCCGTCG ACGCTGCACA TGATCAAGGG CATGCGACAG CAATTAATCA CGGCGTTGAT TCAGCGCGGC ATCATTTCAG ATTTGCGAAG CGCGAGCGCA AACTCGTCAT CCGGCGCGCT TGTGCGCGCG GTGCTCGCCG TGGGCATGTA CCCTTTGGTG GGACGATTTT TACCAAAGTG CAAAGCGCCG ACGTTGGCAA CGCTTCGCGG CGAGCGCGTG CGCGTGCACG CGTTTAGCGT CAACGGCAAA CTCGACGTGA GCGCGCTCGG CGAGCTCAAC GAATCGGGTG AAAAAATTGC CACCTTGGCG TGCTTCGACG AACTCATTCG AGGCCCTCAC GCGGTGCAAG TGCGCGAGTG CACGTTGGTC GCCGCCGCGG CGATCGTGTT CGTGTGCTCC ACGCTCACGG TGAAACCAGA CGTGCCGCAA ATCGATCCCG AAACCGGCGA GGCGCGCGCG AGAGACGGTC CGCCGTCGGC GTTGTTGGTC GTGGACAATT GGTTGAGATT TCGCGTGCCC TTGCGCGCGG TGGCGCAGAT CACGGTATTG CGCTTACGTT TGCACAAAGC GTTCGCCATG CGCGTCGAGC GACCGAAAGA CGCGCTACCG GCGGATATGC GAGGCGCCGT GGACGCCATC GCGCGCGTGC TGAGCGACGC CGACGCCGCG TTCATCGAGT CCTCGAGTTT CGCTCGCAGT TTCGCAGGCT TCGGCGGCGG GCGCGGCGGC GGCGGGCGCG GCGATGGAGG TCGCGGCGGT CGCGCGCGCG GCGGTCGCGC GCGCGGCGGC GCGAGAATCC CGGCGCCTCG ACGATAG
|
Protein sequence | MRGRREGEEA TNARAARLRA KIASDASLAA IQTKREQLPV REFKDAILNA VRANQVVLVA GSTGCGKTTQ VPQYVLDDAW ANGRGASIVC TQPRRISAMT VSERIANERG ESIGQSTVGY QIRLESRVSA DCSLLFCTSG VLLRRLTSEA SDKLCESLTH IIIDELHERD LFADFLTIIL KGVIPKHPHL KLVLMSATMR EDLFSEYFGG CPVISVPGYT HPVNEYHLED ILPMIGWGGV HHTSKKASGG GGGEPRVRAP TSGASVDVMR EAIMRAFLED TDESFDWLMQ CARETDSASG LSHVNVAHST GATALMAAAG KGRQMEVSQL LGLGASPAMR STDGSNAADW ADKFGHVELA DALRSVDDEN EDAGSHEQSA LLLSDYQLSV DPDEVDVDLI HNLIVWIMKE RAIDEGSEGA ILVFLPGWDE ISKLRDSLTA DYNVCHSASV LPLHSMVAPA DQRKVFQRPP KGLRKIVLST NIAETAVTID DVVFVIDSGR LKEKSYDAYS AVSTLQAAWI SQASAKQRRG RAGRVRPGEC YRVYSTSRYD SFAQYQLPEM QRSPLEELCL QVRVLAESGA GVVDDGPGST AGFLARAVEP PVAQATDNAV QLLKDIGALT EEERLTRLGR HLGELPLHPR VGKMILYAAL FGVLDPILTV ACAAAYRPPF IISADGRKSG DASRAAFSNE AGGGSDHLAV TKAYMAWEQV QRDGRQNERY FLNANSLSPS TLHMIKGMRQ QLITALIQRG IISDLRSASA NSSSGALVRA VLAVGMYPLV GRFLPKCKAP TLATLRGERV RVHAFSVNGK LDVSALGELN ESGEKIATLA CFDELIRGPH AVQVRECTLV AAAAIVFVCS TLTVKPDVPQ IDPETGEARA RDGPPSALLV VDNWLRFRVP LRAVAQITVL RLRLHKAFAM RVERPKDALP ADMRGAVDAI ARVLSDADAA FIESSSFARS FAGFGGGRGG GGRGDGGRGG RARGGRARGG ARIPAPRR
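
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the record lists "Is gene spliced | No", and the gene sequence ends in TAG). The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above (Start bp, End bp).
length_bp = gene_length(124230, 127226)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values agree with the "Gene Length" (2997 bp) and "Protein Length" (998 aa) fields.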